Publications by numbr crunch - Blog

IDEAS for BloGGING tOPICS

03.12.2014

Review of the University of Washington Data Science CertificationInitial analysis of the Titanic Data Set from KaggleCriteo Labs Ad-Click data Kaggle Competition experienceOnline sources for datasetsWhat data tools stack I am using and whyHow to use Github Related To leave a comment for the author, please follow the link and comment on their bl...

658 sym

The Good oL’ Titanic Kaggle Competition pt. 1

05.12.2014

After that I began playing around with logistic regression. So far, none of my attempts at logistic regression have improved my score but I have some ideas for tomorrow (already reached my submission limit for today). I do realize now that I need to have a plan with my logistic regression models, I need to determine which features have the best ...

4010 sym 2 img

CLassification Options for Titanic Competition

10.12.2014

Logistic RegressionNeural NetsNaive BayesDecision Trees Related To leave a comment for the author, please follow the link and comment on their blog: numbr crunch - Blog. R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-scienc...

458 sym

Titanic Kaggle competition pt. 2

01.01.2015

Logistic Regression Continued I’m finally getting back to tackling the Titanic competition. In my last entry, I had started with some basic models (only females live, only 1st and 2nd class females live, etc), and then moved onto logistic regression. My logistic regression model at the time was not performing that well but I was also only usin...

5103 sym 6 img

Creating WEB APPS WITH SHINY – REACTIVE GRAPH OF AVERAGE US HOUSEHOLD INCOME BY RACE

13.01.2015

I finally finished my first Shiny application! You can click this link to check it out. Below is a screenshot of what it looks like. I got all the data from the United States Census Bureau website. What the app does is provide a comparison of average income by race for different income groups between 1972 and 2013. If that doesn’t make mu...

3191 sym 2 img

REVIEW OF THE UNIVERSITY OF WASHINGTON DATA SCIENCE CERTIFICATE PROGRAM

16.01.2015

When I was looking for Data Science certificate programs back in 2013, there were only a few available and most had only graduated one or two cohorts. Even worse, I could not find a single review for any of them. So, this is my review of the University of Washington Data Science certificate. Background: I ended up choosing the University of...

4378 sym 2 img

HOW TO HOST YOUR SHINY APP ON AMAZON EC2 FOR MAC OSX

26.01.2015

In a previous post I had mentioned that I would provide a hosting guide for Shiny applications so other people wouldn’t have to suffer through all the same issues I did when trying to do this. Just for reference, I am running OSX Yosemite. For the SSH client you’ll be using Terminal, for the SFTP client you’ll be using Cyberduck. Much of t...

5830 sym 10 img

using the httr package to retrieve data from apis in R

20.02.2015

For a project I’m working on, I needed to access residential electricity rates and associated coordinate information (lat/long) for locations in the US. After a little searching, I found that data.gov offers the rate information in two forms: a static list of approximate rates by region and an API, which returns annual average utility rates ($/...

5922 sym