Publications by JOURNEYOFANALYTICS
How to raise money on Kickstarter – extensive EDA and prediction tutorial
In this tutorial, we will explore the characterisitcs of projects on Kickstarter and try to understand what separates the winners from the projects that failed to reach their funding goals. Qs for Exploratory Analysis: We will start our analysis with the aim of answering the following questions: How many projects were successful on Kickstarter, ...
7118 sym R (8018 sym/32 pcs) 26 img
Automated Email Reports with R
R is an amazing tool to perform advanced statistical analysis and create stunning visualizations. However, data scientists and analytics practitioners do not work in silos, so these analysis have to be copied and emailed to senior managers and partners teams. Cut-copy-paste sounds great, but if it is a daily or periodic task, it is more useful ...
8587 sym R (71 sym/1 pcs) 2 img
India vs US – Kaggle Users & Data Scientists
Introduction This is an analysis of the Kaggle 2018 survey dataset. In my analysis I am trying to understand the similarities and differences between men and women users from US and India, since these are the two biggest segments of the respondent population. The number of respondents who chose someting other than Male/Female is quite low, so I e...
7086 sym R (375 sym/1 pcs) 18 img
Data Science Job in 90 days – Book Review
Are you an R-programmer or Datascience enthusiast looking for a break in the datascience field? If so, my latest book “Data Science Jobs – land a lucrative job in 90 days” will help you find one quickly. [Author’s note – The ebook is FREE ONLY until midnight this Sunday (May 26th). So hurry and grab your copy today.] As an analytics ...
3399 sym 2 img
How to Become a Data Scientist
This question and its variations are the most searched topics on Google. As a practicing datascience professional, and manager to boot, dozens of people ask me this question every week. This post is my honest and detailed answer. Step 1 – Coding & ML skills You need to master programming in either R or Python. If you don’t know which to pi...
5366 sym 4 img
Email Automation for Google Trends
This blogpost will teach you set up automated email reports to view how search volumes i.e. Google Trends vary over time. Email automation for Google Trends over time The email report will also include important search terms that are “rising” or near a “breakout point”. This can be really useful as the breakout keywords indicate users ac...
7455 sym 8 img
Mapping Anthony Bourdain’s Travels
Travel maps tutorial Anthony Bourdain was an amazing personality – chef, author, world traveler, TV showhost. I loved his shows as much for the exotic locations as for the yummilicious local cuisine. So I was delighted to find a dataset that included all travel location data, from all episodes of his 3 hit TV shows. Dataset attributed to Christ...
5014 sym 10 img
DataScience Portfolio Ideas for Students & Beginners
A lot has been written on the importance of a portfolio if you are looking for a DataScience role. Ideally, you should document your learning journey so that you can reuse code, write well-documented code and also improve your data storytelling skills. DataScience Portfolio Ideas However, most students and beginners get stumped on what to include...
9083 sym 8 img
Social Network Visualization with R
In this month’s we are going to look at data analysis and visualization of social networks using R programming. Social Networks – Data Visualization Friendster Networks Mapping Friendster was a yesteryear social media network, something akin to Facebook. I’ve never used it but it is one of those easily available datasets where you have a l...
4136 sym R (527 sym/1 pcs) 8 img
Social Network Visualization with R
In this month’s we are going to look at data analysis and visualization of social networks using R programming. Social Networks – Data Visualization Friendster Networks Mapping Friendster was a yesteryear social media network, something akin to Facebook. I’ve never used it but it is one of those easily available datasets where you have a l...
4136 sym R (527 sym/1 pcs) 8 img