Publications by anu - Journey of Analytics Team

Sberbank Machine Learning Series – Post 2 – Mind maps & Hypothesis

29.05.2017

This is the second post of the Sberbank Russia housing dataset analysis, where we will narrow down the variables of interest and create a roadmap to understand which factors significantly impact the target variable (price_doc). You can read the introductory first post here. Analysis Roadmap: This Kaggle dataset has ~290 variables, so having a clear ...

7494 sym 2 img

Monte Carlo Simulations in R

01.08.2017

In today’s tutorial, we are going to learn how to implement Monte Carlo Simulations in R. Logic behind Monte Carlo: Monte Carlo simulation (also known as the Monte Carlo Method) is a statistical technique that uses repeated random sampling to estimate the range of possible outcomes of an uncertain event. This makes it extremely helpful in risk assessm...

5230 sym R (118 sym/1 pcs) 2 img

Top US Cities with Highest Rent

15.08.2017

In this post, we will use the Zillow rent dataset to perform exploratory and inferential statistics. Our main goal is to identify the most expensive cities for real estate in the US. Input Files: The Kaggle dataset contains two files with rental prices for 13,000+ cities across the time frame Nov 2010 – Jan 2017. One file contains values for rent, th...

6224 sym R (1804 sym/4 pcs) 20 img

Who wants to work at Google?

16.01.2018

In this tutorial, we will explore the open roles at Google and see which common attributes Google looks for in future employees. This dataset comes from the Kaggle site and contains text information about job location, title, department, minimum and preferred qualifications, and the responsibilities of the position. Using this datas...

4966 sym 20 img

Top 10 Most Valuable Data Science Skills in 2020

25.01.2020

The first month of the new decade is almost at an end. It’s also “job-hunting” time when students start looking for internships and employees think about switching roles and companies, in search of better salaries and opportunities. If you fall into one of these categories, then here are the Top 10 skills your resume absolutely needs to inc...

9025 sym 8 img
