Publications by Alex Yarosh
Reshaping Data with pandas
Course Description Often data is in a human-readable format, but it’s not suitable for data analysis. This is where pandas can help—it’s a powerful tool for reshaping DataFrames into different formats. In this course, you’ll grow your data scientist and analyst skills as you learn how to wrangle string columns and nested data contained in a...
36215 sym Python (64835 sym/176 pcs) 1 img
Python Data Science Toolbox (Part 2)
Course Description In this second Python Data Science Toolbox course, you’ll continue to build your Python data science skills. First, you’ll learn about iterators, objects you have already encountered in the context of for loops. You’ll then learn about list comprehensions, which are extremely handy tools for all data scientists working in P...
40917 sym Python (14537 sym/36 pcs) 1 img
Python Data Science Toolbox (Part 1)
Course Description It’s time to push forward and develop your Python chops even further. There are tons of fantastic functions in Python and its library ecosystem. However, as a data scientist, you’ll constantly need to write your own functions to solve problems that are dictated by your data. You will learn the art of function writing in this ...
36883 sym Python (15315 sym/35 pcs) 1 img
Intermediate Data Visualization with ggplot2
Course Description This ggplot2 course builds on your knowledge from the introductory course to produce meaningful explanatory plots. Statistics will be calculated on the fly and you’ll see how Coordinates and Facets aid in communication. You’ll also explore details of data visualization best practices with ggplot2 to help make sure you have a ...
32590 sym R (19345 sym/36 pcs) 1 img
Introduction to NumPy
Course Description NumPy is an essential Python library. TensorFlow and scikit-learn use NumPy arrays as inputs, and pandas and Matplotlib are built on top of NumPy. In this Introduction to NumPy course, you’ll become a master wrangler of NumPy’s core object: arrays! Using data from New York City’s tree census, you’ll create, sort, filter, ...
30199 sym Python (16832 sym/42 pcs) 1 img
Introduction to Data Visualization with ggplot2
Course Description The ability to produce meaningful and beautiful data visualizations is an essential part of your skill set as a data scientist. This course, the first R data visualization tutorial in the series, introduces you to the principles of good visualizations and the grammar of graphics plotting concepts implemented in the ggplot2 packag...
30084 sym R (21532 sym/40 pcs) 1 img 1 tbl
Introduction to Data Visualization with Seaborn
Course Description Seaborn is a powerful Python library that makes it easy to create informative and attractive data visualizations. This 4-hour course provides an introduction to how you can use Seaborn to create a variety of plots, including scatter plots, count plots, bar plots, and box plots, and how you can customize your visualizations. You�...
30840 sym Python (15185 sym/30 pcs) 1 img
Introduction to Statistics in R
Course Description Statistics is the study of how to collect, analyze, and draw conclusions from data. It’s a hugely valuable tool that you can use to bring the future into focus and infer the answer to tons of questions. For example, what is the likelihood of someone purchasing your product, how many calls will your support team receive, and how...
34123 sym R (11766 sym/24 pcs) 7 img
Introduction to Data Visualization with Matplotlib
Course Description Visualizing data in plots and figures exposes the underlying patterns in the data and provides insights. Good visualizations also help you communicate your data to others, and are useful to data analysts and other consumers of the data. In this course, you will learn how to use Matplotlib, a powerful Python data visualization lib...
32051 sym Python (11746 sym/30 pcs) 1 img
Joining Data with dplyr
Course Description Often in data science, you’ll encounter fascinating data that is spread across multiple tables. This course will teach you the skills you’ll need to join multiple tables together to analyze them in combination. You’ll practice your skills using a fun dataset about LEGOs from the Rebrickable website. The dataset contains inf...
25361 sym R (16916 sym/47 pcs) 1 img