Publications by Rstats on Julia Silge

Practice using lubridate… THEATRICALLY

25.08.2019

I am so pleased to now be an RStudio-certified tidyverse trainer! I have been teaching technical content for decades, whether in a university classroom, developing online courses, or leading workshops, but I still found this program valuable for my own professional development. I learned a lot that is going to make my teaching better, and I a...

7295 sym R (6092 sym/16 pcs) 12 img

(Re)Launching my supervised machine learning course

22.09.2019

Today I am happy to announce a new(-ish), free, online, interactive course that I have developed, Supervised Machine Learning: Case Studies in R! Supervised machine learning in R: Predictive modeling, or supervised machine learning, is a powerful tool for using data to make predictions about the world around us. Once you understand the basic...

3482 sym 2 img

Opioid prescribing habits in Texas

11.10.2019

A paper I worked on was just published in a medical journal. This is quite an odd thing for me to be able to say, given my academic background and the career path I have had, but there you go! The first author of this paper is a long-time friend of mine working in anesthesiology and pain management, and he obtained data from the Texas Prescriptio...

8368 sym R (9481 sym/11 pcs) 8 img 2 tbl

Modeling salary and gender in the tech industry

30.12.2019

One of the biggest projects I have worked on over the past several years is the Stack Overflow Developer Survey, and one of the most distinctive aspects of this survey is the extensive salary data that is collected. This salary data is used to power the Stack Overflow Salary Calculator, and has been used by various folks to explore how people who use ...

15348 sym R (25187 sym/17 pcs) 10 img 4 tbl

#TidyTuesday and tidymodels

04.02.2020

This week I started my new job as a software engineer at RStudio, working with Max Kuhn and other folks on tidymodels. I am really excited about tidymodels because my own experience as a practicing data scientist has shown me some of the areas for growth that still exist in open source software when it comes to modeling and machine learning. Almo...

6682 sym R (9108 sym/13 pcs) 10 img

#TidyTuesday hotel bookings and recipes

10.02.2020

Last week I published my first screencast showing how to use the tidymodels framework for machine learning and modeling in R. Today, I’m using this week’s #TidyTuesday dataset on hotel bookings to show how to use one of the tidymodels packages, recipes, with some simple models! Here is the code I used in the video, for those who prefer reading...

6123 sym R (12326 sym/12 pcs) 10 img

Hyperparameter tuning and #TidyTuesday food consumption

17.02.2020

Last week I published a screencast demonstrating how to use the tidymodels framework and specifically the recipes package. Today, I’m using this week’s #TidyTuesday dataset on food consumption around the world to show hyperparameter tuning! Here is the code I used in the video, for those who prefer reading instead of or in addition to video....

2813 sym R (6424 sym/7 pcs) 2 img

Preprocessing and resampling using #TidyTuesday college data

09.03.2020

I’ve been publishing screencasts demonstrating how to use the tidymodels framework, from first getting started to how to tune machine learning models. Today, I’m using this week’s #TidyTuesday dataset on college tuition and diversity at US colleges to show some data preprocessing steps and how to use resampling! Here is the code I used in ...

4728 sym R (11608 sym/13 pcs) 6 img

LASSO regression using tidymodels and #TidyTuesday data for The Office

16.03.2020

I’ve been publishing screencasts demonstrating how to use the tidymodels framework, from first steps in modeling to how to tune more complex models. Today, I’m using this week’s #TidyTuesday dataset on The Office to show how to build a LASSO regression model and choose regularization parameters! Here is the code I used in the video, for th...

5217 sym R (10251 sym/15 pcs) 6 img

Tuning random forest hyperparameters with #TidyTuesday trees data

25.03.2020

I’ve been publishing screencasts demonstrating how to use the tidymodels framework, from first steps in modeling to how to tune more complex models. Today, I’m using a #TidyTuesday dataset from earlier this year on trees around San Francisco to show how to tune the hyperparameters of a random forest model and then use the final best model. H...

5387 sym R (5838 sym/17 pcs) 12 img