Publications by Nguyen Bui

round 1 data crunching

26.08.2020

Load libraries suppressMessages(library(tidyverse)) suppressMessages(library(reticulate)) suppressMessages(library(lubridate)) suppressMessages(library(readxl)) suppressMessages(library(plotly)) suppressMessages(library(leaflet)) suppressMessages(library(leafpop)) suppressMessages(library(patchwork)) #set working directory setwd('C:/Us...

362 sym R (17674 sym/42 pcs) 4 img

m5_test

23.08.2020

setwd('C:/Users/DellPC/Desktop/Corner/Py_source_code/Project/Store Item Demand Forecasting') R packages suppressMessages(library(tidyverse)) suppressMessages(library(zoo)) # Set up for python environment in Rstudio library(reticulate) EDA + Prophet + MLP Neural Network Forecasting Execitove Summary: This tutorial consists of comprehensive ...

2147 sym R (21389 sym/63 pcs)

Radar plot

22.08.2020

library(reticulate) Loading Data and preparing it for plotting import numpy as np import pandas as pd import os import re os.chdir('C:/Users/DellPC/Desktop/Corner/Py_source_code/Project/kaggle-survey-2019') kaggle_multiple_choice = pd.read_csv('multiple_choice_responses.csv', low_memory = False) kaggle_multiple_choice.head(3) ## ...

231 sym R (8680 sym/36 pcs)

NFL ATTENDANCE

01.08.2020

FIRST PROJECT IN TIDYMODELS - NFL ATTANDANCE This week I started my new job as a software engineer at Rstudio, working with Max Kuhn and other folks on tidymodels. I am really excited about tidymodels because my own experience as a practicing data scientist has shown me some of the areas for growth that still exist in open source software when ...

5911 sym R (10643 sym/25 pcs) 5 img

Bootstrap resampling tidy models

31.07.2020

BOOTSTRAP RESAMPLING WITH TIDYTUESDAY BEER PRODUCTION DATA I’ve been publishing screencasts demonstrating how to use the tidymodels framwork, from first steps in modeling to how to tune more complex models. Today, I’m using this week;s TidyTuesday dataset on beer production to show how to use bootstrap resampling t oestimate model parameter...

2103 sym R (9422 sym/32 pcs) 4 img

Get started with tidymodels and TidyTuesday Palmer Penguins

29.07.2020

GET STARTED WITH TIDYMODELS AND TIDYTUESDAY PALMER PENGUINS Lately I’ve been publishing screncasts demonstrating how to use tidymodels framework, from first steps in modeling to how to evaluate complex models. Today’s screencast is good for folks just getting started with tidymodels, using this week’s TidyTueday dataset on penguins Expl...

3416 sym R (9178 sym/38 pcs) 3 img

Lasso Regression with The Office

27.07.2020

I’ve been publishing screencasts demonstrating how to use the tidymodels framework from first steps in modeling to how to tune more complex models. Today, I’m using this week’s TidyTuesday dataset on the Office to show how to build lasso regression model and choose regularization parameters! Explore the data Our modeling goal here is to p...

4449 sym R (13434 sym/41 pcs) 3 img

MULTINOMIAL CLASSIFICATION WITH VOLCANO ERUPTIONS

25.07.2020

MULTINOMIAL CLASSIFICATION WITH TIDYMODELS AND TIDYTUESDAY VOLCANO ERUPTIONS Lately I’ve been publishing screencasts demonstrating how to use the tidymodels framework, from first steps in modeling to how to evaluate complex models. Today’s screencast demonstrates how to implement multiclass or multinomial classification using with this week...

4446 sym R (10916 sym/43 pcs) 3 img

Bagging Model

23.07.2020

BAGGING WITH TIDYMODELS AND TIDYTUESDAY ASTRONAUT MISSIONS Lately I’ve been publishing screencasts demonstrating how to use the tidymodels framework, from first steps in modeling to how to evaluate complex models. Today’s screencast focuses on bagging using this week’s TidyTueday dataset on astronaut missions Here is the code I used in th...

3618 sym R (11351 sym/34 pcs) 3 img

XGB tutorial

23.07.2020

TUNE XGBOOST WITH TIDYMODELS AND TIDYTUESDAY BEACH VOLLEYBAL Lately I’ve been publishing screencasts demonstrating how to use the tidymodels framwork, starting from just getting started. Today’s screencast explores a more advanced topic in how to tune an XGBoost classification model using with this week’s tidytuesday dataset on beach vol...

1828 sym R (3085 sym/8 pcs) 1 img