Publications by Nguyen Bui
EXPLORE DATASET USING LUBRIDATE PACKAGE
PRACTICE USING LUBRIDATE… THEATRICALLY Read in the London Stage Database Learn more about the London Stage Database, including about the data provenance and code used to build the database. Briefly, it explores the theater scene in London from when playhouses were reopened in 1660 after the English Civil War to the end of the 18th century. T...
4431 sym R (1160067 sym/38 pcs) 6 img
Women's Basketball Tournament
Modeling NCAA women’s basketball tournament seeds Lately, I’ve been publishing screencasts demonstrating how to use the tidymodels framework, from starting out with first modeling steps to tuning more complex models. Today’s screencast walks through how to tune and choose hyperparameters using this week’s dataset on NCAA women’s basketball tou...
3734 sym R (11843 sym/50 pcs) 9 img
tidymodels chap8 chap9
A model workflow In the previous two chapters, we discussed the recipes and parsnip packages. These packages can be used to prepare the data for analysis and to fit the model. This chapter introduces a new object called a model workflow. The purpose of this object is to encapsulate the major pieces of the modeling process. The workflow is imp...
17143 sym R (13300 sym/81 pcs) 5 img
tidymodels chap1 chap2
Types of Models Before proceeding, let’s describe a taxonomy for types of models, grouped by purpose. While not exhaustive, most models fall into at least one of these categories: Descriptive Models The purpose of a descriptive model is to describe or illustrate characteristics of some data. The analysis might have no other purpose than to visual...
4549 sym 2 img
tidymodels chap7
Fitting models with parsnip The parsnip package provides a fluent and standardized interface for a variety of different models. In this chapter, we both give some motivation for why a common interface is beneficial and show how to use the package. Create a model Once the data have been encoded in a format ready for a modeling algorithm, such...
5847 sym R (7218 sym/50 pcs)
tidymodels chap6
6. Feature engineering with recipes Feature engineering encompasses activities that reformat predictor values to make them easier for a model to use effectively. This includes transforming and encoding the data to best represent their important characteristics. There are many other examples of preprocessing to build better features for mode...
12584 sym R (13007 sym/38 pcs) 4 img
tidymodels chap4 chap5
The Ames housing data The Ames housing data set is an excellent resource for learning about models that we will use throughout this book. It contains data on 2930 properties in Ames, Iowa, including columns related to house characteristics (bedrooms, garage, fireplace, pool, porch, etc.), location (neighborhood), lot information (zoning, shape, ...
8602 sym R (2200 sym/21 pcs) 3 img
Survey
nielsen_round1 Nguyen_LSCM — 9/23/2020 Survey Qualitative Q1 Q2 Q3 Q4 Q5 Q6 Q7 Q8 Q9 Q10 Q11 Q12 ...
207 sym 12 img
Predicting class member
Predicting class membership for the TidyTuesday Datasaurus Dozen Explore the data The Datasaurus Dozen dataset is a collection of 13 sets of x/y data that have very similar summary statistics but look very different when plotted. Our modeling goal is to predict which member of the ‘dozen’ each point belongs to. Let’s start by reading in t...
1948 sym R (9244 sym/34 pcs) 4 img
GHTK report
Analytics Job in Ecommerce Nguyen — March 03 2021 #Business Job Description Row {data-height=500} Bus_Des biz_jd Bus_2words Row {data-height=500} Data_Des data_jd Data_2words Job Requirement Row {data-height=350} Bus_req biz_jd Bus_2words Row {data-height=500} Data_req data_jd Data_2words Benefits Row {data-height=...
607 sym 10 img