Publications by Nguyen Bui

Barcelona

28.03.2021

Introduction Hi! In this kernel we are going to analize some important topics about Barcelona: population, culture, transport, environment, etc. As a citizen and lover of this city, I would like to introduce it to the rest of Kaggle users. We are going to use data from the Open Data BCN platform: Open Data BCN, a project that was born in 2010, im...

4879 sym R (31646 sym/86 pcs) 6 img 33 tbl

Bootstrap confidence intervals for Super Bowl commercials

14.03.2021

Explore the data Our modeling goal is to estimate how the characteristics of Super Bowl commercials have changed over time. There aren’t a lot of observations in this dataset. And thí í an approach that can be used for robust estimates in such situations. Let’s start by reading in the data skimr::skim(youtube) Data summary Name youtube ...

1581 sym R (5576 sym/15 pcs) 3 img 5 tbl

Correlation and regression fundamentals with tidy data principles

11.01.2022

Correlation and regression fundamentals with tidy data principles LEARNING OBJECTIVE Analyze the results of correlationship tests and simple regression models for many data sets at once INTRODUCTION This article only requires the tidymodels package. While the tidymodels package broom is useful for summarizing the result of a single analysis in a...

3011 sym R (22539 sym/41 pcs) 1 img

decision tree model

28.08.2021

suppressMessages(library(tidyverse)) suppressMessages(library(tidymodels)) scooby_raw <- read_csv("https://raw.githubusercontent.com/rfordatascience/tidytuesday/master/data/2021/2021-07-13/scoobydoo.csv") scooby_raw %>% filter(monster_amount > 0) %>% count(monster_real) ## # A tibble: 2 x 2 ## monster_real n ## <chr> <...

27 sym R (6433 sym/18 pcs) 4 img

Bootstrap resampling

13.08.2021

suppressMessages(library(tidyverse)) ## Warning: package 'tidyverse' was built under R version 4.0.5 ## Warning: package 'ggplot2' was built under R version 4.0.5 ## Warning: package 'tibble' was built under R version 4.0.4 ## Warning: package 'tidyr' was built under R version 4.0.4 ## Warning: package 'readr' was built under R version 4.0.5 ## W...

47 sym R (6624 sym/38 pcs) 4 img

Stats Inference

12.08.2021

Load packages ## Warning: package 'tidyverse' was built under R version 4.0.5 ## Warning: package 'ggplot2' was built under R version 4.0.5 ## Warning: package 'tibble' was built under R version 4.0.4 ## Warning: package 'tidyr' was built under R version 4.0.4 ## Warning: package 'readr' was built under R version 4.0.5 ## Warning: package 'dplyr'...

1437 sym R (26451 sym/143 pcs) 24 img 10 tbl

tidymodels nfl season

23.08.2021

suppressMessages(library(tidyverse)) suppressMessages(library(tidymodels)) suppressMessages(library(readxl)) attendance <- read_csv("https://raw.githubusercontent.com/rfordatascience/tidytuesday/master/data/2020/2020-02-04/attendance.csv") standings <- read_csv("https://raw.githubusercontent.com/rfordatascience/tidytuesday/master/data/2020/2...

28 sym R (7628 sym/18 pcs) 5 img

ML project

05.09.2021

setwd("C:/Users/DellPC/Desktop/Corner/R_source_code/Julia_Silge") #Manipulation library(tidyverse) #Create models library(tidymodels) #Feature importance library(vip) library(lime) #Visualization library(RColorBrewer) library(scales) library(DT) library(ggsci) library(ggthemes) library(ggpubr) library(gghalves) library(ggridge...

17921 sym R (22536 sym/36 pcs) 26 img 6 tbl

Publish Document

20.09.2021

This is the latest in my series of screencasts demonstraing how t ouse the tidymodels packages, from starting out with first modeling steps to tuning more complex models. Explore the data Our modeling goal is to understand how student debt and inequality has been changing over time. We can build a model to understand the relationship between stu...

1668 sym R (5117 sym/30 pcs) 2 img

Document

14.11.2021

Spatial resampling for Tidy Tuesday Geographic data is special when it comes to, well, basically everything. suppressMessages(library(tidyverse)) suppressMessages(library(tidymodels)) data("lsl", package = "spDataLarge") landslides <- as_tibble(lsl) landslides ## # A tibble: 350 x 8 ## x y lslpts slope cplan cprof el...

780 sym R (3332 sym/13 pcs) 16 img