Publications by Derek Howanski
DAT 3100 Apply 9
Explore data Is there a relationship between the sex of the member and deaths on specific peaks? library(tidyverse) ## ── Attaching core tidyverse packages ──────────────────────── tidyverse 2.0.0 ── ## ✔ dplyr 1.1.3 ✔ readr 2.1.4 ## ✔ forcats 1.0.0 ✔ stringr 1.5...
1358 sym R (5622 sym/27 pcs) 2 img
Apply 5
Explore data Our goal is to build a classification model to predict whether an email is spam (yesno). spam <- readr::read_csv('https://raw.githubusercontent.com/rfordatascience/tidytuesday/master/data/2023/2023-08-15/spam.csv') ## Rows: 4601 Columns: 7 ## ── Column specification ─────────────────────────...
2006 sym R (4884 sym/28 pcs) 7 img 3 tbl
Code Along 9
Explore the data library(tidyverse) ## ── Attaching core tidyverse packages ──────────────────────── tidyverse 2.0.0 ── ## ✔ dplyr 1.1.3 ✔ readr 2.1.4 ## ✔ forcats 1.0.0 ✔ stringr 1.5.0 ## ✔ ggplot2 3.4.3 ✔ tibble 3.2.1 ## ✔ lubridate 1.9.2 ...
1712 sym R (6733 sym/26 pcs) 2 img
DAT 3100 Apply 6
Our modeling goal is to predict whether a museum in the UK is accredited or not, based on other characteristics of the museum like its size and subject matter. Let’s start by reading in the data: Explore Data library(tidyverse) ## ── Attaching core tidyverse packages ───────────────────────...
3412 sym R (10744 sym/52 pcs) 4 img
DAT 3100 Apply 3
Let’s build a model for the NY Times bestseller list. We can build a model to predict the total weeks on the bestseller list. Explore data library(tidyverse) nyt_titles <- readr::read_tsv('https://raw.githubusercontent.com/rfordatascience/tidytuesday/master/data/2022/2022-05-10/nyt_titles.tsv') ## Rows: 7431 Columns: 8 ## ── Column specifica...
4791 sym R (10948 sym/33 pcs) 5 img
DAT 3100 Code Along 5
Let’s build a model for water sources. We can build a model to predict whether water is available at a given source. Explore data Our modeling goal is to predict whether a water source actually has water available, based on characteristics of the water source observed during a visit. Let’s start by reading in the data. library(tidyve...
4262 sym R (5107 sym/19 pcs) 8 img
DAT 3100 Apply 3
Let’s build a model for board games. We can build a model to predict the rating of the board game from the characteristics. Explore data ratings <- read_csv("https://raw.githubusercontent.com/rfordatascience/tidytuesday/master/data/2022/2022-01-25/ratings.csv") ## Rows: 21831 Columns: 10 ## ── Column specification ────────...
4940 sym R (11138 sym/35 pcs) 5 img
DAT 3100 Code Along 3
Let’s build a model for board games. We can build a model to predict the rating of the board game from the characteristics. Explore data ratings <- read_csv("https://raw.githubusercontent.com/rfordatascience/tidytuesday/master/data/2022/2022-01-25/ratings.csv") ## Rows: 21831 Columns: 10 ## ── Column specification ────────...
4940 sym R (11138 sym/35 pcs) 5 img
DAT 3100 Apply 1 Resubmit
Let’s build a model for the NY Times bestseller list. We can build a model to predict the total weeks on the bestseller list. Explore data library(tidyverse) nyt_titles <- readr::read_tsv('https://raw.githubusercontent.com/rfordatascience/tidytuesday/master/data/2022/2022-05-10/nyt_titles.tsv') # Remove four rows with missing values in author ...
4267 sym R (5937 sym/24 pcs) 4 img
DAT 3100 Apply 1
Let’s build a model for the NY Times bestseller list. We can build a model to predict the total weeks on the bestseller list. Explore data library(tidyverse) nyt_titles <- readr::read_tsv('https://raw.githubusercontent.com/rfordatascience/tidytuesday/master/data/2022/2022-05-10/nyt_titles.tsv') Do the debut rank and the best rank of the book rela...
4265 sym R (5555 sym/22 pcs) 4 img