Publications by Derek Howanski

DAT 3100 Apply 9

09.11.2023

Explore data Is there a relationship between the sex of the member and deaths on specific peaks? library(tidyverse) ## ── Attaching core tidyverse packages ──────────────────────── tidyverse 2.0.0 ── ## ✔ dplyr 1.1.3 ✔ readr 2.1.4 ## ✔ forcats 1.0.0 ✔ stringr 1.5...

1358 sym R (5622 sym/27 pcs) 2 img

Apply 5

08.11.2023

Explore data Our goal is to build a classification model to predict the spam email(yesno). spam <- readr::read_csv('https://raw.githubusercontent.com/rfordatascience/tidytuesday/master/data/2023/2023-08-15/spam.csv') ## Rows: 4601 Columns: 7 ## ── Column specification ─────────────────────────...

2006 sym R (4884 sym/28 pcs) 7 img 3 tbl

Code Along 9

08.11.2023

Explore the data library(tidyverse) ## ── Attaching core tidyverse packages ──────────────────────── tidyverse 2.0.0 ── ## ✔ dplyr 1.1.3 ✔ readr 2.1.4 ## ✔ forcats 1.0.0 ✔ stringr 1.5.0 ## ✔ ggplot2 3.4.3 ✔ tibble 3.2.1 ## ✔ lubridate 1.9.2 �...

1712 sym R (6733 sym/26 pcs) 2 img

DAT 3100 Apply 6

02.11.2023

Our modeling goal is to predict the whether a museum in the UK is accredited or not, based on other characteristics of the museum like its size and subject matter. Let’s start by reading in the data: Explore Data library(tidyverse) ## ── Attaching core tidyverse packages ───────────────────────...

3412 sym R (10744 sym/52 pcs) 4 img

DAT 3100 Apply 3

18.10.2023

Let’s build a model for NY Times bestsellers list. We can build a model to predict the total weeks on best sellers list. Explore data library(tidyverse) nyt_titles <- readr::read_tsv('https://raw.githubusercontent.com/rfordatascience/tidytuesday/master/data/2022/2022-05-10/nyt_titles.tsv') ## Rows: 7431 Columns: 8 ## ── Column specifica...

4791 sym R (10948 sym/33 pcs) 5 img

DAT 3100 Code Along 5

12.10.2023

Let’s build a model for water sources. We can build a model to predict whether water is available at a given source. Explore data Our modeling goal is to predict whether a water source actually has water available at it, based on characteristics of the water source observed during a visit. Let’s start by reading in the data. library(tidyve...

4262 sym R (5107 sym/19 pcs) 8 img

DAT 3100 Apply 3

12.10.2023

Let’s build a model for board games. We can build a model to predict the rating of the board game from the characteristics. Explore data ratings <- read_csv("https://raw.githubusercontent.com/rfordatascience/tidytuesday/master/data/2022/2022-01-25/ratings.csv") ## Rows: 21831 Columns: 10 ## ── Column specification ────────...

4940 sym R (11138 sym/35 pcs) 5 img

DAT 3100 Code Along 3

12.10.2023

Let’s build a model for board games. We can build a model to predict the rating of the board game from the characteristics. Explore data ratings <- read_csv("https://raw.githubusercontent.com/rfordatascience/tidytuesday/master/data/2022/2022-01-25/ratings.csv") ## Rows: 21831 Columns: 10 ## ── Column specification ────────...

4940 sym R (11138 sym/35 pcs) 5 img

DAT 3100 Apply 1 Resubmit

09.10.2023

Let’s build a model for NY Times bestsellers list. We can build a model to predict the total weeks on best sellers list. Explore data library(tidyverse) nyt_titles <- readr::read_tsv('https://raw.githubusercontent.com/rfordatascience/tidytuesday/master/data/2022/2022-05-10/nyt_titles.tsv') # Remove four rows with missing values in author ...

4267 sym R (5937 sym/24 pcs) 4 img

DAT 3100 Apply 1

09.10.2023

Let’s build a model for NY Times bestsellers list. We can build a model to predict the total weeks on best sellers list. Explore data library(tidyverse) nyt_titles <- readr::read_tsv('https://raw.githubusercontent.com/rfordatascience/tidytuesday/master/data/2022/2022-05-10/nyt_titles.tsv') Do the debut rank and the best rank of the book rela...

4265 sym R (5555 sym/22 pcs) 4 img