Publications by Caiden Skakalski

Quiz 1

18.02.2021

# Load packages library(tidyquant) library(tidyverse) # for count() function # Import S&P500 Stock Index SP500 <- tq_index("SP500") SP500 ## # A tibble: 505 x 8 ## symbol company identifier sedol weight sector shares_held local_currency ## <chr> <chr> <chr> <chr> <dbl> <chr> <dbl> <chr> ## 1 AAP...

987 sym R (1735 sym/4 pcs)

Quiz 4

02.04.2021

For this quiz, you are going to use orange juice data. This data set is originally used in a machine learning (ML) class, with the goal to predict which of the two brands of orange juices the customers bought. Of course, you are not building a ML algorithm in this quiz. I just wanted to provide you with the context of the data. The response varia...

3851 sym R (6330 sym/22 pcs) 1 img

busStatQuiz3

19.03.2021

# Load the package library(tidyverse) # Import data Orange <- read.csv('https://raw.githubusercontent.com/selva86/datasets/master/orange_juice_withmissing.csv', stringsAsFactors = TRUE) %>% mutate(STORE = as.factor(STORE), StoreID = as.factor(StoreID)) # Print the first 6 rows head(Orange) ## Purchase WeekofPurchase StoreID ...

1838 sym R (5931 sym/12 pcs) 2 img

Quiz 2

04.03.2021

For this quiz, you are going to use orange juice data. This data set is originally used in a machine learning (ML) class, with the goal to predict which of the two brands of orange juices the customers bought. Of course, you are not building a ML algorithm in this quiz. I just wanted to provide you with the context of the data. The response varia...

3066 sym R (5741 sym/8 pcs) 2 img

Quiz 5

20.04.2021

You will use college tuition and diversity data for this quiz. See below for the definition of some of the variables. in_state_total: Total cost for in-state residents in USD (sum of room & board + in state tuition) out_of_state_total: Total cost for in-state residents in USD (sum of room & board + out of state tuition) percent_minority: share o...

2646 sym R (11793 sym/11 pcs) 3 img

Quiz 6

30.04.2021

For this quiz, you are going to use mpg (miles per galon) dataset. This dataset contains a subset of the fuel economy data that the EPA makes available on http: //fueleconomy.gov. It contains only models which had a new release every year between 1999 and 2008 - this was used as a proxy for the popularity of the car. The dataset has the following...

2168 sym R (3724 sym/22 pcs)

Term Paper

08.05.2021

Choose one of David Robinson’s tidytuesday screencasts, watch the video, and summarise. https://www.youtube.com/channel/UCeiiqmVK07qhY-wvg3IZiZQ Instructions You must follow the instructions below to get credits for this assignment. Elaborate your answer. One or two sentence answers won’t get credit. Make sure to cite what you see and hear ...

3371 sym