Publications by Semyon Toybis

Project2Data624

15.12.2024

library(readxl) library(caret) library(tidyverse) library(VIM) library(ggplot2) library(GGally) library(car) library(glmnet) Abstract We are tasked with with analyzing the manufacturing process of a beverage company, ABC Beverage. Specifically, we are tasked with analyzing what factors in the process can help us predict the PH of a beverage. We tr...

11832 sym Python (31394 sym/101 pcs) 19 img

HW9DATA624

16.11.2024

HW9DATA624 Author Semyon Toybis Assignment We are required to complete questions 8.1, 8.2, 8.3, and 8.7 from chapter 8 of “Applied Predictive Modeling” by Max Kuhn and Kjell Johnson. 8.1 First, I recreate the simulated data as instructed per the text book: set.seed(200) simulated <- mlbench.friedman1(200, sd = 1) simulated <- cbind(simula...

7977 sym 13 img

HW8DATA624

10.11.2024

HW8DATA624 Author Semyon Toybis Assignment We are required to complete questions 7.2 and 7.5 from chapter 7 of “Applied Predictive Modeling” by Max Kuhn and Kjell Johnson. 7.2 We are tasked with tuning models on simulated data via the Friedman1 function from the mlbench package. Below, I use the code provided in the book to create the data...

3682 sym 9 img

HW7DATA624

04.11.2024

HW7DATA624 Author Semyon Toybis Assignment We are required to complete questions 6.2 and 6.3 from chapter 6 of “Applied Predictive Modeling” by Max Kuhn and Kjell Johnson. 6.2 A Below I load the permeability data set which contains the matrix “fingerprints” and a matrix for permeability. data("permeability") Fingerprints contains bi...

4122 sym 8 img

Project1DATA624

27.10.2024

Project1 Author Semyon Toybis Project 1 A - ATM Data We are tasked with forecasting how much cash is taken out of four different ATM machines for the month of May 2010. First, I import the data: atm_data <- read_xlsx('ATM624Data.xlsx', col_types = c('date','text','numeric')) head(atm_data) # A tibble: 6 × 3 DATE ATM Cas...

10826 sym 30 img

HW5DATA624

21.10.2024

HW6DATA624 Author Semyon Toybis Assignment We are required to complete questions 9.1, 9.2, 9.3, 9.5, 9.6, 9.7, and 9.8 from chapter 9 of “Forecasting: Principles and Practice” Third Edition by Rob Hyndman and George Athanasopoulos. 9.1 A The above plots are autocorrelation plots for three different time series, showing the autocorrelation...

9425 sym Python (11065 sym/89 pcs) 38 img

DATA 624 ARIMA Demo

08.10.2024

Plot & Transform Source Data We use federal unemployment rate data located at FRED - https://fred.stlouisfed.org/series/UNRATE. This is US Monthly Unemployment Data. library(ggplot2) library(tsibble) library(tidyverse) library(lubridate) library(forecast) library(tseries) library(fabletools) library(fable) library(feasts) library(kableExtra) librar...

5526 sym 7 img 3 tbl

HW5DATA624

06.10.2024

HW5DATA624 Author Semyon Toybis Assignment We are required to complete questions 8.1, 8.5, 8.6, 8.7, 8.8, and 8.9 from chapter 8 of “Forecasting: Principles and Practice” Third Edition by Rob Hyndman and George Athanasopoulos. 8.1 - Victoria pigs from aus_livestock aus_livestock |> filter(Animal=='Pigs', State=='Victoria') |> autoplot() ...

6596 sym 18 img

HW4DATA624

29.09.2024

hw4DATA624 Author Semyon Toybis Assignment We are required to complete questions 3.1 and 3.2 from chapter 3 of “Applied Predictive Modeling” by Max Kuhn and Kjell Johnson. 3.1 - UC Irvine Machine Learning - Glass Data A - exploring the data As described in the book, the data consists of 214 glass samples labeled as one of seven categories...

5878 sym Python (17262 sym/32 pcs) 5 img

HW3_DATA624

22.09.2024

HW3DATA624 Author Semyon Toybis Assignment We are required to complete questions 5.1, 5.2, 5.3, 5.4, and 5.7 from chapter 5 of “Forecasting: Principles and Practice” Third Edition by Rob Hyndman and George Athanasopoulos. 5.1 We are tasked with using NAIVE(y), SNAIVE(y) or RW(y ~ drift()), which ever is most appropriate, for a variety o...

6663 sym Python (8922 sym/62 pcs) 29 img