Publications by Christian Thieme

Car Insurance Analysis and Logistic Regression

03.05.2021

Authorship Critical Thinking Group 1 Angel Claudio Bonnie Cooper Manolis Manoli Magnus Skonberg Christian Thieme Leo Yi Abstract We will explore, analyze and model a data set containing approximately 8,000 records. Each row represents a customer at an auto insurance company. Each record has two response variables. The first response variable, ...

17561 sym R (54124 sym/39 pcs) 8 img

Influenza and Pneumonia Mortality during the Global COVID-19 Pandemic and the Impact of Local Government Restrictions

24.05.2021

Influenza and Pneumonia Mortality during the Global COVID-19 Pandemic and the Impact of Local Government RestrictionsAngel Claudio, Bonnie Cooper, Manolis Manoli, Magnus Skonberg, Christian Thieme, Leo Yi"2021-05-23" Communicable Disease during COVID-19 The global COVID-19 pandemic has greatly impacted disease beyond direct cases of COVID-19. R...

6944 sym 5 img 1 tbl

Project 2 - Non-linear Regression Models

11.07.2021

Group Members Subhalaxmi Rout Kenan Sooklall Devin Teran Christian Thieme Leo Yi /pagebreak Introduction We have been given a dataset from a beverage manufacturing company that consists of 2,571 rows of data and 33 columns. The dataset contains information on different beverages and their chemical composition. The goal of this analysis is to u...

7315 sym R (7019 sym/23 pcs) 7 img

Nonlinear Regression Models, Regression Trees and Rules-Based Models

06.07.2021

Homework 5: Applied Predictive Modeling 7.2 set.seed(200) trainingData <- mlbench.friedman1(200, sd = 1) trainingData$x <- data.frame(trainingData$x) testData <- mlbench.friedman1(5000, sd = 1) testData$x <- data.frame(testData$x) knn_fit<- train(x=trainingData$x,y=trainingData$y, method = "knn", preProcess = c("center", "scale"), tuneLe...

2744 sym R (11173 sym/38 pcs) 5 img 1 tbl

Arima Models

19.06.2021

Forecasting: Principles and Practice 8.1 The main difference between these figures are that the scale of both the ACF values and the 95% limit lines seems to be decreasing from left to right. Additionally, it appears that some of the patterns change from chart to chart with positive and negative ACF. It does appear that these charts indicate th...

5291 sym R (2824 sym/21 pcs) 15 img

Time Series and Decomposition

19.06.2021

Homework 1: Time Series and Decomposition Forecasting: Principles and Practice 2.1 Use the help function to explore what the series gold, woolyrnq and gas represent. #?gold #?woolyrnq #?gas \((a)\). Use autoplot() to plot each of these in separate plots. autoplot(gold) autoplot(woolyrnq) autoplot(gas) \((b)\) What is the frequency of each ...

3748 sym R (1060 sym/26 pcs) 13 img

Data Pre-processing & Exponential Smoothing

19.06.2021

Applied Predictive Modeling Chapter 3 - Data Pre-processing 3.1 The UC Irvine Machine Learning Repository contains a data set related to glass identification. The data consist of 214 glass samples labeled as one of seven class categories. There are nine predictors, including the refractive index and percentages of eight elements: Na, Mg, Al, Si,...

7566 sym R (8614 sym/38 pcs) 5 img

Time Series Analysis Stock Data

25.06.2021

Project 1: S05 - Forecast Var02, Var03 Load Libraries and Data library(readxl) # read excel library(dplyr) library(PerformanceAnalytics) # correlation and histogram library(ggplot2) # ggplot library(tidyverse) library(tidyr) # drop_na() library(forecast) # autoplot library(fpp3) library(zoo) library(seasonal) library(tidymodels) data ...

9979 sym R (11488 sym/61 pcs) 23 img

Project 1 - Time Series Analysis and Forecasting Using ARIMA and ETS

26.06.2021

Group Members Subhalaxmi Rout Kenan Sooklall Devin Teran Christian Thieme Leo Yi Introduction The dataset for this project was provided as a de-identified excel spreadsheet that included six different groups. Each group had two variables to be forecasted 140 periods into the future using 1,622 historical periods. This data had been provided as ...

9459 sym R (1845 sym/6 pcs) 11 img

Final Project - DATA622

30.11.2021

Group 1 Members: David Moste Vinayak Kamath Kenan Sooklall Christian Thieme Lidiia Tronina Introduction Data scientists are in extremely high demand around the world. Companies are constantly fighting to acquire and keep talented professionals. However, as we’ve seen in the last year, many professionals are leaving their jobs and looking for ...

21189 sym R (15734 sym/42 pcs) 21 img 2 tbl