Publications by Matthew Tillmawitz
Data 624 Final
1. Introduction Recent regulatory changes emphasize the importance of having a comprehensive understanding of manufacturing processes and their impact on product quality. At ABC Beverage, pH levels are a critical parameter for ensuring consistency and maintaining product standards. This analysis aims to identify and quantify the factors driving pH ...
36192 sym Python (24607 sym/89 pcs) 20 img 23 tbl
The Effect of Female Education on Birth Rates in the United States
Introduction There is a popular and persistent perception in American society that women who pursue higher education have fewer children. This narrative arises again and again in popular culture in movies such as Idiocracy as well as conservative talk shows and podcasts. In this analysis we will explore whether the pursuit of higher education by wo...
12262 sym Python (16922 sym/29 pcs) 20 img 10 tbl
Data 606 Final Project Proposal
Data Preparation A peak of the final dataframe is at the end, the data manipulation gets pretty complicated. geography <- "public use microdata area" all_vars <- load_variables(2022, "acs5") # Plain text labels associated with variable codes education_data <- get_acs( geography = geography, table = "B15001", year = 2022, survey = "acs5",...
6182 sym Python (11368 sym/15 pcs) 3 img 1 tbl
Data 624 Homework 9
8.1 Recreating data. set.seed(200) simulated <- mlbench.friedman1(200, sd = 1) simulated <- cbind(simulated$x, simulated$y) simulated <- as.data.frame(simulated) colnames(simulated)[ncol(simulated)] <- "y" 8.1.1 set.seed(200) library(randomForest) library(caret) model1 <- randomForest(y ~ ., data = simulated, importance = TR...
4349 sym R (18708 sym/45 pcs) 1 img
Data 624 Assignment 8
7.2 Going through our various models, we can see that the neural network slightly outperforms the MARS and SVM models while the kNN model trails the others. The MARS model does indeed select the informative predictors. set.seed(8675309) gen_data <- mlbench.friedman1(5000, sd = 1) gen_df <- data.frame( y = gen_data$y, x = gen_data$x ) names(gen_...
2135 sym R (15897 sym/44 pcs) 5 img
Recommender System Case Study
Introduction As winter draws closer so does ski season! In Denver we are currently experiencing the largest November snow storm in decades, with over a foot of snow predicted by the end. Ski resorts in the Rockies are opening earlier than they have in decades, and that means people are more excited than ever to hit the slopes. Evo.com is one of the...
6838 sym 4 img
Data 624 Assignment 7
6.2 6.2.1 library(AppliedPredictiveModeling) data(permeability) kable(head(permeability)) permeability 12.520 1.120 19.405 1.730 1.680 0.510 kable(head(fingerprints)) X1 X2 X3 X4 X5 X6 X7 X8 X9 X10 X11 X12 X13 X14 X15 X16 X17 X18 X19 X20 X21 X22 X23 X24 X25 X26 X27 X28 X29 X30 X31 X32 X33 X34 X35 X36 X37 X38 X39 X40 X41 X42 X43 X44 X45 X...
3017 sym R (18141 sym/22 pcs) 3 img 6 tbl
Data 624 Project 1
Introduction This project consists of analysis and forecasting of two provided datasets, one dealing with ATM withdrawals and the other power consumption. We will begin with the ATM dataset. ATM Forecasting The ATM dataset consists of daily cash withdrawal totals from 4 different ATM machines. We have been tasked to “forecast how much cash is ta...
19323 sym Python (14782 sym/86 pcs) 22 img
Data 607 Web API Assignment
Fetching the NYT Best Sellers List This assignment lends itself better to an actual R Script file but for ease of publishing as well as viewing the results it will be done in R Markdown. Note that this code assumes you have already registered an application and authorized it to call the /lists API as well as stored the API key in your .Renviron. Th...
1180 sym
Data 624 Assignment 6
9.1 9.1.1 All appear to indicate that the data is white noise as none of the autocorrelations rise to the level of significance, although I will admit the plot of 1000 random numbers is difficult to determine with the naked eye due to the size of the spikes and low critical value. 9.1.2 The critical values decrease with the number of observations...
9035 sym Python (11844 sym/63 pcs) 27 img