Publications by rachelgreenlee

hw1_decision-trees

11.03.2022

#prevent conflict with skimr and dlookr options(kableExtra.auto_format = FALSE) library(skimr) library(tidyverse) library(lubridate) library(rpart) #decision tree package rec'd by Practical ML in R textbook library(rpart.plot) #decision tree display package rec'd by Practical ML in R textbook Assignment Prompt (a) Visit the following web...

7130 sym R (4409 sym/26 pcs) 3 img 4 tbl

Project1_times-series-predictions

26.03.2022

Midterm project for DATA624 course in CUNY’s MSDS program Part A - ATM Forecast In part A, I want you to forecast how much cash is taken out of 4 different ATM machines for May 2010. The data is given in a single file. The variable ‘Cash’ is provided in hundreds of dollars, other than that it is straight forward. I am being somewhat ambig...

20286 sym R (24300 sym/155 pcs) 43 img 6 tbl

hw7_linear-regression

01.04.2022

Exercises from Chapter 6 of textbook Applied Predictive Modeling by Kuhn & Johnson Exercise 6.2 Developing a model to predict permeability (see Sect. 1.4) could save significant resources for a pharmaceutical company, while at the same time more rapidly identifying molecules that have a sufficient permeability to become a drug: (a) Start R and...

6167 sym R (5517 sym/24 pcs) 3 img 2 tbl

hw4_knn-and-clustering

15.05.2022

options(kableExtra.auto_format = FALSE) library(tidyverse) library(skimr) library(dlookr) library(class) #for knn() library(flextable) library(fastDummies) library(factoextra) library(cluster) library(gridExtra) ##making theme mpeach <- "#fbaa82" mteal <- "#73a2ac" mdarkteal <- "#0b5d69" mgray <- "#4c4c4c" # set plot theme for as...

4599 sym R (4032 sym/11 pcs) 4 img 5 tbl

hw9_trees-and-rules

30.04.2022

Exercises from Chapter 7 of textbook Applied Predictive Modeling by Kuhn & Johnson 8. Recreate the simulated data from Exercise 7.2: set.seed(200) simulated <- mlbench.friedman1(200, sd = 1) simulated <- cbind(simulated$x, simulated$y) simulated <- as.data.frame(simulated) colnames(simulated)[ncol(simulated)] <- "y" (a) Fit a random forest...

5736 sym R (7190 sym/40 pcs) 4 img

hw3_svm

24.04.2022

library(tidyverse) library(e1071) #package with svm() library(caret) Assignment Prompt (a) Perform an analysis of the dataset used in Homework #2 using the SVM algorithm.Compare the results with the results from previous homework. Preparing the dataset exactly as I did in HW2. stress <- read.csv("stress.csv", col.names = c(...

3103 sym R (2075 sym/6 pcs)

hw8_nonlinear-regression

09.04.2022

Exercises from Chapter 7 of textbook Applied Predictive Modeling by Kuhn & Johnson Exercise 7.2 Friedman (1991) introduced several benchmark data sets create by simulation. One of these simulations used the following nonlinear equation to create data: \(y = 10 sin(\pi x_1x_2) + 20(x_3 − 0.5)^2 + 10x_4 + 5x_5 + N(0, \sigma^2)\) where the x val...

3518 sym R (7698 sym/31 pcs) 2 img