Publications by Donny Lofland

DATA 624 Homework 8 (Non-Linear Regression)

02.12.2020

Source Code: https://github.com/djlofland/DATA624_PredictiveAnalytics/tree/master/Homework_8 Problem 7.2 Friedman (1991) introduced several benchmark data sets create by simulation. One of these simulations used the following nonlinear equation to create data: \[y = 10 sin(\pi x_1 x_2) + 20(x_3 − 0.5)^2 + 10x_4 + 5x_5 + N(0, \sigma^2)\] where ...

4586 sym R (16063 sym/79 pcs) 15 img

Data 621 - Homework 4 (Logistic Regression)

23.11.2020

Source code: https://github.com/djlofland/DS621_F2020_Group3/tree/master/Homework_4 Introduction Group 3 created a multiple linear regression and binary logistic model to estimate the probability of a driver having an auto accident, and the monetary damage, for the customer Khansari Auto Insurance. As auto insurance and insurance in general stem...

21542 sym R (47393 sym/17 pcs) 19 img

DATA 624 Homework 7 (Linear Regression)

02.11.2020

Source Code: https://github.com/djlofland/DATA624_PredictiveAnalytics/tree/master/Homework_7 Problem 6.2 Developing a model to predict permeability (see Sect. 1.4) could save significant resources for a pharmaceutical company, while at the same time more rapidly identifying molecules that have a sufficient permeability to become a drug: Start R...

6928 sym R (19045 sym/67 pcs) 7 img

DATA 624 Homework 9 (Trees)

03.12.2020

Source Code: https://github.com/djlofland/DATA624_PredictiveAnalytics/tree/master/Homework_9 Problem 8.1 Recreate the simulated data from Exercise 7.2: Load Data set.seed(200) simulated <- mlbench.friedman1(200, sd = 1) simulated <- cbind(simulated$x, simulated$y) simulated <- as.data.frame(simulated) colnames(simulated)[ncol(simulated)] <- "y"...

9351 sym R (100176 sym/74 pcs) 8 img

DATA 624 Homework 10 (Market Basket Analysis)

05.12.2020

Source Code: https://github.com/djlofland/DATA624_PredictiveAnalytics/tree/master/Homework_10 Overview Imagine 10000 receipts sitting on your table. Each receipt represents a transaction with items that were purchased. The receipt is a representation of stuff that went into a customer’s basket - and therefore ‘Market Basket Analysis’. That...

2175 sym R (737566 sym/19 pcs) 5 img

DATA 622 - Homework 1 (Penguins Logistic)

20.02.2021

Source Code: https://github.com/djlofland/DATA622_MachineLearning/tree/master/Homework1 Instructions Let’s use the Penguin dataset for our assignment. To learn more about the dataset, please visit: https://allisonhorst.github.io/palmerpenguins/articles/intro.html For this assignment, let us use species as our outcome or the dependent variable....

9508 sym R (13880 sym/39 pcs) 6 img 6 tbl

Data622 Homework3

09.04.2021

Source Code: https://github.com/djlofland/DATA622_S2021_Group2/tree/master/Homework3 Part 1: KNN on the Penguins dataset Please use the K-nearest neighbor (KNN) algorithm to predict the species variable. Please be sure to walk through the steps you took. (40 points) Similar to past assignments when using the Palmer Penguins dataset, we’ll fir...

24263 sym R (28761 sym/84 pcs) 23 img 10 tbl

DATA 622 - Homework 3 (Group2)

09.04.2021

Source Code: https://github.com/djlofland/DATA622_S2021_Group2/tree/master/Homework3 Part 1: KNN on the Penguins dataset Please use K-nearest neighbor (KNN) algorithm to predict the species variable. Please be sure to walk through the steps you took. (40 points) Similar to past assignments when using the Palmer Penguins dataset, we’ll first d...

19069 sym R (33681 sym/91 pcs) 25 img 10 tbl

DATA 622 - Homework 2 (Penguins Cont)

20.03.2021

Source Code: https://github.com/djlofland/DATA622_MachineLearning/tree/master/Homework2 Instructions Let’s use the Penguin dataset for our assignment. To learn more about the dataset, please visit: https://allisonhorst.github.io/palmerpenguins/articles/intro.html We will be working with the Penguin dataset again as we did for Homework #1. Plea...

11568 sym R (22597 sym/118 pcs) 4 img 6 tbl

DATA622 Homework 4 (Group2)

24.04.2021

Source Code: https://github.com/djlofland/DATA622_S2021_Group2/tree/master/Homework4 For this assignment, we will be working with a very interesting mental health dataset from a real-life research project. All identifying information, of course, has been removed. The attached spreadsheet has the data (the tab name “Data”). The data dictionary...

27909 sym R (42327 sym/64 pcs) 22 img 8 tbl