Publications by Donny Lofland
DATA 624 Homework 8 (Non-Linear Regression)
Source Code: https://github.com/djlofland/DATA624_PredictiveAnalytics/tree/master/Homework_8 Problem 7.2 Friedman (1991) introduced several benchmark data sets create by simulation. One of these simulations used the following nonlinear equation to create data: \[y = 10 sin(\pi x_1 x_2) + 20(x_3 − 0.5)^2 + 10x_4 + 5x_5 + N(0, \sigma^2)\] where ...
4586 sym R (16063 sym/79 pcs) 15 img
Data 621 - Homework 4 (Logistic Regression)
Source code: https://github.com/djlofland/DS621_F2020_Group3/tree/master/Homework_4 Introduction Group 3 created a multiple linear regression and binary logistic model to estimate the probability of a driver having an auto accident, and the monetary damage, for the customer Khansari Auto Insurance. As auto insurance and insurance in general stem...
21542 sym R (47393 sym/17 pcs) 19 img
DATA 624 Homework 7 (Linear Regression)
Source Code: https://github.com/djlofland/DATA624_PredictiveAnalytics/tree/master/Homework_7 Problem 6.2 Developing a model to predict permeability (see Sect. 1.4) could save significant resources for a pharmaceutical company, while at the same time more rapidly identifying molecules that have a sufficient permeability to become a drug: Start R...
6928 sym R (19045 sym/67 pcs) 7 img
DATA 624 Homework 9 (Trees)
Source Code: https://github.com/djlofland/DATA624_PredictiveAnalytics/tree/master/Homework_9 Problem 8.1 Recreate the simulated data from Exercise 7.2: Load Data set.seed(200) simulated <- mlbench.friedman1(200, sd = 1) simulated <- cbind(simulated$x, simulated$y) simulated <- as.data.frame(simulated) colnames(simulated)[ncol(simulated)] <- "y"...
9351 sym R (100176 sym/74 pcs) 8 img
DATA 624 Homework 10 (Market Basket Analysis)
Source Code: https://github.com/djlofland/DATA624_PredictiveAnalytics/tree/master/Homework_10 Overview Imagine 10000 receipts sitting on your table. Each receipt represents a transaction with items that were purchased. The receipt is a representation of stuff that went into a customer’s basket - and therefore ‘Market Basket Analysis’. That...
2175 sym R (737566 sym/19 pcs) 5 img
DATA 622 - Homework 1 (Penguins Logistic)
Source Code: https://github.com/djlofland/DATA622_MachineLearning/tree/master/Homework1 Instructions Let’s use the Penguin dataset for our assignment. To learn more about the dataset, please visit: https://allisonhorst.github.io/palmerpenguins/articles/intro.html For this assignment, let us use species as our outcome or the dependent variable....
9508 sym R (13880 sym/39 pcs) 6 img 6 tbl
Data622 Homework3
Source Code: https://github.com/djlofland/DATA622_S2021_Group2/tree/master/Homework3 Part 1: KNN on the Penguins dataset Please use the K-nearest neighbor (KNN) algorithm to predict the species variable. Please be sure to walk through the steps you took. (40 points) Similar to past assignments when using the Palmer Penguins dataset, we’ll fir...
24263 sym R (28761 sym/84 pcs) 23 img 10 tbl
DATA 622 - Homework 3 (Group2)
Source Code: https://github.com/djlofland/DATA622_S2021_Group2/tree/master/Homework3 Part 1: KNN on the Penguins dataset Please use K-nearest neighbor (KNN) algorithm to predict the species variable. Please be sure to walk through the steps you took. (40 points) Similar to past assignments when using the Palmer Penguins dataset, we’ll first d...
19069 sym R (33681 sym/91 pcs) 25 img 10 tbl
DATA 622 - Homework 2 (Penguins Cont)
Source Code: https://github.com/djlofland/DATA622_MachineLearning/tree/master/Homework2 Instructions Let’s use the Penguin dataset for our assignment. To learn more about the dataset, please visit: https://allisonhorst.github.io/palmerpenguins/articles/intro.html We will be working with the Penguin dataset again as we did for Homework #1. Plea...
11568 sym R (22597 sym/118 pcs) 4 img 6 tbl
DATA622 Homework 4 (Group2)
Source Code: https://github.com/djlofland/DATA622_S2021_Group2/tree/master/Homework4 For this assignment, we will be working with a very interesting mental health dataset from a real-life research project. All identifying information, of course, has been removed. The attached spreadsheet has the data (the tab name “Data”). The data dictionary...
27909 sym R (42327 sym/64 pcs) 22 img 8 tbl