Publications by Magnus Skonberg

DATA 621 HW2

19.03.2021

Authorship Critical Thinking Group 1: Angel Claudio, Bonnie Cooper, Manolis Manoli, Magnus Skonberg, Christian Thieme and Leo Yi Background In the following exercises we will be working with a version of a well known dataset known as the “Pima Indians Diabetes Database”. The dataset was gathered by the National Institute of Diabetes and Dig...

12651 sym R (7575 sym/33 pcs) 2 img

GSEVC Intro to EDA

02.04.2021

income <- read_csv("https://raw.githubusercontent.com/Magnus-PS/GSEVC/main/nj_county_medInc.csv") pop <- read_csv("https://raw.githubusercontent.com/Magnus-PS/GSEVC/main/nj_county_pop.csv") vb <- read_csv("https://raw.githubusercontent.com/Magnus-PS/GSEVC/main/nj_county_vb.csv") INTRO Garden State Elite Volleyball Club (GSEVC) is located in Su...

9136 sym R (5823 sym/13 pcs) 6 img 2 tbl

DATA 622 HW3

08.04.2021

Authorship Group 5: Don (Geeth) Padmaperuma, Subhalaxmi Rout, Isabel Ramesar, and Magnus Skonberg Background The purpose of this assignment was to explore classification via K-nearest neighbors, Decision Trees, Random Forests, and Gradient Boosting. Classification Classification is a supervised machine learning technique whose main purpose is ...

23952 sym R (28132 sym/89 pcs) 5 img 4 tbl

DATA 621 HW3

16.04.2021

Authorship Critical Thinking Group 1 Angel Claudio Bonnie Cooper Manolis Manoli Magnus Skonberg Christian Thieme Leo Yi Background In the following exercises we will be working with The Boston Housing Dataset. The dataset was gathered by the US Census Bureau regarding housing in the Boston Massachussetts area and can be obtained from the StatLi...

24316 sym R (33407 sym/41 pcs) 5 img 1 tbl

DATA 622 Final Project

20.05.2021

Authorship Group 5: Don (Geeth) Padmaperuma, Subhalaxmi Rout, Isabel Ramesar, and Magnus Skonberg Background The purpose of our Final Project was to explore the application of Neural Networks to loan approval data to then back compare model performance with a variety of Classification algorithms (ie. KNN, DT, RF, GBM). Neural Networks Neural n...

29901 sym R (54718 sym/81 pcs) 12 img 2 tbl

DATA 698 Research Project

16.05.2021

Abstract With a high rate of avoidable deaths and chronic disease as well as an obesity rate at two times higher than the OECD average, the forecast for American health is gloomy. With that in mind, the purpose of this project was ultimately to answer three questions: 1) What United States counties are most favorable for an active, healthy lifest...

43959 sym R (37724 sym/3 pcs) 6 img 3 tbl

DATA 698 Data Gathering and Pre-processing

10.05.2021

Background The purpose of this project will be to investigate the impact of a variety of health and lifestyle metrics on county-level health scores. We set out with the goal of answering the following 3 questions: What United States counties are most favorable for an active, healthy lifestyle? What are the differentiating characteristics that ma...

12885 sym R (30404 sym/67 pcs) 10 img 1 tbl

DATA 622 HW 4

05.05.2021

Authorship Group 5: Don (Geeth) Padmaperuma, Subhalaxmi Rout, Isabel R., and Magnus Skonberg Background The purpose of this assignment was to explore Clustering, Principal Component Analysis, and Support Vector Machines. Clustering is used as a means of grouping a set of observations in a way that those in the same group are more similar to one...

34156 sym R (34328 sym/89 pcs) 37 img 8 tbl

DATA 621 HW4

02.05.2021

Authorship Critical Thinking Group 1 Angel Claudio Bonnie Cooper Manolis Manoli Magnus Skonberg Christian Thieme Leo Yi Abstract We will explore, analyze and model a data set containing approximately 8,000 records. Each row represents a customer at an auto insurance company. Each record has two response variables. The first response variable, ...

17563 sym R (54235 sym/39 pcs) 8 img

DATA 621 HW5

13.05.2021

Authorship Critical Thinking Group 1 Angel Claudio Bonnie Cooper Manolis Manoli Magnus Skonberg Christian Thieme Leo Yi Abstract We will explore, analyze and model a data set containing approximately 12,000 records representing various commercially available wines. The variables are primarily related to the chemical properties of the wine bein...

19912 sym R (48061 sym/20 pcs) 8 img