Publications by Magnus Skonberg
DATA 606 Project Proposal
Data Preparation We read in our data regarding automation from github (in its raw form). data <- read.csv("https://raw.githubusercontent.com/Magnus-PS/CUNY-SPS-DATA-606/Data-Project/automation_data_by_state.csv", header = TRUE) We then operate upon the automation dataset to merge our State columns into one column representative of total employmen...
3899 sym R (1963 sym/16 pcs) 2 img
DATA 605 Final Project
Code Show All Code Hide All Code DATA 605 - Final Project DATA 605 - Final Project Background Problem 1 Probability Investigation Testing Problem 2 Descriptive and Inferential Statistics Linear Algebra and Correlation Calculus-Based Probability & Statistics Modeling Closing Remarks Magnus Skonberg 2020-12-16 Background The purpose of o...
18674 sym R (14474 sym/105 pcs) 10 img
DATA 607 Final Project Presentation
12/9/2020 Background Question(s): What are the best Data Science companies to work for? and What are the characteristics that make them so? Data: Top Tech Companies Stock Price (.csv) and Glassdoor Company Reviews (web scrape). Data is properly cited in the Appendix. Approach: Acquire and filter. Tidy and transform. Visualize and analyze. ...
3712 sym R (1150 sym/5 pcs) 1 img
DATA 605 HW 15
Background The purpose of the assignment was to explore Calculus: Functions of Several Variables. Exercise 1 Find the equation of the regression line for the given points. Round any final values to the nearest hundredth, if necessary. (5.6, 8.8), (6.3, 12.4), (7, 14.8), (7.7, 18.2), (8.4, 20.8) Per R calculations below, the equation of the reg...
5263 sym R (1154 sym/8 pcs)
DATA 605 HW 14
Background The purpose of the assignment was to capture our exploration of Taylor Series expansion using an R markdown document. Exercise 1 Work out the Taylor Series expansion of \(f(x) = \frac{1}{(1 - x)}\). For each function only consider its valid ranges as indicated in the notes when computing the Taylor Series expansion. We start with cr...
4152 sym
DATA 605 HW 13
Background The purpose of the assignment was to explore the fundamentals of Calculus in R. Exercise 1 Use integration by substitution to solve ∫ \(4e^{-7x} dx\). We apply u-substituton and unwind the chain rule. To start, let’s define u: \(u = -7x\) From this we take the derivative of u with respect to x and derive: \(du/dx = -7\) –> \(dx ...
6408 sym R (654 sym/8 pcs)
DATA 605 Wk 13 Disc
Background The purpose of this week’s discussion topic is to build a multiple regression model. In the building of this model, we’re to include (1) quadratic term, (1) dichotomous term, and (1) dichotomous vs. quantitative interaction term. Once built, we’re to interpret all coefficients, conduct residual analysis, and determine whether or...
5551 sym R (4760 sym/27 pcs) 1 img
DATA 607 Project 4
DATA 607 - Project 4 Jered Ataky, Magnus Skonberg 2020-11-13 Background The focus of this project is document classification. For this project, we will start with a corpus dataset, unzip our data, generate a training model that we’ll then use to predict the class of new documents (those withheld from the training set or taken from another sour...
7439 sym R (12256 sym/36 pcs) 1 img
DATA 605 HW 12
Background The purpose of the assignment was to explore the properties of linear regression. Load data After downloading the .csv file from Blackboard and uploading it to Github, we read the corresponding data (in raw form) and then familiarize ourselves with the dataset by displaying column names, column number, row number, the 1st 6 observatio...
5922 sym R (5494 sym/30 pcs) 2 img
DATA 607 Tidyverse EXTEND
Introduction This vignette will take a quick peek at two useful data exploration plot types provided in the ggExtra package using a UFO sightings dataset. We aren’t going to worry about style or labels, just some quick plots to explore your data well before further analysis and presenting findings to others. Setup To have some fun, I picked a ...
3474 sym R (3734 sym/14 pcs) 7 img