Publications by Jacob Martin
DS 2870: Module 3 - Line graphs of Karen and Terry
Line graphs A line graph is a type a graph that uses a line to play “connect the dots” with the data points represented. It’s not required, but the x-axis on most line graphs represent time in some way. We’ll start with an example for a single name picked at random (like “Jacob”) # We'll create a line graph for the popularity of the ...
4460 sym Python (3711 sym/11 pcs) 8 img
DS 2870: Module 10 Homework - Summer 2024
Data Description: The used cars.csv file has information about 1000 randomly sampled used sedans (4 door cars) in 2021. The variables are: manufactor: The company that makes the car model: The model of the car price: The sale price of the used car (our response variable) year: The year are the car was manufactured age: The age of the car when i...
3995 sym Python (5464 sym/13 pcs) 3 img
DS 2870: Module 8 Homework - Summer 2024 - Key
Set up knitr::opts_chunk$set(echo = TRUE, fig.align = "center") # load packages pacman::p_load(tidyverse, class, skimr, caret, rpart, rpart.plot) # Changing the default theme theme_set(theme_bw()) Question 1) Spam Email The data set “Spam_Email.csv” contains columns that measure how frequently certain characters (;...
3573 sym Python (11144 sym/28 pcs) 3 img 3 tbl
DS 2870: Module 7 - Additional Practice - Savings Calculator
Question 1: Balance calculator Alex plans to save $500 in a savings account that pays 4% yearly interest, compounded monthly. This means that every month, the account earns 0.04/12 on top of the current value of the account. Ie, after the first month, they’ll have: \[500 + 0*(0.04/12) = 500.00\] After two months, Alex have the previous month�...
2359 sym 4 img
DS 2870: Module 5 Homework - Summer 2024 - Key
For this assignment, we’ll be working through with data from the Recording Industry Association of America (RIAA) in an attempt to recreate an image similar to this one: The columns other than year measure the amount of sales for that music format in millions of dollars. Question 1) Manipulate the data Change the data to be in the proper for...
1888 sym Python (4013 sym/6 pcs) 5 img
DS 2870: Homework 7 - Summer 2024 - Results
Set up Logistic regression For our example of objective functions, we looked a simple linear regression since it is a very common machine learning method. Another common machine learning method is logistic regression, which attempts to estimate the probability of success of a binary (categorical with two outcome) variable. While we won’t be ...
5668 sym 1 img
DS 2870: Module 4 Homework - Summer 2024 - Key
Data Description The movies data set has 44010 rows about the amount of explicit content (drugs, language, sex, nudity, and violence) found in 1467 movies released since 1958. Each movie is represented by 30 rows (1 row = movie & tag_name type combo). The relevant variables in the data set are: imdb_id: The identifier used by IMDB to uniquely ...
4345 sym Python (7472 sym/11 pcs) 1 img
DS 2870: Homework 6 - Summer 2024 - Graphs
Set Up Your Project and Load Libraries Question 1: State Data Part 1a) Creating the State data set Start by creating a data set that has the name of the state and its population. Show the first 10 rows in the knitted document. ## # A tibble: 51 × 2 ## state_name population ## <chr> <int> ## 1 Alabama ...
2036 sym 3 img
DS 2870 - Module 5 Homework - Summer 2024 - graphs
For this assignment, we’ll be working through with data from the Recording Industry Association of America (RIAA) in an attempt to recreate an image similar to this one: The columns other than year measure the amount of sales for that music format in millions of dollars. Question 1) Manipulate the data Change the data to be in the proper for...
1883 sym 5 img
DS 2870: Module 3 Homework - Summer 2024 - Key
Data Description The sp500 data set has the 502 companies in the Standards & Poors (S&P 500) that are the largest 500 (502) publicly traded companies in the US. The data set has 11 variables, with the important ones being: symbol: The 3 to 4 letter symbol used to ID the company on the stock market company: The name of the company sector: The s...
2301 sym Python (3914 sym/6 pcs) 6 img