Publications by Jacob Martin
DS 2870: Homework 7 - Summer 2024 - Results
Set up Logistic regression For our example of objective functions, we looked a simple linear regression since it is a very common machine learning method. Another common machine learning method is logistic regression, which attempts to estimate the probability of success of a binary (categorical with two outcome) variable. While we won’t be ...
5668 sym 1 img
DS 2870: Module 4 Homework - Summer 2024 - Key
Data Description The movies data set has 44010 rows about the amount of explicit content (drugs, language, sex, nudity, and violence) found in 1467 movies released since 1958. Each movie is represented by 30 rows (1 row = movie & tag_name type combo). The relevant variables in the data set are: imdb_id: The identifier used by IMDB to uniquely ...
4345 sym Python (7472 sym/11 pcs) 1 img
DS 2870: Homework 6 - Summer 2024 - Graphs
Set Up Your Project and Load Libraries Question 1: State Data Part 1a) Creating the State data set Start by creating a data set that has the name of the state and its population. Show the first 10 rows in the knitted document. ## # A tibble: 51 × 2 ## state_name population ## <chr> <int> ## 1 Alabama ...
2036 sym 3 img
DS 2870 - Module 5 Homework - Summer 2024 - graphs
For this assignment, we’ll be working through with data from the Recording Industry Association of America (RIAA) in an attempt to recreate an image similar to this one: The columns other than year measure the amount of sales for that music format in millions of dollars. Question 1) Manipulate the data Change the data to be in the proper for...
1883 sym 5 img
DS 2870: Module 3 Homework - Summer 2024 - Key
Data Description The sp500 data set has the 502 companies in the Standards & Poors (S&P 500) that are the largest 500 (502) publicly traded companies in the US. The data set has 11 variables, with the important ones being: symbol: The 3 to 4 letter symbol used to ID the company on the stock market company: The name of the company sector: The s...
2301 sym Python (3914 sym/6 pcs) 6 img
DS 2870: Module 4 Homework - Summer 2024 - Graphs
Data Description The movies data set has 44010 rows about the amount of explicit content (drugs, language, sex, nudity, and violence) found in 1467 movies released since 1958. Each movie is represented by 30 rows (1 row = movie & tag_name type combo). The relevant variables in the data set are: imdb_id: The identifier used by IMDB to uniquely ...
4339 sym 1 img
DS 2870: Homework 3 - Summer 2024 - Solutions
knitr::opts_chunk$set(echo = T, warning = F, message = F, fig.align = "center") ## Load the required package: tidyverse library(tidyverse) ## Reading in the Dr Who data from github drwho <- read.csv("https://raw.githubusercontent.com/Shammalamala/DS-2870-Data-Sets/main/d...
3058 sym R (3339 sym/6 pcs) 5 img
DS 2870: Module 3 Homework - Graphs - summer 2024
Data Description The sp500 data set has the 502 companies in the Standards & Poors (S&P 500) that are the largest 500 (502) publicly traded companies in the US. The data set has 11 variables, with the important ones being: symbol: The 3 to 4 letter symbol used to ID the company on the stock market company: The name of the company sector: The s...
2294 sym 5 img
DS 2870: Module 2 Homework - Summer 2024 - Graphs
knitr::opts_chunk$set(echo = F, warning = F, message = F, fig.align = "center") ## Load the required package: tidyverse library(tidyverse) ## Reading in the Dr Who data from github drwho <- read.csv("https://raw.githubusercontent.com/Shammalamala/DS-2870-Data-Sets/main/d...
3053 sym R (840 sym/1 pcs) 5 img
DS 2870: Module 3 - Bar Charts
Set Up Your Project and Load Libraries ## Set the default size of figures and default to printing the R code knitr::opts_chunk$set(fig.align = "center", echo = F, warning = F, message = F, include = T) ## Load the libraries we will be using pacman::p_l...
10112 sym 13 img 2 tbl