Publications by Michael Ippolito

ippolito_data624_pj2

08.07.2023

output: html_document: theme: yeti highlight: tango toc: yes toc_float: yes pdf_document: dev: cairo_pdf toc: yes — Load data First, we load the data from Github. We had some trouble reading the Excel files, so we converted them to CSV. # Load training data (m=modeling) dfm_raw <- read.csv('https://raw.githubusercontent.com/klgriffen96/summer...

13724 sym R (58203 sym/24 pcs) 8 img 4 tbl

ippolito_data624_wk5

03.07.2023

KJ 7.2 Friedman (1991) introduced several benchmark data sets create by simulation. One of these simulations used the following nonlinear equation to create data: \(y\ =\ 10sin(πx_1x_2)\ +\ 20(x_3\ −\ 0.5)^2\ +\ 10x_4\ +\ 5x_5\ +\ N(0,\ σ^2)\) where the x values are random variables uniformly distributed between [0, 1] (there are also 5 oth...

12932 sym Python (358154 sym/129 pcs) 6 img 17 tbl

ippolito_data624_wk6

04.07.2023

output: html_document: theme: yeti highlight: tango toc: yes toc_float: yes pdf_document: dev: cairo_pdf toc: yes — Market basket analysis Imagine 10000 receipts sitting on your table. Each receipt represents a transaction with items that were purchased. The receipt is a representation of stuff that went into a customer’s basket – and the...

1070 sym R (5835 sym/21 pcs) 3 img 1 tbl

ippolito_data624_wk4

20.06.2023

KJ 6.3 A chemical manufacturing process for a pharmaceutical product was discussed in Sect. 1.4. In this problem, the objective is to understand the relationship between biological measurements of the raw materials (predictors), measurements of the manufacturing process (predictors), and the response of the product yield. Biological predictors...

4516 sym R (15676 sym/42 pcs) 8 img 3 tbl

ippolito_data624_project1

13.06.2023

Load data # Load data from git GET('https://github.com/klgriffen96/summer23_data624/raw/main/project_1/Data%20Set%20for%20Class.xls', write_disk(tmpfile <- tempfile(fileext=".xls"))) ## Response [https://raw.githubusercontent.com/klgriffen96/summer23_data624/main/project_1/Data%20Set%20for%20Class.xls] ## Date: 2023-06-13 15:26 ## Status: 20...

156 sym Python (7032 sym/14 pcs) 13 img

ippolito_data624_wk2

11.06.2023

KJ 3.1 KJ 3.1 (a) The UC Irvine Machine Learning Repository contains a data set related to glass identification. The data consist of 214 glass samples labeled as one of seven class categories. There are nine predictors, including the refrative index and percentages of eight elements: Na, Mg, Al, Si, K, Ca, Ba, and Fe. Using visualizations, exp...

9259 sym Python (38552 sym/108 pcs) 13 img

ippolito_data624_week3

11.06.2023

HA 8.1 HA 8.1 (a) Figure 8.31 shows the ACFs for 36 random numbers, 360 random numbers and 1,000 random numbers. Explain the differences among these figures. Do they all indicate that the data are white noise? wnacfplus-1.png Figure 8.31: Left: ACF for a white noise series of 36 numbers. Middle: ACF for a white noise series of 360 numbers. Ri...

3413 sym 2 img

ippolito_data624_week1

10.06.2023

HA 2.1 (a) Use autoplot() to plot each of these in separate plots (gold, woolyrnq and gas). # Set minimal theme theme_set(theme_minimal()) # Autoplot gold data data(gold) autoplot(gold) # Autoplot wolyrnq data data(woolyrnq) autoplot(woolyrnq) # Autoplot gas data data(gas) autoplot(gas) HA 2.1 (b) What is the frequency of each series? ...

3351 sym Python (17906 sym/67 pcs) 18 img

ippolito_data624_hw2

10.06.2023

KJ 3.1 (a) The UC Irvine Machine Learning Repository contains a data set related to glass identification. The data consist of 214 glass samples labeled as one of seven class categories. There are nine predictors, including the refrative index and percentages of eight elements: Na, Mg, Al, Si, K, Ca, Ba, and Fe. Using visualizations, explore the...

9212 sym Python (38552 sym/108 pcs) 13 img

CUNY624_hw1

04.06.2023

HA 2.1 (a) Use autoplot() to plot each of these in separate plots (gold, woolyrnq and gas). # Set minimal theme theme_set(theme_minimal()) # Autoplot gold data data(gold) autoplot(gold) # Autoplot wolyrnq data data(woolyrnq) autoplot(woolyrnq) # Autoplot gas data data(gas) autoplot(gas) HA 2.1 (b) What is the frequency of each series? ...

3334 sym Python (10312 sym/58 pcs) 15 img