Publications by Shamecca Marshall

Data 624 - HW 8

13.04.2025

Exercise 7.2: Friedman Benchmark Dataset Friedman (1991) introduced several benchmark data sets created by simulation. One of these simulations used the following nonlinear equation to create data: \[ y = 10 \sin(\pi x_1 x_2) + 20(x_3 - 0.5)^2 + 10x_4 + 5x_5 + N(0, \sigma^2) \] where the $x$ values are random variables uniformly distributed...

3684 sym R (16804 sym/61 pcs) 11 img

Data 624 HW 7

06.04.2025

library(tidyverse) ## Warning: package 'lubridate' was built under R version 4.4.2 ## ── Attaching core tidyverse packages ──────────────────────── tidyverse 2.0.0 ── ## ✔ dplyr 1.1.4 ✔ readr 2.1.5 ## ✔ forcats 1.0.0 ✔ stringr 1.5.1 ## ✔ ggplot2 3.5.1 ✔...

9562 sym Python (12064 sym/54 pcs) 6 img

Data 624 Project 1

31.03.2025

1 DATA 624 Forecasting Project – ATM, Power Usage, and Water Flow Analysis 1.1 Introduction This project consists of three forecasting tasks. Part A involves predicting the cash withdrawals from four ATMs for May 2010. Part B focuses on forecasting residential power consumption for the year 2014 using monthly data from 1998–2013. Part C is...

30757 sym R (44099 sym/177 pcs) 51 img 7 tbl

Data 622 0 Assignment 2

23.03.2025

Introduction In Machine Learning, Experimentation refers to the systematic process of designing, executing, and analyzing different configurations to identify the optimal settings that performs best on a given task. Experimentation is learning by doing. It involves systematically changing parameters, evaluating results with metrics, and compari...

16270 sym Python (36889 sym/118 pcs) 18 img 2 tbl

Data 624 - HW 6

14.03.2025

9.1: Figure 9.32 shows the ACFs for 36 random numbers, 360 random numbers and 1,000 random numbers. (a): Explain the differences among these figures. Do they all indicate that the data are white noise? The difference between these figures lies in the height of the spikes and how wide the confidence bands are. As the sample size grows, the spi...

13841 sym Python (12482 sym/64 pcs) 26 img

Data 624 HW 5

07.03.2025

knitr::opts_chunk$set(fig.width = 7, fig.height = 5, fig.align = "center") 8.1 Question: Consider the the number of pigs slaughtered in Victoria, available in the aus_livestock dataset. (a) Use the ETS() function to estimate the equivalent model for simple exponential smoothing. Find the optimal values of $\alpha$ and $\ell_0$, and generate...

8942 sym Python (10070 sym/50 pcs) 12 img

Data 622 - HW 1

03.03.2025

Introduction This assignment focuses on Exploratory Data Analysis (EDA), a crucial step in data science that helps identify data gaps, imbalances, and provides insights into data quality. This analysis will focus on campaign effectiveness, customer demographics, and seasonality trends to understand how the bank can improve future marketing stra...

6677 sym Python (18565 sym/37 pcs) 12 img 5 tbl

Data 624 HW 4

27.02.2025

3.1 The UC Irvine Machine Learning Repository contains a dataset related to glass identification. The dataset consists of 214 glass samples labeled as one of seven class categories. There are nine predictors, including the refractive index (RI) and percentages of eight elements: Na, Mg, Al, Si, K, Ca, Ba, and Fe. This dataset will be used to e...

5089 sym 41 img 1 tbl

Data 624 - HW 3

23.02.2025

5.1. Produce forecasts for the following series using whichever of NAIVE(y), SNAIVE(y) or RW(y ~ drift()) is more appropriate in each case: - Australian Population (global_economy) global_economy %>% filter(Country == "Australia") %>% model(RW(Population ~ drift())) %>% forecast(h = 14) %>% autoplot(global_economy) + labs(title =...

9367 sym 18 img

Data 624 HW1

09.02.2025

1. Exploring Time Series Data Use the help function to explore what the series gold, woolyrnq, and gas represent. Load Required Libraries library(forecast) # ASSIGNING THE LIBRARY ## Warning: package 'forecast' was built under R version 4.4.2 ## Registered S3 method overwritten by 'quantmod': ## method from ## as.zoo.data.frame ...

5561 sym R (8616 sym/84 pcs) 17 img