Publications by rachelgreenlee

Final Project

08.12.2020

Introduction This project is an exploration of how the public and/or shareholders can pressure companies to not turn a blind eye to the affects their operations have on the environment and the climate crisis. Many governments have been slow to impose sweeping regulations on business and industry with regards to harmful emissions, so looking to ho...

14623 sym R (5328 sym/12 pcs) 4 img

DATA607 - Assignment 10

31.10.2020

Purpose For this assignment I’ll be using the base code from Chapter 2 of Text Minint with R (https://www.tidytextmining.com/sentiment.html) to learn how to do some basic sentiment analysis with provided lexicons. Base code #gather the needed libraries, lexicons, and Jane Austen books as our corpus library(tidytext) library(textdata) librar...

2345 sym R (2501 sym/12 pcs) 1 img

605 Final Project

15.05.2021

Libraries library(tidyverse) library(ggcorrplot) library(matrixcalc) library(MASS) library(stats) Part 1 Using R, generate a random variable X that has 10,000 random uniform numbers from 1 to N, where N can be any number of your choosing greater than or equal to 6. Then generate a random variable Y that has 10,000 random normal numbers with...

8953 sym R (25373 sym/61 pcs) 10 img

hw1_time-series-graphics

05.02.2022

Exercises from Section 2 of Forecasting: Principles & Practice at https://otexts.com/fpp3/graphics-exercises.html Exercise 2.1 Use the help function to explore what the series gafa_stock, PBS, vic_elec and pelt represent. #?gafa_stock #?PBS #?vic_elec #?pelt Use autoplot() to plot some of the series in these data sets. autoplot(gafa_stock) ...

4569 sym R (4077 sym/26 pcs) 14 img

hw2_time-series-decomposition

07.02.2022

Exercises from Section 3 of Forecasting: Principles & Practice at https://otexts.com/fpp3/decomposition-exercises.html Exercise 3.1 Consider the GDP information in global_economy. Plot the GDP per capita for each country over time. Which country has the highest GDP per capita? How has this changed over time? For the most recent year of 2016 Mon...

8308 sym R (5451 sym/23 pcs) 17 img

hw2_trees-and-forests

26.03.2022

#prevent conflict with skimr and dlookr options(kableExtra.auto_format = FALSE) library(skimr) library(tidyverse) library(lubridate) library(rpart) #decision tree package rec'd by Practical ML in R textbook library(rpart.plot) #decision tree display package rec'd by Practical ML in R textbook library(randomForest) #for random forest model...

3530 sym R (2959 sym/13 pcs) 2 img 3 tbl

hw6_ARIMA

26.03.2022

Exercises from Section 9 of Forecasting: Principles & Practice at https://otexts.com/fpp3/arima-exercises.html Exercise 9.1 Figure 9.32 shows the ACFs for 36 random numbers, 360 random numbers and 1,000 random numbers. (a) Explain the differences among these figures. Do they all indicate that the data are white noise? Technically yes, since al...

9552 sym R (6863 sym/74 pcs) 30 img

hw5_exponential-smoothing

05.03.2022

Exercises from Section 5 of Forecasting: Principles & Practice at https://otexts.com/fpp3/expsmooth-exercises.html Exercise 8.1 Consider the the number of pigs slaughtered in Victoria, available in the aus_livestock dataset. (a) Use the ETS() function to estimate the equivalent model for simple exponential smoothing. Find the optimal values of...

9248 sym R (8102 sym/50 pcs) 15 img

hw3_forecasting

19.02.2022

Exercises from Section 5 of Forecasting: Principles & Practice at https://otexts.com/fpp3/toolbox-exercises.html Exercise 5.1 Produce forecasts for the following series using whichever of NAIVE(y), SNAIVE(y) or RW(y ~ drift()) is more appropriate in each case: Australian Population (global_economy) A quick plot of the data shows a fairly linea...

9241 sym R (5653 sym/40 pcs) 30 img

hw4_preprocessing

04.03.2022

Exercises from Chapter 3 of textbook Applied Predictive Modeling by Kuhn & Johnson Exercise 3.1 The UC Irvine Machine Learning Repository contains a data set related to glass identification. The data consit of 214 glass samples labeled as one of seven class categories. There are nine predictors, including the refractive index and percentages of...

6955 sym R (1740 sym/10 pcs) 15 img 6 tbl