Publications by Omar Pineda, Calvin Wong, Murali Kunissery

Data 624 HW3

23.02.2020

library(fpp2) Question 6.2 The plastics data set consists of the monthly sales (in thousands) of product A for a plastics manufacturer for five years. a. Plot the time series of sales of product A. Can you identify seasonal fluctuations and/or a trend-cycle? autoplot(plastics) + ggtitle("Sales of product A for a plastics manufacturer") + ...

2244 sym R (1778 sym/9 pcs) 8 img

Data 612 Recommender Project 1

22.02.2020

Description Restaurant Rating Recommender System This is a recommender system with 20 users and their respective ratings on 5 restaurants in New York. It recommends restaurants to users based on other users' ratings. Dataset Step-1: I scraped some data online to get the ratings for the 5 restaurants. Load Data Step-2: Load the necessary libr...

2229 sym R (6171 sym/32 pcs) 2 img 10 tbl

Data 624 HW2

17.02.2020

Question 3.1 For the following series, find an appropriate Box-Cox transformation in order to stabilise the variance. usnetelec usgdp mcopper enplanements library(fpp2) library(ggplot2) Question 3.1.1 autoplot(usnetelec) + ylab("billion kwh") + xlab("Year") + ggtitle("Annual US net electricity generation (billion kwh) for 194...

3421 sym R (2778 sym/34 pcs) 15 img

Data 624 Predictive Analytics - HW1

09.02.2020

Predictive Analytics Exercise Use the help function to explore what the series gold, woolyrnq and gas represent. Use autoplot() to plot each of these in separate plots. What is the frequency of each series? Hint: apply the frequency() function. Use which.max() to spot the outlier in the gold series. Which observation was it? library(fpp2) gol...

3134 sym R (1700 sym/40 pcs) 30 img

Data 624 HW4

01.03.2020

Question 3.1 The UC Irvine Machine Learning Repository contains a data set related to glass identification. The data consist of 214 glass samples labeled as one of seven class categories. There are nine predictors, including the refractive index and percentages of eight elements: Na, Mg, Al, Si, K, Ca, Ba, and Fe. The data can be accessed via: ...

3989 sym R (4080 sym/16 pcs) 9 img

Data 612 Recommender Project 2

06.03.2020

library(recommenderlab) library(reshape2) library(RCurl) library(dplyr) library(ggplot2) library(knitr) library(tidyverse) #Read the Movie data movies.df <- read.csv('movies.csv',header = TRUE, stringsAsFactors = FALSE) kable(head(movies.df)) movieId title genres 1 Toy Story (1995) Adventure|Animation|Children|Comedy|Fantasy 2 Jumanji ...

331 sym R (6941 sym/58 pcs) 5 img 2 tbl

Data 624 HW 6

22.03.2020

Homework 6 Question 8.1 1. Figure 8.31 shows the ACFs for 36 random numbers, 360 random numbers and 1,000 random numbers. Explain the differences among these figures. Do they all indicate that the data are white noise? These pictures show the correlation between different lags of the series (shown on the x-axis). The y-axis (the correlation) has...

3836 sym R (2969 sym/52 pcs) 19 img