Publications by Kenan Sooklall

KSooklall_Discussion_11-DATA607

24.04.2021

Recommender System - For this discussion I will be looking at netflix recommender system. Netflix is all about connecting people to the movies and shows that they love. To help customers find those content, they developed a recommendation system so we spend more time on the platform. Scenario Design: Netflix’s model is all about keeping users ...

1277 sym

DATA-607 Project 4

05.05.2021

Animal Crossing is one of the most relaxing and friendly game series in the market today. Animal Crossing ignores many common video game aspects, such as competition and challenge, and let’s you enjoy and interact with your environment and friends. The most recent version New Horizons has sold in 31.18 million copies as of Dec 2020, making it o...

5382 sym R (10763 sym/39 pcs) 12 img

project2

09.07.2021

library(tidyverse) ## ── Attaching packages ─────────────────────────────────────── tidyverse 1.3.1 ── ## ✓ ggplot2 3.3.4 ✓ purrr 0.3.4 ## ✓ tibble 3.1.2 ✓ dplyr 1.0.7 ## ✓ tidyr 1.1.3 ✓ stringr 1.4.0 ## ✓ readr 1.4.0 ✓ f...

665 sym R (6517 sym/38 pcs) 4 img 3 tbl

data624 project 1

23.06.2021

Loading the data path = '/home/kenan/Documents/learning/masters/CUNY-SPS-Masters-DS/DATA_624/project/' df <- readxl::read_excel(paste0(path, 'DATA624_Project1_Data_Schema.xlsx'), sheet='S04', skip=2) df <- df %>% mutate(date=as.Date(df$SeriesInd, origin="1899-12-30")) Splitting the data Splitting the data into training and testing rows = nrow(df...

1025 sym R (3471 sym/26 pcs) 13 img

Data624_hw4

25.06.2021

A chemical manufacturing process for a pharmaceutical product was discussed in Sect. 1.4. In this problem, the objective is to understand the relationship between biological measurements of the raw materials (predictors), measurements of the manufacturing process (predictors), and the response of product yield. Biological predictors cannot be cha...

2571 sym R (6447 sym/12 pcs) 4 img

Document

25.06.2021

Loading the data path = '/home/kenan/Documents/learning/masters/CUNY-SPS-Masters-DS/DATA_624/project/' df <- readxl::read_excel(paste0(path, 'DATA624_Project1_Data_Schema.xlsx'), sheet='S04', skip=2) df <- df %>% mutate(date=as.Date(df$SeriesInd, origin="1899-12-30")) Splitting the data Splitting the data into training and testing rows = nrow(df...

1274 sym R (4143 sym/23 pcs) 12 img

DATA-621 Homework 1

03.09.2021

Data Exploration Load data and rename columns. I removed TEAM in all column names since it wasn’t informative. path = '/home/kenan/Documents/learning/masters/CUNY-SPS-Masters-DS/DATA_621/hw1/' df <- read.csv(paste0(path, 'moneyball-training-data.csv')) %>% select(-INDEX) names(df) <- gsub('TEAM_', '', x=names(df)) The training data has 16 col...

3995 sym R (9658 sym/30 pcs) 4 img

Google Merchandise Store

03.11.2021

The dataset we are analyzing is a Google Merchandise Store (also known as GStore, where Google swag is sold) customer dataset. The raw dataset contains 12 columns to predict the transaction revenue per customer. The outcome from this analysis will aid in better use of marketing budgets. There are 2 datasets, big and small. The small dataset is ab...

8183 sym R (8578 sym/39 pcs) 14 img 5 tbl