Publications by Kenan Sooklall
KSooklall_Discussion_11-DATA607
Recommender System - For this discussion I will be looking at Netflix’s recommender system. Netflix is all about connecting people to the movies and shows that they love. To help customers find that content, they developed a recommendation system so we spend more time on the platform. Scenario Design: Netflix’s model is all about keeping users ...
1277 sym
DATA-607 Project 4
Animal Crossing is one of the most relaxing and friendly game series on the market today. Animal Crossing ignores many common video game aspects, such as competition and challenge, and lets you enjoy and interact with your environment and friends. The most recent version, New Horizons, has sold 31.18 million copies as of Dec 2020, making it o...
5382 sym R (10763 sym/39 pcs) 12 img
project2
library(tidyverse)
## ── Attaching packages ─────────────────────────────────────── tidyverse 1.3.1 ──
## ✓ ggplot2 3.3.4 ✓ purrr 0.3.4
## ✓ tibble 3.1.2 ✓ dplyr 1.0.7
## ✓ tidyr 1.1.3 ✓ stringr 1.4.0
## ✓ readr 1.4.0 ✓ f...
665 sym R (6517 sym/38 pcs) 4 img 3 tbl
data624 project 1
Loading the data
path = '/home/kenan/Documents/learning/masters/CUNY-SPS-Masters-DS/DATA_624/project/'
df <- readxl::read_excel(paste0(path, 'DATA624_Project1_Data_Schema.xlsx'), sheet='S04', skip=2)
df <- df %>% mutate(date=as.Date(df$SeriesInd, origin="1899-12-30"))
Splitting the data
Splitting the data into training and testing
rows = nrow(df...
1025 sym R (3471 sym/26 pcs) 13 img
Data624_hw4
A chemical manufacturing process for a pharmaceutical product was discussed in Sect. 1.4. In this problem, the objective is to understand the relationship between biological measurements of the raw materials (predictors), measurements of the manufacturing process (predictors), and the response of product yield. Biological predictors cannot be cha...
2571 sym R (6447 sym/12 pcs) 4 img
Document
Loading the data
path = '/home/kenan/Documents/learning/masters/CUNY-SPS-Masters-DS/DATA_624/project/'
df <- readxl::read_excel(paste0(path, 'DATA624_Project1_Data_Schema.xlsx'), sheet='S04', skip=2)
df <- df %>% mutate(date=as.Date(df$SeriesInd, origin="1899-12-30"))
Splitting the data
Splitting the data into training and testing
rows = nrow(df...
1274 sym R (4143 sym/23 pcs) 12 img
DATA-621 Homework 1
Data Exploration
Load data and rename columns. I removed the TEAM_ prefix from all column names since it wasn’t informative.
path = '/home/kenan/Documents/learning/masters/CUNY-SPS-Masters-DS/DATA_621/hw1/'
df <- read.csv(paste0(path, 'moneyball-training-data.csv')) %>% select(-INDEX)
names(df) <- gsub('TEAM_', '', x=names(df))
The training data has 16 col...
3995 sym R (9658 sym/30 pcs) 4 img
Google Merchandise Store
The dataset we are analyzing is a Google Merchandise Store (also known as GStore, where Google swag is sold) customer dataset. The raw dataset contains 12 columns to predict the transaction revenue per customer. The outcome from this analysis will aid in better use of marketing budgets. There are 2 datasets, big and small. The small dataset is ab...
8183 sym R (8578 sym/39 pcs) 14 img 5 tbl