Publications by Puja Roy

DATA607FinalProject

06.05.2024

Introduction The goal of this project is to perform sentiment analysis on the reviews of Amazon’s Top 100 Bestselling Books. By analyzing the sentiment expressed in these reviews, I aim to gain insights into customer preferences and opinions regarding bestselling books. This analysis will not only provide valuable information for book publish...

15766 sym Python (172761 sym/60 pcs) 6 img

DATA607Project4

29.04.2024

Introduction The objective of this project is to classify emails in a data set as either spam or ham. Spam emails are junk: unsolicited messages that may contain phishing and other harmful links that trick people into exposing sensitive personal information. Ham emails, by contrast, are considered to be intended and safe ...

8925 sym Python (23207 sym/32 pcs)

DATA607EXTENDTidyverse

29.04.2024

Objective The goal of this assignment is to EXTEND a classmate’s CREATE Vignette from the Tidyverse Assignment GitHub Repository. EXTEND Vignette For this assignment, I am going to EXTEND Atta Boateng’s Vignette. By extending the Vignette below, I will use ggplot to plot and customize the color of the plot based on the ope...

7306 sym 10 img

DATA607Week11ExtraCreditRecomSys

17.04.2024

Objective Using the information you collected on movie ratings, implement a Global Baseline Estimate recommendation system in R. Most recommender systems use personalized algorithms like “content management” and “item-item collaborative filtering.” Sometimes non-personalized recommenders are also useful or necessary. One of the best no...

6257 sym Python (5210 sym/15 pcs)

DATA606FinalProjectProposal

08.04.2024

Data Preparation For this final project proposal, I plan to use the Drug Use by Age dataset from FiveThirtyEight https://github.com/fivethirtyeight/data. I chose this dataset because drug addiction affects people across many age groups, and I wanted to explore which ages and what types of drugs certain people ar...

1887 sym R (10547 sym/10 pcs) 2 img

DATA607RecommenderSystems

08.04.2024

New York Times Recommendation Engine Scenario Design Analysis Your task is to analyze an existing recommender system that you find interesting. You should: Perform a Scenario Design analysis as described below. Consider whether it makes sense for your selected recommender system to perform scenario design twice, once for the organization (e.g. ...

9115 sym

DATA607Week10Assignment

01.04.2024

Load Packages
library(tidyverse)
library(openintro)
2.1 The sentiments datasets
#install.packages('tidytext')
#install.packages('textdata')
library(tidytext)
## Warning: package 'tidytext' was built under R version 4.3.3
get_sentiments("afinn")
## # A tibble: 2,477 × 2
##    word    value
##    <chr>   <dbl>
##  1 abandon    -2
##...

11638 sym Python (13718 sym/80 pcs) 9 img

DATA607CREATETidyverse

25.03.2024

Objective The purpose of this assignment is to create an Example by using one or more TidyVerse packages, and any dataset from fivethirtyeight.com or Kaggle, to create a programming sample “vignette” that demonstrates how to use one or more of the capabilities of the selected TidyVerse package with the selected dataset. For this assignment,...

10422 sym 3 img

DATA607Week9

25.03.2024

Objective The New York Times web site provides a rich set of APIs, as described here: https://developer.nytimes.com/apis You’ll need to start by signing up for an API key. The main objective is to choose one of the New York Times APIs, construct an interface in R to read in the JSON data, and transform it into an R DataFrame. I chose to retrie...

4784 sym Python (64797 sym/8 pcs)

DATA608Story4

18.03.2024

library(tidyverse) library(openintro) Exercise 1 Insert any text here. # Insert code for Exercise 1 here Exercise 2 …

499 sym