Publications by Michael Ippolito

CUNY - DATA607 - Project4

03.05.2021

Project 4 - Spam/Ham Classification. Overview: The assignment was to use a set of email messages to build a classifier that separates them into spam and ham. I used a set of messages from Kaggle (Wessel van Lit, https://www.kaggle.com/veleon/ham-and-spam-dataset, 2019). I used a manual Bayesian model to gain a better understanding of how the process ...

SPWG Sentiment Analysis

29.04.2021

Assignment 10 - Tidytext. Scrape data: I wrote a Python script to convert each writer’s docx files into plain text. Then I iterated through each writer’s directory, which contains a folder for each book he or she has written. Each book’s folder holds a series of sections, which roughly correspond to chapters. This code reads those...

CUNY - Data607 - Assignment11

21.04.2021

Assignment 11 - Recommender Systems. Overview: Sweetwater.com is a site that caters to musicians, producers, performers, and studio engineers, offering a huge catalogue of music-related products for both production and performance. My approach to this assignment was as follows: Perform a scenario design analysis. Discern how the recommender syste...
