Publications by George Cruz

DS621 - Final Presentation - Alternative

12.12.2021

PetFinder Adoption Prediction. Authors: George Cruz, Karim Hammoud, Maliat Islam, Gabriella Martinez, Ken Popkin. Date: 2021-12-11. Due: 2021-12-12. Overview: There are millions of stray pets around the world, some of which are fortunate enough to be adopted while many others are not. While adoption of a pet is often the definition of success, the rate at whic...

3462 sym R (2323 sym/3 pcs) 8 img

DS621 - Final Presentation

12.12.2021

Contents: Overview; What to Predict; Data Cleaning and Transformation; Dog Breed Word Cloud; Ordinary Logistic Regression Model; OLR Histogram; OLR Predictions; Binomial Logistic Regression Model; BLR Predictions; Negative Binomial Model; Random Forest, XGBoost; Conclusions; Links and References. Overview: There are millions of stray pets around the world, some of whi...

3663 sym R (2323 sym/3 pcs) 8 img

DS622 - Homework 2

27.04.2022

Assignment 2. Introduction: Based on the latest topics presented, bring a dataset of your choice and create a Decision Tree where you can solve a classification or regression problem and predict the outcome of a particular feature or detail of the data used. From Kaggle: Cardiovascular diseases (CVDs) are the number 1 cause of death globally, taki...

4436 sym R (4930 sym/17 pcs) 4 img 2 tbl

DS622 - Homework 3

09.05.2022

Assignment 3. Perform an analysis of the dataset used in Homework #2 using the SVM algorithm. Compare the results with the results from the previous homework. Introduction: Based on the latest topics presented, bring a dataset of your choice and create a Decision Tree where you can solve a classification or regression problem and predict the outcome of...

6068 sym R (3998 sym/18 pcs) 2 img 2 tbl

DS622 - Homework 1

25.04.2022

Assignment 1. Visit the following website and explore the range of sizes of this dataset (from 100 to 5 million records). https://eforexcel.com/wp/downloads-18-sample-csv-files-data-sets-for-testing-sales/ File Selection: Based on your computer’s capabilities (memory, CPU), select 2 files you can handle (recommended: one small, one large). I picke...

5626 sym R (11214 sym/36 pcs) 6 img