Publications by Gabriella Martinez
DATA622 Final
Final Project Exploratory analysis and essay Choose a dataset. You get to decide which dataset you want to work on. The data set must be different from the ones used in previous homeworks You can work on a problem from your job, or something you are interested in. You may also obtain a dataset from sites such as Kaggle, Data.Gov, Census Bureau, US...
16120 sym Python (37849 sym/91 pcs) 11 img 15 tbl
DATA622 HW3
Assignment 3 Pre-work Read the following articles: https://www.hindawi.com/journals/complexity/2021/5550344/ https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8137961/ Search for academic content (at least 3 articles) that compare the use of decision trees vs SVMs in your current area of expertise. Perform an analysis of the dataset used in Homework #...
10932 sym 3 img
DATA622 HW2
Assignment 2 Pre-work 1. Read this blog, which shows some of the issues with decision trees 2. Choose a dataset from a source in Assignment #1, or another dataset of your choice. Assignment work 1. Based on the latest topics presented, choose a dataset of your choice and create a Decision Tree where you can solve a classification or regression p...
13563 sym Python (17818 sym/66 pcs) 8 img
DATA622 HW1
Assignment 1 Visit the following website and explore the range of sizes of this dataset (from 100 to 5 million records): https://excelbianalytics.com/wp/downloads-18-sample-csv-files-data-sets-for-testing-sales/ or (new) https://www.kaggle.com/datasets Select 2 files to download. Based on your computer’s capabilities (memory, CPU), select 2 file...
1638 sym Python (27499 sym/80 pcs) 4 img
DATA608 Assignment 1
Module 1: Exploratory Data Analysis library(dplyr) library(reactable) library(ggplot2) library(forcats) Principles of Data Visualization and Introduction to ggplot2 I have provided you with data about the 5,000 fastest growing companies in the US, as compiled by Inc. magazine. lets read this in: inc <- read.csv("https://raw.githubusercontent....
4712 sym 5 img
DATA608 Final Project
Unemployment in Red and Blue States The purpose of this project was to explore the relationship between the red and blue states in the last two elections (including each year prior to the election) and unemployment rates and benefit amounts across the U.S.. Shiny App for this project. Packages library(readxl) library(readr) library(rvest) lib...
7212 sym 7 img
DATA 605 HW4
Homework 4 Build and visualize eigenimagery that accounts for 80 percent of the variability. library(foreach) library(jpeg) library(EBImage) library(kableExtra) library(OpenImageR) library(doParallel) Use of Graphics #############Prepare for Image Processing####################### num=17 files=list.files("C:/Users/marcosmartinez689/Docum...
1113 sym R (8534 sym/19 pcs) 4 img
DATA 607 HW 3
1. Provide code that identifies the majors that contain either “DATA” or “STATISTICS” from fivethirtyeight.com’s College Majors dataset. x <- url("https://raw.githubusercontent.com/fivethirtyeight/data/master/college-majors/majors-list.csv") college_majors <- read.csv(x) majors <- as.matrix.data.frame(college_majors) grep(pattern =...
1647 sym R (2287 sym/17 pcs)
DATA 607 HW 2
Week 2 Assignment – Recommend Movies SQL & R Data Collection & Storing Data in a Relational Database For this weeks assignment, I collected the data from from five friends and family members, including our treasured four month old furry friend and asked them for their opinion on six recently watched movies. When it comes to our furry friend, C...
2720 sym R (367 sym/3 pcs) 3 img
DATA 607 HW 1
Super Bowl Advertisements Overview With the super bowl around the corner, the data set “superbowl -ads” from fivethirtyeight https://projects.fivethirtyeight.com/super-bowl-ads/ delves into the top ten brands with the most advertisements run on Super Bowl Sundays from 2000 to 2020. The data for “superbowl-ads” originates from http://www....
1061 sym R (1580 sym/2 pcs) 1 img