Publications by Kory Martin

DATA 607 - Tidyverse Create Assignment

15.04.2023

Introduction For this project, we are using the Netflix TV Shows and Movies dataset from Kaggle. Per the description provided on the website, “[t]his data set was created to list all shows available on Netflix streaming, and analyze the data to find interesting facts. This data was acquired in July 2022 containing data available in the United Sta...

4197 sym Python (9817 sym/17 pcs)

DATA 607 - Week #10 Assignment

03.04.2023

Introduction This assignment is focused on conducting a sentiment analysis of a text corpus using the same methodology and overall codebase introduced in Chapter 2: Sentiment Analysis with Tidy Data of the Text Mining with R text. We extend this code by doing the following: a) Working with a different corpus of our choosing; and b) Incorporating at...

1414 sym R (6611 sym/36 pcs) 4 img

DATA 607 - Week #9 Assignment

26.03.2023

Introduction For this project, I focused on importing data from the “Most Popular” API provided by the New York Times. The data set used in this assignment, looks at the most popular articles over the past 30-days based on the number of times they were emailed, shared, or viewed. In addition to a number of pre-processing steps, for this project...

1814 sym R (6432 sym/20 pcs) 2 img 5 tbl

DATA 607 - Project #3 (Team 5)

19.03.2023

Introduction For this project we focused on answering the question “What are the most valued data science skills?”. Our method for answering this question was to collect data from a number of Data Science and Analysis related job postings and query that data to try and identify the skills that are the most prominent across these postings. The o...

8430 sym R (25043 sym/48 pcs) 2 img 9 tbl

DATA 607 - Week 7 Assignment

12.03.2023

Week 7 Assignment - Working with XML and JSON in R For this assignment, I began by picking three of my favorite books related to one of my favorite topics, Marketing. For each book, I created a data structure that included the following: Title Subtitle Author(s) Publisher Subject(s) I created the data structures to hold this information in the fo...

2782 sym R (1609 sym/8 pcs) 4 tbl

DATA 607 - Assignment #5

26.02.2023

Assignment - Tidying and Transforming Data This assignment is focused on demonstrating a knowledge and ability to work with untidy data by doing the necessary transformations to the data to convert it into Tidy data that is then more functionally capable of being used for data analysis purposes. The main steps in this assignment include: (1) Create...

4006 sym R (14415 sym/37 pcs) 3 img

DATA 605 - Assignment #4

19.02.2023

Assignment #4 With the attached data file, build and visualize eigenimagery that accounts for 80% of the variability. Provide full R code and discussion. Overview For this assignment, we are going to take a bunch of image files and find the set of images that account for ~80% of the variability of the entire image. The methodology used in this ass...

3397 sym 1 img

DATA607 - Project #1

18.02.2023

Create Player DF My initial focus is on importing the text file and then storing the relevant data into a dataframe that can be used for additional pre-processing data_string = readLines(file_name, warn = F) cols = c('pair_num', 'player_name','player_points', 'player_state', 'pre_rating', 'round1_opponent', 'round2_opponent', 'round3_oppo...

1227 sym 1 tbl

DATA 607 - Assignment #3

11.02.2023

Question #1 Using the 173 majors listed in fivethirtyeight.com’s College Majors dataset [https://fivethirtyeight.com/features/the-economic-guide-to-picking-a-college-major/], provide code that identifies the majors that contain either “DATA” or “STATISTICS” college_majors = read_csv('https://raw.githubusercontent.com/fivethirtyeight/data/...

1944 sym

DATA 605 - Week 1 Assignment

30.01.2023

Overview This article from FiveThirtyEight highlights how people of color made up only 28% of the general election candidates in the 2022 election, while as a group they make up 41% of the general population. Additionally, the article highlights how this disparity among candidate representation continues to be more stark within the Republican par...

2090 sym R (13002 sym/43 pcs)