Publications by Alice Ding
Project 2, Dataset 1
Overview For this dataset, I’ll be using the one I found and it represents test score data. This data has one row per student and includes the following columns: ID Name Phone Sex and age Test number Term 1 Term 2 Term 3 student_data <- read.csv("https://gist.githubusercontent.com/Kimmirikwa/b69d0ea134820ea52f8481991ffae93e/raw/4db7b1698035ee29...
2766 sym
Project 2, Dataset 2
Overview For this dataset, I’ll be using the first one posted by Waheeb and it represents sales data for different product lines based on a specific date. This data has one row per date and includes the following columns: Date Product Line 1 Product Line 2 Product Line 3 The last three columns are dollar values. product_data <- read.csv("https:...
2231 sym 1 img
Project 2, Dataset 3
Overview For this dataset, I’ll be using the first one posted by Farhana and it holds vote counts for two states. This data has one row per political candidate and includes the following columns: Candidate CA FL The last 2 columns are vote counts for those states. vote_data <- read.csv("https://raw.githubusercontent.com/addsding/data607/main/pr...
1407 sym
Assignment 5
Overview This data set includes information from two airlines: ALASKA and AM WEST. These two airlines have 6 destinations that they operate with and this data includes a count of flights that were on time and delayed for each of the 6 cities. data <- read.csv("https://raw.githubusercontent.com/addsding/data607/main/assignment5/data.csv") head(data)...
2809 sym Python (2259 sym/7 pcs) 2 img
Project 1
Overview The point of this project is to take a text file formatted in a certain way, mold the data into a .csv, and do some calculations in order to get certain aggregations of data for each row. The necessary fields in this .csv are: Player’s Name Player’s State Total Number of Points Player’s Pre-Rating Average Pre-Chess Rating of Opponen...
4028 sym Python (6100 sym/25 pcs) 2 img
Extra Credit
Overview weather_data <- read.csv("london_weather.csv") glimpse(weather_data) ## Rows: 15,341 ## Columns: 10 ## $ date <int> 19790101, 19790102, 19790103, 19790104, 19790105, 197… ## $ cloud_cover <dbl> 2, 6, 5, 8, 6, 5, 8, 8, 4, 7, 1, 3, 1, 7, NA, 8, 8, 8… ## $ sunshine <dbl> 7.0, 1.7, 0.0, 0.0, 2.0, 3.8, 0.0, 0.1, 5.8...
4380 sym
Assignment 3
Question 1 Using the 173 majors listed in fivethirtyeight.com’s College Majors dataset [https://fivethirtyeight.com/features/the-economic-guide-to-picking-a-college-major/], provide code that identifies the majors that contain either “DATA” or “STATISTICS” majors <- read.csv("https://raw.githubusercontent.com/fivethirtyeight/data/master/c...
3146 sym R (5700 sym/52 pcs)
DATA607: Assignment 1
Overview This data set contains one row per match for the 2022 World Cup. The specific data includes the chance that each team will win, lose or tie every one of their matches, and each team’s SPI (soccer performance index) as well as a projected score. The table also holds information regarding non-shot expected goals (xG) and then adjusted fo...
1664 sym R (4242 sym/7 pcs)