Publications by Alice Ding

Project 2, Dataset 1

28.02.2023

Overview For this dataset, I’ll be using the one I found and it represents test score data. This data has one row per student and includes the following columns: ID Name Phone Sex and age Test number Term 1 Term 2 Term 3 student_data <- read.csv("https://gist.githubusercontent.com/Kimmirikwa/b69d0ea134820ea52f8481991ffae93e/raw/4db7b1698035ee29...

2766 sym

Project 2, Dataset 2

28.02.2023

Overview For this dataset, I’ll be using the first one posted by Waheeb and it represents sales data for different product lines based on a specific date. This data has one row per date and includes the following columns: Date Product Line 1 Product Line 2 Product Line 3 The last three columns are dollar values. product_data <- read.csv("https:...

2231 sym 1 img

Project 2, Dataset 3

28.02.2023

Overview For this dataset, I’ll be using the first one posted by Farhana and it holds vote counts for two states. This data has one row per political candidate and includes the following columns: Candidate CA FL The last 2 columns are vote counts for those states. vote_data <- read.csv("https://raw.githubusercontent.com/addsding/data607/main/pr...

1407 sym

Assignment 5

21.02.2023

Overview This data set includes information from two airlines: ALASKA and AM WEST. These two airlines have 6 destinations that they operate with and this data includes a count of flights that were on time and delayed for each of the 6 cities. data <- read.csv("https://raw.githubusercontent.com/addsding/data607/main/assignment5/data.csv") head(data)...

2809 sym Python (2259 sym/7 pcs) 2 img

Project 1

14.02.2023

Overview The point of this project is to take a text file formatted in a certain way, mold the data into a .csv, and do some calculations in order to get certain aggregations of data for each row. The necessary fields in this .csv are: Player’s Name Player’s State Total Number of Points Player’s Pre-Rating Average Pre-Chess Rating of Opponen...

4028 sym Python (6100 sym/25 pcs) 2 img

Extra Credit

09.02.2023

Overview weather_data <- read.csv("london_weather.csv") glimpse(weather_data) ## Rows: 15,341 ## Columns: 10 ## $ date <int> 19790101, 19790102, 19790103, 19790104, 19790105, 197… ## $ cloud_cover <dbl> 2, 6, 5, 8, 6, 5, 8, 8, 4, 7, 1, 3, 1, 7, NA, 8, 8, 8… ## $ sunshine <dbl> 7.0, 1.7, 0.0, 0.0, 2.0, 3.8, 0.0, 0.1, 5.8...

4380 sym

Assignment 3

09.02.2023

Question 1 Using the 173 majors listed in fivethirtyeight.com’s College Majors dataset [https://fivethirtyeight.com/features/the-economic-guide-to-picking-a-college-major/], provide code that identifies the majors that contain either “DATA” or “STATISTICS” majors <- read.csv("https://raw.githubusercontent.com/fivethirtyeight/data/master/c...

3146 sym R (5700 sym/52 pcs)

DATA607: Assignment 1

26.01.2023

Overview This data set contains one row per match for the 2022 World Cup. The specific data includes the chance that each team will win, lose or tie every one of their matches, and each team’s SPI (soccer performance index) as well as a projected score. The table also holds information regarding non-shot expected goals (xG) and then adjusted fo...

1664 sym R (4242 sym/7 pcs)