Publications by Jay Lee
606lab2 Jay Lee
Some define statistics as the field that focuses on turning information into knowledge. The first step in that process is to summarize and describe the raw information – the data. In this lab we explore flights, specifically a random sample of domestic flights that departed from the three major New York City airports in 2013. We will generate s...
11300 sym R (7252 sym/32 pcs) 9 img
607 week2 assignment
Summary Watching movie is one of the best thing to do during our free time, and Netflix may become your best friends after Covid-19 hit. I reach out to my friends and ask them to rate 6 of the movies on survey monkey. Let’s see what movie they like, and what movie they are not interested much! Preparing the data Using Survey Monkey For this p...
1937 sym R (4395 sym/26 pcs) 5 img
607assignemt3
Question1 inserting the data major_list_url<- "https://raw.githubusercontent.com/fivethirtyeight/data/master/college-majors/majors-list.csv" majors <-read.csv(major_list_url) pulling the data You can also embed plots, for example: ## FOD1P Major Major_Category ## 44 6212 MANAGEMENT INFORMATIO...
920 sym R (2522 sym/22 pcs)
lab4
In this lab, you’ll investigate the probability distribution that is most central to statistics: the normal distribution. If you are confident that your data are nearly normal, that opens the door to many powerful statistical methods. Here we’ll use the graphical tools of R to assess the normality of our data and also learn how to generate ra...
9664 sym R (2861 sym/29 pcs) 11 img
project2 - 1
Upload the file to Github and read the file from Github covid_url<- "https://raw.githubusercontent.com/jayleecunysps/AssignmentforSPS/main/United_States_COVID-19_Cases_and_Deaths_by_State_over_Time.csv" covid_url <-read.csv(covid_url) rawcovid <- covid_url Clean, select and join the data covid_url[is.na(covid_url)] = 0 covid_url$submission_date ...
1505 sym R (9301 sym/18 pcs)
project2 - 3
Upload the file to Github and read the file from Github Ramendata<- "https://raw.githubusercontent.com/jayleecunysps/AssignmentforSPS/main/ramen-ratings.csv" Ramendata <-read.csv(Ramendata) Clean the data, selet and join Ramendata <- Ramendata[!(Ramendata$Stars=="Unrated"),] Ramendata$Stars <-as.numeric(Ramendata$Stars) Ramendata <- Ramendata %>...
310 sym R (2347 sym/18 pcs)
project2 - 2
Upload the file to Github and read the file from Github tv_url<- "https://raw.githubusercontent.com/jayleecunysps/AssignmentforSPS/main/Dataset_Superhero-TV-Shows.csv" tvdata <-read.csv(tv_url) Clean the data, selet and join tvworkingdata <- tvdata %>% select("show_title","imdb_rating","release_year","runtime","genre","parental_guideline") tv...
1277 sym R (15480 sym/11 pcs)
606lab5
In this lab, you will investigate the ways in which the statistics from a random sample of data can serve as point estimates for population parameters. We’re interested in formulating a sampling distribution of our estimate in order to learn about the properties of the estimate, such as its distribution. Setting a seed: We will take some rando...
11836 sym R (5924 sym/35 pcs) 4 img
Document
Introduction In recent years, we have heard hate crimes more often from the media like New York Times. Hate Crimes are offenses that are motivated by a particular race, religion, ethnicity, gender, age, disability, ancestry, national origin or sexual orientation. Rising hate crimes may be due to “the new normal” that was caused by the pandemi...
6544 sym R (16573 sym/54 pcs) 29 img
Document
Part 1 - Introduction In recent couple years, we hear hate crimes more often from the media. Hate Crimes are the offenses that motivated by a particular race, religion, ethnicity, gender, age, disability, ancestry, national origin or sexual orientation. it may due to “the new normal” that caused by pandemic. Pandemic also cause a lot of peopl...
3301 sym R (3281 sym/10 pcs) 8 img