Publications by Saeah Go
Project 1 - Kobe Basketball Data
Objectives for this project. R Programming Objectives: practice subsetting; using loops; creating your own functions; working with data frames in R. Statistics Objectives: think about statistical independence; obtain frequencies from categorical data; obtain conditional probabilities with real data; learn how to simulate data under particular assumpti...
9496 sym R (5220 sym/33 pcs) 4 img
Project 2 - Energy Source Data Analysis and Visualization
General Information Import Data My question or hypothesis My base R plot My ggplot plot General Information I used block quotes for my questions (hypotheses) and observations. Between my base R plot and my ggplot plot, I have a code chunk that rearranges the power data, mainly to remove NAs in year_of_capacity_data. But I hide this code chunk by usin...
3213 sym R (2068 sym/3 pcs) 2 img
Week 05 Exercises
In-class Exercise Week 5 Saeah Go Tuesday Lecture: playing with the gapminder data Explore other relationships in the gapminder data using what you learned today. This could mean considering other variables in the data set, using an alternative geometry, or faceting with other variables. Make just one figure, but use effectively as many of the conc...
899 sym R (1323 sym/21 pcs) 2 img
Week 06 Exercises
Tuesday Lecture: Flight problems with dplyr Thursday Lecture Tuesday Lecture: Flight problems with dplyr How many flights to Los Angeles (LAX) did each of the legacy carriers (AA, UA, DL or US) have in May from JFK, and what was their average duration? # your code here flights %>% filter(carrier %in% c("AA", "UA", "DL", "US"), month...
1068 sym R (7117 sym/30 pcs)
Week 07 Exercises
Tuesday Lecture Thursday Lecture Tuesday Lecture Load up datasets from nycflights13 data(flights) data(airports) data(planes) Compute the average delay by destination, then join on the airports data frame so you can show the spatial distribution of delays. You might want to use the size or colour of the points to display the average delay for...
1079 sym R (15226 sym/14 pcs) 5 img
Week 09 Exercises
Tuesday Lecture Thursday Lecture Tuesday Lecture Problem 1 We want to get inspections for coffee shops. Let’s say a coffee shop is anything that has “COFFEE”, “ESPRESSO”, or “ROASTER” in the name. The regex for this is COFFEE|ESPRESSO|ROASTER because | is a metacharacter that means “OR”. Use the str_detect() function, which re...
1485 sym R (6953 sym/47 pcs) 2 img
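The alternation pattern described in the Week 09 excerpt can be illustrated with a minimal sketch. It uses base R's grepl() as a stand-in for stringr's str_detect() so that it runs without extra packages; the shop names and the variable names (shop_names, is_coffee, coffee_shops) are hypothetical and are not taken from the inspection data.

```r
# Hypothetical establishment names, invented for illustration only
shop_names <- c("BLUE BOTTLE COFFEE", "JOE'S PIZZA", "ESPRESSO BAR",
                "CITY ROASTERS", "TEA HOUSE")

# "|" is the regex metacharacter for OR, so this matches any of the three words
pattern <- "COFFEE|ESPRESSO|ROASTER"

# grepl() returns one TRUE/FALSE per name (base R stand-in for str_detect())
is_coffee <- grepl(pattern, shop_names)

# Keep only the names that matched the pattern
coffee_shops <- shop_names[is_coffee]
print(coffee_shops)
```

With stringr loaded, the same logical vector would presumably come from str_detect(shop_names, pattern), which takes the string first and the pattern second.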
Week 10 Exercises
Problem 1 Problem 2 Problem 1 Write expressions in dplyr and in data.table to: calculate the mean arrival delay and number of flights by carrier, sort the resulting table by mean delay in descending order, and keep only those carriers with more than 100 flights input <- "https://raw.githubusercontent.com/Rdatatable/data.table/master/vignettes/...
1183 sym R (2018 sym/7 pcs) 1 img
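The three steps in the Week 10 excerpt (summarise by carrier, filter on flight count, sort by mean delay) can be sketched in base R. This is not the dplyr or data.table answer the exercise asks for, only an outline of the same logic. The toy data frame and the min_flights threshold of 1 are assumptions made so the example runs on six rows; the exercise itself uses a threshold of 100 on the real flights data.

```r
# Hypothetical toy data; the real exercise reads a flights CSV from a URL
flights <- data.frame(
  carrier   = c("AA", "AA", "AA", "UA", "UA", "DL"),
  arr_delay = c(10, 20, 30, 5, 15, 40)
)

# Assumed threshold for the toy data (the exercise uses 100)
min_flights <- 1

# Step 1: mean arrival delay and number of flights per carrier
mean_delay <- tapply(flights$arr_delay, flights$carrier, mean)
n_flights  <- tapply(flights$arr_delay, flights$carrier, length)

by_carrier <- data.frame(
  carrier    = names(mean_delay),
  mean_delay = as.numeric(mean_delay),
  n          = as.integer(n_flights)
)

# Step 2: keep only carriers with more than min_flights flights
by_carrier <- by_carrier[by_carrier$n > min_flights, ]

# Step 3: sort by mean delay, largest first
by_carrier <- by_carrier[order(-by_carrier$mean_delay), ]
print(by_carrier)
```

Filtering before sorting keeps the ordering step on the smaller table; swapping the two steps would give the same result.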
modern regression analysis hw1
Before starting the homework Problem 1 (2.1) Problem 2 (2.2) Problem 3 (2.4) Problem 4 (2.6, no d) Additional Questions Before starting the homework I used block quotes to answer the questions. I tried to type the question first, then write the code for it, and put the result (or answer) at the end. Problem 1 (2.1) The dataset teengamb conc...
6832 sym R (10283 sym/39 pcs) 1 img
modern regression analysis hw2
Before starting the homework Problem 1 (3.1 a, b, d) Problem 2 (3.2 a, b, c, d) Problem 3 (3.7) Before starting the homework I used block quotes to answer the questions. Problem 1 (3.1 a, b, d) For the prostate data, fit a model with lpsa as the response and the other variables as predictors: data(prostate, package="faraway") # load the data ...
8198 sym R (11193 sym/40 pcs) 1 img