Publications by Leo Yi & Christopher Bloome
Data607 Assignment 7
Working with XML and JSON in R This assignment requires us to create objects in html, xml, and json and then load them into separate dataframes to compare. I chose the subject of woodworking, and selected the top 3 books when searched on amazon. I created the html, xml, and json in text files and saved them and uploaded them to the web. Lets load...
732 sym R (3362 sym/15 pcs)
Data607 Project 2
Intro For this project, we’ll be taking 3 data sources referenced from recent discussion topics and analyzing them. We’ll need to tidy the datasets, then take a brief dive into what kind of story the data wants to tell us. We’ll be using a few pages that we’ll load first: library(tidyr) library(scales) library(dplyr) library(ggplot2) ...
4893 sym R (15547 sym/28 pcs) 11 img
Data607 Lab 4 - Normal Distribution
In this lab we’ll investigate the probability distribution that is most central to statistics: the normal distribution. If we are confident that our data are nearly normal, that opens the door to many powerful statistical methods. Here we’ll use the graphical tools of R to assess the normality of our data and also learn how to generate random...
11916 sym R (2838 sym/40 pcs) 11 img
Data607 Assignment 5
Intro This week’s assignment is to take a table of flight delays, enter that information into a csv file, then tidy, transform, and analyze the information in R. The information we’ll be looking at today looks like counts of flights that were either on time or delayed for two airlines. The information is available by arrival airport location....
3019 sym R (4771 sym/17 pcs) 5 img
Data606 Lab 3
Hot Hands Basketball players who make several baskets in succession are described as having a hot hand. Fans and players have long believed in the hot hand phenomenon, which refutes the assumption that each shot is independent of the next. However, a 1985 paper by Gilovich, Vallone, and Tversky collected evidence that contradicted this belief and...
11740 sym R (4326 sym/30 pcs) 3 img
Data606 Presentation
Stats scores. (2.33, p. 78) Below are the final exam scores of twenty introductory statistics students. 57, 66, 69, 71, 72, 73, 74, 77, 78, 78, 79, 79, 81, 81, 82, 83, 83, 88, 89, 94 Create a box plot of the distribution of these scores. The five number summary provided below may be useful. #inner quartile range = q3 - q1 (iqr <- 82.5 - 72.5) #...
300 sym R (763 sym/13 pcs) 3 img
Data607 Assignment 2
Movie Survey This assignment is a survey of 5 individuals. Each individual was asked to rate the following movies on a scale of 1 to 5. * Joker * It 2 * Parasite * Ready or Not * Avengers Endgame * Star Wars: Return of the Jedi The survey results were stored in a SQL table in pgAdmin4 using postgres. You can see the SQL code here. The table was c...
960 sym R (2471 sym/15 pcs) 2 img
Data607 Presentation
Introduction Business Intelligence Analyst at OnDeck Data Reduction One way to analyze data Summary of a larger dataset Easier to digest Fewer rows See results rather than data points Less detailed Do You Know Where America Stands On Guns? This is a FiveThirtyEight project found here The article asks you to guess the level of support for dif...
1531 sym R (5704 sym/10 pcs)
Assignment 1
Steak Survey This dataset is a survey of 550 Americans and tests for individual risk behaviors as well as steak preference. The article below summarizes the results of the information, where no correlation is found. The article can be found here: https://fivethirtyeight.com/features/how-americans-like-their-steak/ As part of this assignment, we�...
1464 sym R (1053 sym/9 pcs) 1 img
Data607 Assignment Week 3
#1. Using the 173 majors listed in fivethirtyeight.com’s College Majors dataset, provide code that identifies the majors that contain either “DATA” or “STATISTICS” majors <- read.csv(url('https://raw.githubusercontent.com/fivethirtyeight/data/master/college-majors/majors-list.csv'), stringsAsFactors = F) str(majors) ## 'data.frame': ...
1753 sym R (1370 sym/8 pcs)