Publications by Leo Yi & Christopher Bloome

Gapminder Analysis

13.05.2020

Introduction: We want to research basic education and its impact on different aspects of society. There’s a common understanding that education is beneficial for society and that, as people become more educated, society becomes more civilized. This is paired with the idea that when evaluating a scale of animal instincts to conscious...

17646 sym R (34326 sym/104 pcs) 37 img

Usual Source of Care and Healthcare Costs

12.05.2020

Introduction: According to the Centers for Medicare & Medicaid Services, healthcare spending in the United States totals over 15% of our GDP. As a result of this total and the lack of price sensitivity from the consumer, any impact we can have on healthcare spending, whether it is maintaining the health of our population or removing waste in ou...

14741 sym R (10341 sym/47 pcs) 11 img

Bloome TidyverseExtend

19.04.2020

library(tidyverse)
## -- Attaching packages -------------------------------------------------------------------- tidyverse 1.3.0 --
## v ggplot2 3.2.1     v purrr   0.3.3
## v tibble  2.1.3     v dplyr   0.8.4
## v tidyr   1.0.2     v stringr 1.4.0
## v readr   1.3.1     v forcats 0.4.0
## -- Conflicts ----------------------------------------...

2364 sym R (4118 sym/29 pcs) 3 img

Christopher Bloome Tidyverse Lubridate

13.04.2020

Introduction: I have always struggled when dealing with time, dates, and durations in most computer programs. There does not seem to be an intuitive way to represent the idiosyncrasies we are so accustomed to in these concepts: from the fact that duration and time of day appear similar, to time zones, leap years, and even our “sexagesimal” syste...

2666 sym R (3248 sym/18 pcs)

607 Project 3

22.03.2020

Introduction: In this project we are tasked with determining the most valued skills for a data scientist. To do this, we decided to scrape Indeed.com for data scientist job postings. After scraping this information, we transformed and analyzed the data to see which skills occur most often in the Indeed data set. W...

1479 sym R (9759 sym/18 pcs) 3 img

Project 2

08.03.2020

Overview: In this project, we were tasked with choosing 3 data sets from a set aggregated by our peers in the preceding week and running an analysis proposed in our weekly discussion. I found that some of these asks were not specific, so we took liberties when outlining our objectives. The module focused on using the tidyr and dplyr packages...

8408 sym R (9882 sym/23 pcs)

607W5

01.03.2020

Introduction: The chart above describes arrival delays for two airlines across five destinations. Your task is to: (1) Create a .CSV file that includes all of the information above. You’re encouraged to use a “wide” structure similar to how the information appears above, so that you can practice tidying and transformations as described below...

1817 sym R (3983 sym/15 pcs)

607 Project 1 Bloome

23.02.2020

Introduction: In this project, you’re given a text file with chess tournament results where the information has some structure. Your job is to create an R Markdown file that generates a .CSV file (that could for example be imported into a SQL database) with the following information for all of the players: Player’s Name, Player’s State, Tota...

5573 sym R (7396 sym/15 pcs)

607W1 - Women's World Cup Win Probability

02.02.2020

Introduction: In 2019, the American Women’s National Soccer Team entered the World Cup as one of two favorites: host nation France and the USA were both seen as likely eventual champions. The team at FiveThirtyEight tracked the probability that each of the 24 Women’s World Cup teams would advance to each round of the tournament. They ac...

3186 sym R (589 sym/5 pcs) 1 tbl

607W2

09.02.2020

Introduction: In this study, I was tasked with interviewing 5 individuals and having them rate 6 recent movies on a scale of 1-5. I then uploaded these responses into a MySQL database and imported them into R. Movie Selection: I wanted to pick movies that individuals were likely to have seen, but also movies which lend themselves to ...

3368 sym R (880 sym/4 pcs) 3 tbl