Publications by Diego Correa
DATA607 - Assignment 1
How One High-Risk Community In Rural South Carolina Is Bracing For COVID-19. Introduction: The article, published in April, examines the spread of COVID-19 across high-risk counties and the ICU beds available in them to gauge the potential threat. It highlights a county called Hilton Head, South Carolina, which has 3900 high-risk individuals, 3 hospitals, and 28 ICU ...
DATA607 - Assignment 2
Introduction: Choose six recent popular movies. Ask at least five people that you know (friends, family, classmates, imaginary friends if necessary) to rate each of these movies that they have seen on a scale of 1 to 5. Take the results (observations) and store them in a SQL database of your choosing. Load the information from the SQL database int...
DATA 607 - Project 3
DATA 607 - Data Acquisition and Management, Term Project 3. Create a short document with the names of the group members. You should briefly describe the collaboration tool(s) you’ll use as a group, including for communication, code sharing, and project documentation. You should have identified your data sources, where the data can be found, and how...
DATA 607 - Assignment 7
Introduction: Pick three of your favorite books on one of your favorite subjects. At least one of the books should have more than one author. For each book, include the title, authors, and two or three other attributes that you find interesting. Take the information that you’ve selected about these three books, and separately create three files ...
DATA605 - Assignment 3
Question 1.1
What is the rank of the matrix A?
\[A = \begin{bmatrix} 1 & 2 & 3 & 4\\ -1 & 0 & 1 & 3\\ 0 & 1 & -2 & 1\\ 5 & 4 & -2 & -3 \end{bmatrix}\]
# Enter A column by column; the last entry is -3, matching the matrix in the question
A <- matrix(c(1,-1,0,5,2,0,1,4,3,1,-2,-2,4,3,1,-3), nrow = 4)
A
##      [,1] [,2] [,3] [,4]
## [1,]    1    2    3    4
## [2,]   -1    0    1    3
## [3,]    0    1   -2    1
## [4,] ...
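The excerpt is cut off before the rank is computed. As a minimal sketch, not necessarily the author's method, the rank can be read from base R's QR decomposition. The names rank_A and det_A are illustrative, and the matrix is entered row by row from the question as stated.

```r
# Matrix A entered row by row, exactly as written in the question.
A <- matrix(c( 1, 2,  3,  4,
              -1, 0,  1,  3,
               0, 1, -2,  1,
               5, 4, -2, -3),
            nrow = 4, byrow = TRUE)

# qr() computes a QR decomposition; its $rank component is the
# numerically estimated rank of A.
rank_A <- qr(A)$rank

# Cross-check: a square matrix has full rank exactly when its
# determinant is nonzero.
det_A <- det(A)

print(rank_A)
print(det_A)
```

Because qr() estimates the rank numerically using a tolerance, the determinant check is a useful sanity test for a small matrix like this one.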
DATA607 - Assignment 3
Question 1
Using the 173 majors listed in fivethirtyeight.com’s College Majors dataset [https://fivethirtyeight.com/features/the-economic-guide-to-picking-a-college-major/], provide code that identifies the majors that contain either “DATA” or “STATISTICS”.
# Load the readr library and use it to read the CSV into a data frame
library(readr, quietly ...
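The excerpt ends before the matching step. A minimal sketch of that step, assuming the major names are available as a character vector: the vector majors below is a small illustrative stand-in, not the actual 173-row dataset, and matches is a hypothetical name.

```r
# Illustrative stand-in for the Major column of the dataset (assumption:
# the real data would be read from the CSV into a data frame first).
majors <- c("GENERAL AGRICULTURE",
            "COMPUTER PROGRAMMING AND DATA PROCESSING",
            "STATISTICS AND DECISION SCIENCE",
            "NURSING")

# grepl() returns TRUE for each element matching the regular expression;
# the alternation "DATA|STATISTICS" matches either word.
matches <- majors[grepl("DATA|STATISTICS", majors)]

print(matches)
```

The pattern is case-sensitive, which suits an all-uppercase column; passing ignore.case = TRUE to grepl() would relax that.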
DATA 606 - 2.11
2.11 Air quality. Daily air quality is measured by the air quality index (AQI) reported by the Environmental Protection Agency. This index reports the pollution level and what associated health effects might be a concern. The index is calculated for five major air pollutants regulated by the Clean Air Act and takes values from 0 to 300, where a h...
DATA607 - Project 1
Project Description: In this project, you’re given a text file with chess tournament results where the information has some structure. Your job is to create an R Markdown file that generates a .CSV file (that could for example be imported into a SQL database) with the following information for all of the players: Player’s Name, Player’s Stat...
DATA607 - Assignment 5
Introduction: Create a .CSV file (or optionally, a MySQL database!) that includes all of the information above. You’re encouraged to use a “wide” structure similar to how the information appears above, so that you can practice tidying and transformations as described below. Read the information from your .CSV file into R, and use tidyr and ...
DATA 607 - Project 2
Introduction: The goal of this assignment is to give you practice in preparing different datasets for downstream analysis work. We are asked to choose three datasets from Week 5’s Discussion, transform the data, perform analysis, and draw a conclusion. NYC Subway Ridership from 2013: Cameron Smith provided this example where he suggests to analy...