Publications by Irene Jacob

DATA 607_Project_1

19.09.2020

Chess Tournament: In this project, you’re given a text file with chess tournament results where the information has some structure. Your job is to create an R Markdown file that generates a .CSV file (which could, for example, be imported into a SQL database) with the following information for all of the players: Player’s Name, Player’s State, ...

2121 sym R (2716 sym/12 pcs) 1 img

DATA 607_Assignment_3

12.09.2020

1. Using the 173 majors listed in fivethirtyeight.com’s College Majors dataset [https://fivethirtyeight.com/features/the-economic-guide-to-picking-a-college-major/], provide code that identifies the majors that contain either “DATA” or “STATISTICS”. Load the data from the CSV file and identify the majors that contain DATA and STATISTICS maj...

1850 sym R (1408 sym/14 pcs)
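
The matching step described in this entry can be sketched in base R as follows. This is a minimal illustration, not the author's code: the `majors` vector is a small hypothetical stand-in for the dataset's major-name column, and nothing is read from the real CSV file.

```r
# Hypothetical stand-in for the major-name column; not loaded from the real CSV.
majors <- c(
  "COMPUTER PROGRAMMING AND DATA PROCESSING",
  "STATISTICS AND DECISION SCIENCE",
  "MECHANICAL ENGINEERING",
  "ART HISTORY AND CRITICISM"
)

# The regular expression "DATA|STATISTICS" matches names containing either word.
# value = TRUE returns the matching strings instead of their positions.
matches <- grep("DATA|STATISTICS", majors, value = TRUE)
print(matches)
```

With real data, the same `grep()` call would be applied to the column read from the CSV file.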

DATA 607_Project_2

03.10.2020

Goal: The goal of this assignment is to give you practice in preparing different datasets for downstream analysis work. Your task is to: Choose any three of the “wide” datasets identified in the Week 6 Discussion items. (You may use your own dataset; please don’t use my Sample Post dataset, since that was used in your Week 6 assignment!) For...

3245 sym R (4746 sym/30 pcs) 6 img

DATA 607_Assignment_7

09.10.2020

Goal: Pick three of your favorite books on one of your favorite subjects. At least one of the books should have more than one author. For each book, include the title, authors, and two or three other attributes that you find interesting. Take the information that you’ve selected about these three books, and separately create three files which st...

1178 sym R (2214 sym/9 pcs)

Data606_Homework 3

17.10.2020

Dice rolls: If you roll a pair of fair dice, what is the probability of getting a sum of 1? It is not possible to get a sum of 1 when a pair of dice is rolled; the minimum possible sum is 2. Getting a sum of 5? It is possible to get a sum of 5 when a pair of dice is rolled. The different possible combinations are: 1 and 4, 2 and 3, 3 and 2, 4 ...

6308 sym R (1115 sym/15 pcs) 2 img
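
The counting argument in this entry can be checked by enumerating all 36 equally likely outcomes of two fair dice. The sketch below is an illustration in base R, not the author's solution; the helper name `prob_of_sum` is made up for this example.

```r
# Enumerate every ordered outcome of two fair six-sided dice (36 rows).
rolls <- expand.grid(die1 = 1:6, die2 = 1:6)
sums <- rolls$die1 + rolls$die2

# Hypothetical helper: share of outcomes whose sum equals the target.
prob_of_sum <- function(target) mean(sums == target)

p1 <- prob_of_sum(1)  # no outcome sums to 1, since the minimum sum is 2
p5 <- prob_of_sum(5)  # four ordered outcomes sum to 5, so this equals 4/36
print(c(p1, p5))
```

Because each ordered pair is equally likely, the proportion of matching rows is the probability itself.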

Data606_Homework 4

17.10.2020

Chapter 4 - Distributions of Random Variables. Area under the curve, Part I: What percent of a standard normal distribution N(μ = 0, σ = 1) is found in each region? Be sure to draw a graph. Z < −1.35: pnorm(-1.35) ## [1] 0.08850799 normalPlot(bounds = c(-5,-1.35)) Z > 1.48: 1-pnorm(1.48) ## [1] 0.06943662 normalPlot(bounds = c(1.48,5)) −0.4...

6321 sym R (1636 sym/51 pcs) 7 img
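
The two tail areas quoted in this entry can be reproduced with base R alone. This sketch leaves out the `normalPlot()` calls, which come from a course package, and uses the `lower.tail` argument of `pnorm()` as an alternative to subtracting from 1. The variable names are chosen for illustration.

```r
# Area to the left of z = -1.35 under the standard normal curve.
p_below <- pnorm(-1.35)

# Area to the right of z = 1.48; lower.tail = FALSE gives the upper tail directly,
# which is equivalent to 1 - pnorm(1.48).
p_above <- pnorm(1.48, lower.tail = FALSE)

print(p_below)  # the entry reports 0.08850799
print(p_above)  # the entry reports 0.06943662
```

Multiplying each value by 100 gives the percentage the question asks for, roughly 8.85% and 6.94%.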

Data606_Homework 5

17.10.2020

Heights of adults: Researchers studying anthropometry collected body girth measurements and skeletal diameter measurements, as well as age, weight, height and gender, for 507 physically active individuals. The histogram below shows the sample distribution of heights in centimeters. What is the point estimate for the average height of active indiv...

7235 sym R (1120 sym/32 pcs) 6 img

Data606_Lab 4

17.10.2020

Lab 4: The data. library(tidyverse) ## Attaching packages (tidyverse 1.3.0): tibble 3.0.4, purrr 0.3.4, tidyr 1.1.2, stringr 1.4.0, readr 1.4.0, forcats 0.5.0 ## Warning: package 'tibble' was built under R version 4.0.3 ## Conflicts ...

3471 sym R (3420 sym/41 pcs) 17 img

DATA 607_Assignment_9

23.10.2020

Goal: The New York Times web site provides a rich set of APIs, as described here. You’ll need to start by signing up for an API key. Your task is to choose one of the New York Times APIs, construct an interface in R to read in the JSON data, and transform it into an R DataFrame. Register for API Key and Retrieve the Data: Read the data having mo...

766 sym R (886 sym/3 pcs)

DATA 607_Tidyverse_CREATE_Assignment

25.10.2020

Goal: In this assignment, you’ll practice collaborating around a code project with GitHub. You could consider our collective work as building out a book of examples on how to use TidyVerse functions. GitHub repository: https://github.com/acatlin/FALL2020TIDYVERSE Your task here is to Create an Example. Using one or more TidyVerse packages, and a...

2786 sym R (858 sym/5 pcs)