Publications by Adam Gersowitz

DATA 607 Assignment 9

28.03.2020

Introduction In this assignment I use the New York Times API, for which I registered, to download some of its data on best-selling books into an R data frame. I chose to look for fiction books that are currently on the New York Times fiction best seller list. First I found the appropriate name for this list, which is “comb...

561 sym R (785 sym/3 pcs)

DATA 606 HW 7

22.03.2020

Working backwards, Part II. (5.24, p. 203) A 90% confidence interval for a population mean is (65, 77). The population distribution is approximately normal and the population standard deviation is unknown. This confidence interval is based on a simple random sample of 25 observations. Calculate the sample mean, the margin of error, and the sampl...

6285 sym R (1226 sym/26 pcs) 4 img
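The confidence-interval exercise quoted in the excerpt above can be worked backwards in a few lines of R. This is a minimal sketch, not the author's solution: the variable names are illustrative, and it assumes a t-interval with n - 1 degrees of freedom because the population standard deviation is unknown.

```r
# Values given in the exercise
lower <- 65
upper <- 77
n     <- 25
conf  <- 0.90

# The sample mean is the midpoint of the interval
sample_mean <- (lower + upper) / 2

# The margin of error is half the interval width
margin <- (upper - lower) / 2

# Critical t value for a two-sided interval (assumes df = n - 1)
t_star <- qt(1 - (1 - conf) / 2, df = n - 1)

# margin = t_star * s / sqrt(n), so solve for s
s <- margin * sqrt(n) / t_star

c(sample_mean = sample_mean, margin = margin, t_star = t_star, s = s)
```

Under these assumptions the sample mean is 71, the margin of error is 6, and the sample standard deviation comes out at roughly 17.5.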

606 Lab 7

22.03.2020

North Carolina births In 2004, the state of North Carolina released a large data set containing information on births recorded in this state. This data set is useful to researchers studying the relation between habits and practices of expectant mothers and the birth of their children. We will work with a random sample of observations from this da...

5939 sym R (7581 sym/56 pcs) 8 img 1 tbl

DATA 607 Assignment 7

09.03.2020

Overview Assignment 7 focuses on importing HTML, XML, and JSON tables into R and noting the differences between the three structures. library(XML) library(httr) library(RCurl) ## Loading required package: bitops library(rvest) ## Loading required package: xml2 ## ## Attaching package: 'rvest' ## The following object is masked from 'package...

666 sym R (1510 sym/16 pcs)

DATA 607 Project 2

07.03.2020

Overview This project focuses on bringing in different datasets and tidying, cleaning, and analyzing them. Most of these datasets are wide and need to be converted to a long format. Dataset 1 NFL Play by Play My first dataset is from my own discussion post. This dataset contains wide-format NFL play-by-play data. library (readr) libra...

2541 sym R (10107 sym/13 pcs)
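The wide-to-long conversion mentioned in the Project 2 excerpt above might look like the following. This is only a sketch: the data frame `wide` and its columns are hypothetical stand-ins rather than the NFL play-by-play data, and base R's `reshape()` is used here so that no extra packages are needed (the project itself may well use other tools).

```r
# Hypothetical wide table: one row per team, one column per quarter
wide <- data.frame(
  team = c("A", "B"),
  q1   = c(7, 3),
  q2   = c(0, 10),
  q3   = c(14, 7),
  stringsAsFactors = FALSE
)

# Convert to long format: one row per team-quarter pair
long <- reshape(
  wide,
  direction = "long",
  varying   = c("q1", "q2", "q3"),  # columns to stack
  v.names   = "points",             # name of the stacked value column
  timevar   = "quarter",            # name of the new key column
  times     = c("q1", "q2", "q3"),  # key values, one per stacked column
  idvar     = "team"
)
rownames(long) <- NULL

long
```

The long table has one observation per row, which is the layout that grouping and summarising functions generally expect.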

DATA 607 Assignment 5

23.02.2020

Introduction This assignment focuses on tidying and transforming data for analysis. Before being transformed, this data would be difficult to analyze and work with because of its format and inconsistencies. Importing the Data I start by bringing in the .csv file from a GitHub repository and making sure “air” is a data frame. I make sure to co...

2758 sym R (4303 sym/22 pcs)

DATA 607 Assignment 3

08.02.2020

Overview The Week 3 assignment works with various data sets to practice data manipulation and processing. Question 1 Using the 173 majors listed in fivethirtyeight.com’s College Majors dataset [https://fivethirtyeight.com/features/the-economic-guide-to-picking-a-college-major/], provide code that identifies the majors that contain either ...

1909 sym R (1450 sym/6 pcs)

DATA 607 Assignment 1

31.01.2020

Overview The article analyzed in this file covers the fight songs of colleges and universities across the US. The data include characteristics such as the school, the writers, and attributes of the song. Link https://projects.fivethirtyeight.com/college-fight-song-lyrics/ library (readr) library(RCurl) x <- getURL("https://raw.githubus...

1077 sym R (22670 sym/2 pcs)

R HW 3 project

12.01.2020

#Meaningful question for analysis. Which flowers should be planted to attract pollinators from a large distance? ##1. Data Exploration: This should include summary statistics, means, medians, quartiles, or any other relevant information about the data set. Please include some conclusions in the R Markdown text. iris <- read.csv(file="iris.csv", he...

1800 sym R (24232 sym/28 pcs) 4 img

Bridge R HW 2

08.01.2020

#Please select one, download it and perform the following tasks: #1. Use the summary function to gain an overview of the data set. Then display the mean and #median for at least two attributes. iris <- read.csv(file="iris.csv", header=TRUE, sep=",") summary(iris) ## X Sepal.Length Sepal.Width Petal.Length ## ...

5 sym R (31994 sym/45 pcs)
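The first task in the Bridge R HW 2 excerpt above (an overview with `summary()`, then the mean and median of at least two attributes) can be sketched as follows. This assumes R's built-in `iris` dataset in place of the `iris.csv` file the excerpt reads, and the choice of `Sepal.Length` and `Petal.Length` is illustrative.

```r
# Assumption: use the iris data shipped with R instead of iris.csv
data(iris)

# Overview of every column
summary(iris)

# Mean and median for two numeric attributes
stats <- sapply(
  iris[c("Sepal.Length", "Petal.Length")],
  function(x) c(mean = mean(x), median = median(x))
)

stats
```

`stats` is a small matrix with one column per attribute and rows named `mean` and `median`, so a single value can be read with, for example, `stats["median", "Sepal.Length"]`.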