Publications by Joe Connolly
Data 607 Project 2, PT I
Data transformations The datasets: (1) NYC Subway Transit (2) BLM (Bureau of Land Management) uranium mines (CO,UT) (3) EPA Air Pollutants (Vertical tidy) Consider other set Cleaning Data set 1: NYC Subway Transit: Entrances and Exits library(tidyr) library(dplyr) ## ## Attaching package: 'dplyr' ## The following objects are masked from 'pac...
774 sym R (2827 sym/22 pcs)
Data 606 Lab 5a
In this lab, you will investigate the ways in which the statistics from a random sample of data can serve as point estimates for population parameters. We’re interested in formulating a sampling distribution of our estimate in order to learn about the properties of the estimate, such as its distribution. Setting a seed: We will take some rando...
12019 sym R (8160 sym/39 pcs) 4 img
Data 606 Hw#6
2010 Healthcare Law. (6.48, p. 248) On June 28, 2012 the U.S. Supreme Court upheld the much debated 2010 healthcare law, declaring it constitutional. A Gallup poll released the day after this decision indicates that 46% of 1,012 Americans agree with this decision. At a 95% confidence level, this sample has a 3% margin of error. Based on this inf...
7158 sym R (2598 sym/26 pcs)
Data 606 Lab # 6
Getting Started Load packages In this lab, we will explore and visualize the data using the tidyverse suite of packages, and perform statistical inference using infer. The data can be found in the companion package for OpenIntro resources, openintro. Let’s load the packages. library(tidyverse) ## Warning: package 'tidyverse' was built under R ...
8414 sym R (3888 sym/36 pcs) 1 img
Data 606 Hw #7
SAT scores. (7.14, p. 261) SAT scores of students at an Ivy League college are distributed with a standard deviation of 250 points. Two statistics students, Raina and Luke, want to estimate the average SAT score of students at this college as part of a class project. They want their margin of error to be no more than 25 points. Raina wants to u...
5137 sym R (1320 sym/24 pcs) 4 img
Data 606 Lab #7
Getting Started Load packages In this lab, we will explore and visualize the data using the tidyverse suite of packages, and perform statistical inference using infer. The data can be found in the companion package for OpenIntro resources, openintro. Let’s load the packages. library(tidyverse) library(openintro) library(infer) The data Ever...
7929 sym R (8504 sym/68 pcs) 5 img
Data 607 Tidyverse Vignette
Requirements Clone the provided repository (1 point) Write a vignette using one TidyVerse package (15 points) Write a vignette using more than one TidyVerse packages (+ 2 points) Make a pull request on the shared repository (1 point) Update the README.md file with your example (2 points) Submit your GitHub handle name & link to Peergrade (1 point...
3324 sym R (4347 sym/31 pcs) 8 img
Data 606 Project Proposal
Research question Is BMI a significant predictor for the cost of insurance? Is it more significant when one is a smoker? Cases There are 1,339 cases of Americans with basic health information, such as age (18-64), number of children per individual, smoking, region of residence within US, and insurance charges. Data collection This data is colle...
2419 sym R (2111 sym/17 pcs) 4 img
Data 606 HW #9
Baby weights, Part I. (9.1, p. 350) The Child Health and Development Studies investigate a range of topics. One study considered all pregnancies between 1960 and 1967 among women in the Kaiser Foundation Health Plan in the San Francisco East Bay area. Here, we study the relationship between smoking and weight of the baby. The variable smoke is c...
7687 sym R (819 sym/17 pcs) 2 img
Data 606 Lab # 9
Grading the professor Many college courses conclude by giving students the opportunity to evaluate the course and the instructor anonymously. However, the use of these student evaluations as an indicator of course quality and teaching effectiveness is often criticized because these measures may reflect the influence of non-teaching related charac...
13082 sym R (13804 sym/41 pcs) 17 img