Publications by Joe Connolly

Data 607 Assignment #9

12.04.2021

library(jsonlite)
library(tidyverse)
## -- Attaching packages --------------------------- tidyverse 1.3.0 --
## v ggplot2 3.3.2     v purrr   0.3.4
## v tibble  3.0.3     v dplyr   1.0.0
## v tidyr   1.1.0     v stringr 1.4.0
## v readr   1.3.1     v forcats 0.5.0
## -- Conflicts ------------------------------ tidyverse_conflicts() --
## x d...

511 sym R (1441 sym/10 pcs) 3 tbl

Data 607 Project #3

05.04.2021

Main Title Assignment Requirements This is a project for your entire class section to work on together, since being able to work effectively on a virtual team is a key “soft skill” for data scientists. Please note especially the requirement about making a presentation during our first meetup after the project is due. W. Edwards Deming sa...

7673 sym R (13505 sym/57 pcs) 10 img 5 tbl

Data 606 Lab 5b

16.03.2021

If you have access to data on an entire population, say the opinion of every adult in the United States on whether or not they think climate change is affecting their local community, it’s straightforward to answer questions like, “What percent of US adults think climate change is affecting their local community?”. Similarly, if you had dem...

11212 sym R (1531 sym/12 pcs) 1 img 1 tbl

Data 606 Hw #5

15.03.2021

Heights of adults. (7.7, p. 260) Researchers studying anthropometry collected body girth measurements and skeletal diameter measurements, as well as age, weight, height and gender, for 507 physically active individuals. The histogram below shows the sample distribution of heights in centimeters. What is the point estimate for the average height...

10222 sym R (525 sym/5 pcs) 5 img

Data 607 Project 2, PT 2

14.03.2021

A dataset from the Bureau of Land Management of uranium mines in Colorado and Utah from the early-to-mid 90’s
library(tidyr)
library(dplyr)
##
## Attaching package: 'dplyr'
## The following objects are masked from 'package:stats':
##
##     filter, lag
## The following objects are masked from 'package:base':
##
##     intersect, setdiff, s...

785 sym R (5626 sym/20 pcs)

Data 607 Project 2, PT 3

14.03.2021

nk <- read.csv("https://raw.githubusercontent.com/jconno/NK-food/main/North%20Korea%20food.csv")
str(names(nk))
## chr [1:40] "ï..Year" "Area.Harvested..1000.Ha" "Yield.MT...Ha" ...
nk <- rename(nk,c(Year = ï..Year, Harvested_Area = Area.Harvested..1000.Ha, Yield = Yield.MT...Ha, Amount_Produced = Production.Quantity..1000.MT, Amount_Imported ...

228 sym R (5858 sym/21 pcs) 2 img

Data 606 March 10th Binomial Presentation

10.03.2021

4.17; Underage drinking pt. 1: Data collected by the Substance Abuse and Mental Health Services Administration (SAMHSA) suggests that 69.7% of 18-20 year olds consumed alcoholic beverages in any given year. Suppose a random sample of ten 18-20 year olds is taken. Is the use of the binomial distribution appropriate for calculating the probabil...

2052 sym R (733 sym/10 pcs)

Data 606 Lab #4

08.03.2021

library(tidyverse)
## Warning: package 'tidyverse' was built under R version 4.0.3
library(openintro)
## Warning: package 'openintro' was built under R version 4.0.3
## Warning: package 'airports' was built under R version 4.0.3
## Warning: package 'cherryblossom' was built under R version 4.0.3
## Warning: package 'usdata' was built under R vers...

20190 sym R (11827 sym/72 pcs) 25 img

Squirrels in Central Park--How Nuts! Data 607 Hw #5

08.03.2021

library(tidyverse)
## Warning: package 'tidyverse' was built under R version 4.0.3
## -- Attaching packages ------------------------------------------ tidyverse 1.3.0 --
## v ggplot2 3.3.2     v purrr   0.3.4
## v tibble  3.0.3     v dplyr   1.0.0
## v tidyr   1.1.0     v stringr 1.4.0
## v readr   1.3.1     v forcats 0.5.0
## -- Conflicts ---...

659 sym R (3501 sym/21 pcs)

Data 606 HW #4

08.03.2021

Area under the curve, Part I. (4.1, p. 142) What percent of a standard normal distribution \(N(\mu=0, \sigma=1)\) is found in each region? Be sure to draw a graph.
Citation for drawing area under curve: https://www.youtube.com/watch?v=HtXeTsx9ka0
\(Z < -1.35\)
mean <- 0
std <- 1
z <- -1.35
# Solving for x:
x <- z*std + mean
x
## [1] -1.3...

7144 sym R (2588 sym/60 pcs) 6 img