Publications by Mr. Hern

Level 2 Lab 3.2 Key

29.10.2023

In this lab, we use the AnimalData dataset to continue to explore the measures of center and spread for data distributions; specifically, we examine the life expectancy of shelter dogs. Research Question What age is at the 20th percentile of life expectancy for dogs? Based on this question, we define the population of interest as dogs. It is possi...

4904 sym 2 img

R Homework 3 Key

29.10.2023

In this R homework assignment, we use the KarloffU dataset to further explore the use of z-scores and the attendant percentiles to analyze normally distributed variables. Research Question 1: Analyze follow-up visit temperatures for upperclassmen at KarloffU’s health clinic and determine the values at the 25th and 75th percentiles. Our first ste...

4060 sym 2 img

Lab 1.2 Key

29.10.2023

In this initial lab, we will work with the BikeData dataset to perform our initial exploration of data analysis and the RStudio environment. Lab Question Do students and non-students cycle with the same frequency? Our first step will be to import the BikeData dataset and name it “bike.” We can then use the built-in view of the dataset to answe...

2337 sym

Level 2 Lab 4.2 Key

29.10.2023

In this lab, we use the BullRider dataset to continue to explore the process of statistical inference and hypothesis testing; specifically, we examine the height data of bullriders and determine whether or not there is a significant difference in their average height vs. the average height of males in the US (70 inches). Research Question Do bull...

6415 sym 1 img

Lab 4.2 Key

26.10.2023

In this lab, we use the HealthData dataset to explore the process of statistical inference and hypothesis testing; specifically, we examine the data about white blood cell count in Indonesian women with low albumin levels, and ask the question: is the mean of this group’s white blood cell count lower than what is considered ‘normal’ (4000). T...

5906 sym 2 img

Lab 3.2 Key

23.10.2023

In this lab, we use the SoccerPlayers dataset to better understand how the distribution of a population relates to how we describe its center and spread; the key point is that we can use statistics like mean and standard deviation only with normal distributions. And this leads us to the first question: Question 1 - In order to use the standard norm...

3297 sym 4 img