Publications by Mr. Hern

Lesson Check 6.2 Key

18.02.2024

Introduction Continuing our exploration of categorical data, we will learn how we can use the chi-square goodness of fit technique to make statistical inferences on non-numeric data. The Secret Life of Pets: Questions 1 - 6 Tuscany claims that 70% of local pet owners own a dog, and 30% own a cat. Sayber decides to test her claim and takes a sample...

4521 sym

Homework 6.2 Key

18.02.2024

Introduction We continue to practice with chi-square goodness of fit through additional real world examples. Question 1 Select all the statement(s) that are true about the chi-square test. The chi-square test should only be performed on categorical data. The shape of the chi-square distribution depends on degrees of freedom. Values for chi-square...

8188 sym 4 tbl

Level 2 Lab 6.1 Key

17.01.2024

In this lab, we use the BullRider dataset to continue to practice with visualizations of categorical data; specifically to analyze the countries of origin for the riders in the dataset. We will create a bar chart, a table of proportions, and a pie chart and assess their relative effectiveness. Research Question What countries did the riders in the...

5640 sym Python (1620 sym/9 pcs) 4 img

Lab 6.1 Key

15.01.2024

In this lab, we use the FilmData dataset to explore the use of bar chart visualizations as a way of representing straightforward comparisons (vs. statistical inference). Specifically, we address the question of which major studios have produced the most and fewest high grossing movies. Lab Question Which studio has made the most high-grossing mov...

6765 sym Python (868 sym/5 pcs) 4 img

Level 2 Lab 5.2 Key

15.01.2024

In this lab, we use the PostSurvey dataset to continue to expand our understanding of ANOVA testing for comparing more than two groups; specifically whether there is a difference in the average amount of time different classes of college students spend talking on the phone. Since the research question asks for comparisons of three groups, we will u...

9426 sym 2 img

Homework 6.1 Key

15.01.2024

Introduction We continue our exploration of bar charts (and their tabular equivalents) as tools for visualizing categorical data. Fasten Your Seatbelts: Questions 1 - 3 This is a bar chart displaying how often 180 teens report wearing a seatbelt. Question 1 Approximately what number of teens report wearing a seatbelt ‘most times’? Question 1 ...

6680 sym 3 img 1 tbl

Lesson Check 6.1 Key

14.01.2024

Introduction After an extended trek through statistical inference involving quantitative data, we turn our attention to the visualization of qualitatative data, specifically, the use of bar charts for comparing frequency data of qualitative variables. Let’s All Go to the Lobby: Questions 1 - 4 The different ratings of 151 movies are shown below....

3311 sym 1 img

Lab 5.2 Key

11.01.2024

In this lab, we use the FilmData dataset to explore ANOVA testing for hypothesis testing involving more than two independent samples. Specifically, we evaluate the IMDB ratings of films from three different studios. Lab Question How do Fox, Sony, and Warner Brothers studio movies compare in their average IMDB ratings? To address this question, we ...

12130 sym 2 img

Truck Driver Inquiry Key

09.01.2024

This project is perhaps nothing more than your standard mother-in-law trope, but it was good to be able to apply the rigorous disciplines of data science to stop disinformation in its tracks. My wife and I were on a road trip with her mother over the holidays, taking her to visit my wife’s brother while we headed on to visit my family. My mother-...

6427 sym

R Homework 5 Key

27.12.2023

Introduction In this homework we return to Karloff U to cover independent t-testing and ANOVA (short for ANalysis Of VAriance) testing. Research Question 1: Baby, Baby, Can’t You Hear My Heartbeat? Do Karloff students with a high pulse rate during their health clinic visit moontan for a significantly different amount of time, on average, than st...

9910 sym 4 img