Publications by Jeff Parks

Data 605 -

13.04.2024

The attached who.csv dataset contains real-world data from 2008. The variables included follow. Country: name of the country LifeExp: average life expectancy for the country in years InfantSurvival: proportion of those surviving to one year or more Under5Survival: proportion of those surviving to five years or more TBFree: proportion of the popula...

5965 sym 9 img

Data 605 - Discussion 12

09.04.2024

Using R, build a multiple regression model for data that interests you.  Include in this model at least one quadratic term, one dichotomous term, and one dichotomous vs. quantitative interaction term. Interpret all coefficients. Conduct residual analysis.  Was the linear model appropriate? Why or why not? The King County House Sales dataset f...

3929 sym Python (4507 sym/11 pcs) 3 img 4 tbl

Data 605 Exercise 11

31.03.2024

# libraries library(tidyverse) library(moderndive) library(kableExtra) library(mice) library(caret) # ggplot theme_set(theme_light()) The Saratoga Houses dataset from the ModernDive package contains 1057 observations of 8 variables from home sales data: price (num) living_area (num) bathrooms (num) bedrooms (num) fireplaces (num) lot_size (num) a...

2024 sym R (2499 sym/13 pcs) 4 img 4 tbl

Data 605 - Homework 11

30.03.2024

The cars dataset in R has 50 observations of two variables from a 1920 study: speed and stopping distance (dist). We’ll build a simple linear regression model for stopping distance as a function of speed, and analyze the results. Data Setup and EDA df_cars = cars head(df_cars) ## speed dist ## 1 4 2 ## 2 4 10 ## 3 7 4 ## 4 ...

3452 sym 4 img

Data 608 - HW 6

12.03.2024

Problem 1 A bag contains 5 green and 7 red jellybeans. How many ways can 5 jellybeans be withdrawn from the bag so that the number of green ones withdrawn will be less than 2? This can be calculated as the sum of two combinations; a draw with zero greens, and a draw with one green. Choose 5 of 7 reds: \(C(7,5) = \frac{7!}{5!(7-5)!}\) = 21 Choose 1 ...

6253 sym

Data 608 - HW7

11.03.2024

Problem 1 Let \(X_1, X_2, . . . , Xn\) be n mutually independent random variables, each of which is uniformly distributed on the integers from 1 to k. Let Y denote the minimum of the \(X_i\)’s. Find the distribution of Y. Each variable \(X_i\) in a (discrete) uniform distribution will have the same probability \(P(x) = \frac{1}{k}\) for \(x = (1,...

4135 sym

Data 608 - Story 3

06.03.2024

Introduction The CDC publishes firearm mortality for each State per 100,000 persons. Each State’ firearm control laws can be categorized as very strict to very lax. The purpose of this Story is to answer the question, “Do stricter firearm control laws help reduce firearm mortality?” # libraries library('tidyverse') library('readxl') library('...

3217 sym R (2443 sym/6 pcs) 1 img

DATA 605 - Homework 5

28.02.2024

Problem 1 (Bayesian). A new test for multinucleoside-resistant (MNR) human immunodeficiency virus type 1 (HIV-1) variants was recently developed. The test maintains 96% sensitivity, meaning that, for those with the disease, it will correctly report “positive” for 96% of them. The test is also 98% specific, meaning that, for those without the di...

6849 sym

Story 1 - Infrastructure

07.02.2024

data <- read_csv('https://raw.githubusercontent.com/jefedigital/cuny-data-608-visual-analytics/main/assignments/week-01/story-1-infrastructure/story-1-data.csv') Introduction The provided contained data on the present allocation of the Infrastructure Investment and Jobs Act funding by State and Territory. This story explores the following question...

3157 sym 4 img

Linear Algebra - Determinants

05.02.2024

Text: A First Course in Linear Algebra (Beezer) Chapter: Determinants, Exercise C24 (Page 278) Q: Doing the computations by hand, find the determinant of the matrix below. \[\begin{bmatrix} -2 & 3 & -2 \\ -4 & -2 & 1 \\ 2 & 4 & 2 \\ \end{bmatrix}\] A: The formula for the determinant of a 3x3 matrix below is \(det=a(ei−fh)−b(di−fg)+c(dh−eg)\...

557 sym