Publications by Jeff Parks
Data 605 -
The attached who.csv dataset contains real-world data from 2008. The variables included follow. Country: name of the country LifeExp: average life expectancy for the country in years InfantSurvival: proportion of those surviving to one year or more Under5Survival: proportion of those surviving to five years or more TBFree: proportion of the popula...
5965 sym 9 img
Data 605 - Discussion 12
Using R, build a multiple regression model for data that interests you. Include in this model at least one quadratic term, one dichotomous term, and one dichotomous vs. quantitative interaction term. Interpret all coefficients. Conduct residual analysis. Was the linear model appropriate? Why or why not? The King County House Sales dataset f...
3929 sym Python (4507 sym/11 pcs) 3 img 4 tbl
Data 605 Exercise 11
# libraries library(tidyverse) library(moderndive) library(kableExtra) library(mice) library(caret) # ggplot theme_set(theme_light()) The Saratoga Houses dataset from the ModernDive package contains 1057 observations of 8 variables from home sales data: price (num) living_area (num) bathrooms (num) bedrooms (num) fireplaces (num) lot_size (num) a...
2024 sym R (2499 sym/13 pcs) 4 img 4 tbl
Data 605 - Homework 11
The cars dataset in R has 50 observations of two variables from a 1920 study: speed and stopping distance (dist). We’ll build a simple linear regression model for stopping distance as a function of speed, and analyze the results. Data Setup and EDA df_cars = cars head(df_cars) ## speed dist ## 1 4 2 ## 2 4 10 ## 3 7 4 ## 4 ...
3452 sym 4 img
Data 608 - HW 6
Problem 1 A bag contains 5 green and 7 red jellybeans. How many ways can 5 jellybeans be withdrawn from the bag so that the number of green ones withdrawn will be less than 2? This can be calculated as the sum of two combinations; a draw with zero greens, and a draw with one green. Choose 5 of 7 reds: \(C(7,5) = \frac{7!}{5!(7-5)!}\) = 21 Choose 1 ...
6253 sym
Data 608 - HW7
Problem 1 Let \(X_1, X_2, . . . , Xn\) be n mutually independent random variables, each of which is uniformly distributed on the integers from 1 to k. Let Y denote the minimum of the \(X_i\)’s. Find the distribution of Y. Each variable \(X_i\) in a (discrete) uniform distribution will have the same probability \(P(x) = \frac{1}{k}\) for \(x = (1,...
4135 sym
Data 608 - Story 3
Introduction The CDC publishes firearm mortality for each State per 100,000 persons. Each State’ firearm control laws can be categorized as very strict to very lax. The purpose of this Story is to answer the question, “Do stricter firearm control laws help reduce firearm mortality?” # libraries library('tidyverse') library('readxl') library('...
3217 sym R (2443 sym/6 pcs) 1 img
DATA 605 - Homework 5
Problem 1 (Bayesian). A new test for multinucleoside-resistant (MNR) human immunodeficiency virus type 1 (HIV-1) variants was recently developed. The test maintains 96% sensitivity, meaning that, for those with the disease, it will correctly report “positive” for 96% of them. The test is also 98% specific, meaning that, for those without the di...
6849 sym
Story 1 - Infrastructure
data <- read_csv('https://raw.githubusercontent.com/jefedigital/cuny-data-608-visual-analytics/main/assignments/week-01/story-1-infrastructure/story-1-data.csv') Introduction The provided contained data on the present allocation of the Infrastructure Investment and Jobs Act funding by State and Territory. This story explores the following question...
3157 sym 4 img
Linear Algebra - Determinants
Text: A First Course in Linear Algebra (Beezer) Chapter: Determinants, Exercise C24 (Page 278) Q: Doing the computations by hand, find the determinant of the matrix below. \[\begin{bmatrix} -2 & 3 & -2 \\ -4 & -2 & 1 \\ 2 & 4 & 2 \\ \end{bmatrix}\] A: The formula for the determinant of a 3x3 matrix below is \(det=a(ei−fh)−b(di−fg)+c(dh−eg)\...
557 sym