Publications by jfor: Josh Forster
Data621: HW1 Submitted
Setup: First, let’s read in the provided dataset. Data Exploration: ## The dataset consists of 2276 observations of 17 variables. The variables and their definitions can be seen below: Variable Definition INDEX Identification variable TARGET_WINS Number of wins TEAM_BATTING_H Base hits by batters (1B, 2B, 3B, HR) TEAM_BATTING_2B Doubles by ...
16666 sym R (29528 sym/29 pcs) 19 img 3 tbl
Data621: HW1
Setup: First, let’s read in the provided dataset. Data Exploration: ## The dataset consists of 2276 observations of 17 variables. The variables and their definitions can be seen below: Variable Definition INDEX Identification variable TARGET_WINS Number of wins TEAM_BATTING_H Base hits by batters (1B, 2B, 3B, HR) TEAM_BATTING_2B Doubles by ...
16606 sym R (29528 sym/29 pcs) 19 img 3 tbl
Data605: Final Exam
Final Exam Probability Density 1: X~Gamma. Using R, generate a random variable X that has 10,000 random Gamma pdf values. A Gamma pdf is completely describe by n (a size parameter) and lambda (λ , a shape parameter). Choose any n greater 3 and an expected value (λ) between 2 and 10 (you choose). *Note: Assuming in the above prompt that the n para...
9488 sym 15 img
Data 605: Assignment 14
1 This week, we’ll work out some Taylor Series expansions of popular functions. \(f(x) = \frac{1}{(1−x)}\) Given this is the approximation of the geometric series, we know that the first few terms of it are \(1+x^2+x^3+x^4..x^n\) \(\\\) \(\Sigma_{n=0}^{\infty} x^n\) Reviewing the derivatives to demonstrate the pattern \(f(x)=\frac{1}{1-x}\) \(f...
1408 sym
Data605: HW13
Use integration by substitution to solve the integral below. \(\int 4e^{-7x}dx \\\) \(\\u = -7x; du = -7\\\) \(-\frac{1}{7}du = dx\\\) \(-\frac{4}{7}\int e^u du = -\frac{4}{7} e^u + C = -\frac{4}{7} e^{-7x}+ C\) Biologists are treating a pond contaminated with bacteria. The level of contamination is changing at a rate of $ =-3150t^{-4} -220 $ bact...
4141 sym
Data605: HW12
Provided dataset from 2008 with the following variables included: Country: name of the country LifeExp: average life expectancy for the country in years InfantSurvival: proportion of those surviving to one year or more Under5Survival: proportion of those surviving to five years or more TBFree: proportion of the population without TB. PropMD: propor...
6732 sym 5 img
Data605: Discussion 12
Using R, build a multiple regression model for data that interests you. Include in this model at least one quadratic term, one dichotomous term, and one dichotomous vs. quantitative interaction term. Interpret all coefficients. Conduct residual analysis. Was the linear model appropriate? Why or why not? Load dataset df <- read.csv('https://raw.gi...
4077 sym 4 img
Data605: Discussion 11
Using R, build a regression model for data that interests you. Conduct residual analysis. Was the linear model appropriate? Why or why not? The dataset I chose to review for this discussion post was used in a visualization assignment for Data602 from NYC OpenData on the Department of Sanitation, but I’m curious to see if the paper recyling collec...
3015 sym 3 img
Data605: HW11
Using the “cars” dataset in R, build a linear model for stopping distance as a function of speed and replicate the analysis of your textbook chapter 3 (visualization, quality evaluation of the model, and residual analysis.) Preliminary Review of the Numeric Variables summary(cars) ## speed dist ## Min. : 4.0 Min. :...
3750 sym 2 img
Data605: HW10
Smith is in jail and has 1 dollar; he can get out on bail if he has 8 dollars. A guard agrees to make a series of bets with him. If Smith bets A dollars, he wins A dollars with probability .4 and loses A dollars with probability .6. mat <- matrix(c(0.4,0.6)) zero <- matrix(0,9,9) print(zero) ## [,1] [,2] [,3] [,4] [,5] [,6] [,7] [,8] [,9] ##...
1683 sym