Publications by Jack Wright
Simple Logistic Regression
Simple Logistic Regression \(m=n(observation~per~row)\) This means that each row in your data frame has more than one observation. Additional requirements for simple logistic regression: m != 1. If m == 1, use simple_binary_logistic_regression.Rmd. MichelinFood<-read.table(here('data','MichelinFood.txt'),header = TRUE) Note: each row has a count of m ...
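A minimal sketch of fitting a logistic regression when each row summarises m > 1 observations. The data frame `grouped` and its columns are hypothetical and are not taken from MichelinFood.txt; the response is the observed proportion and `weights = m` supplies the number of observations behind each row.

```r
# Hypothetical grouped data: each row summarises m trials at one predictor value.
grouped <- data.frame(
  x         = c(15, 17, 19, 21, 23, 25),  # illustrative predictor values
  successes = c(1, 2, 5, 9, 12, 14),      # made-up counts, not from MichelinFood.txt
  m         = c(15, 15, 15, 15, 15, 15)   # observations per row (m > 1)
)
grouped$prop <- grouped$successes / grouped$m

# Response is the observed proportion; `weights = m` tells glm how many
# observations stand behind each proportion.
fit_grouped <- glm(prop ~ x, family = binomial, weights = m, data = grouped)

coef(fit_grouped)
```

With m == 1 on every row the response collapses to 0/1, which is the case the text routes to the binary template instead.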
WLS regression
Requirements for weighted least squares regression: response = mean across level of predictor and sd(response_i|predictor_i) = TRUE, OR heteroscedasticity_test(simple_lin_reg) = FAIL. response = mean across level of predictor, sd(response_i|predictor_i) = TRUE: cleaningwtd data. clean<-read.table(here('data','cleaningwtd.txt'),header=TRUE) head(clean) ## Case ...
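A minimal sketch of weighted least squares when the standard deviation of the response is known at each predictor level. The data frame `wls_data` is hypothetical (it is not the cleaningwtd data), and the inverse-variance weights 1/sd^2 are an assumed weighting scheme for illustration.

```r
# Hypothetical data: one row per predictor level, with the standard deviation
# of the response at that level (values are illustrative only).
wls_data <- data.frame(
  x  = c(2, 4, 6, 8, 10, 12),
  y  = c(7.1, 14.8, 21.5, 29.9, 36.2, 44.0),
  sd = c(1.0, 1.5, 2.5, 3.0, 4.5, 5.0)
)

# Inverse-variance weights: noisier levels count for less.
wls_data$w <- 1 / wls_data$sd^2

fit_ols <- lm(y ~ x, data = wls_data)               # ignores unequal variance
fit_wls <- lm(y ~ x, data = wls_data, weights = w)  # weighted least squares

rbind(ols = coef(fit_ols), wls = coef(fit_wls))
```

Comparing the two rows of coefficients shows how much the unequal variances shift the fit.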
multivariate regression process
Requirements for multivariate regression library(tidyverse) library(here) library(car) library(ggResidpanel) nyc<-read.csv(here('data','nyc.csv')) Response Requirements 1. response!=binary & response!=count IF response==binary use multivariate_logistic_regression.Rmd IF response==count use multivariate_count_regression.Rmd High correlation be...
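The response check above can be sketched as a small routing function. `suggest_template` is a hypothetical helper, not part of the original notebooks, and its test for a count response (non-negative whole numbers) is a rough heuristic that would misclassify a continuous response recorded as integers.

```r
# Hypothetical helper (not from the original notebooks): suggest which
# regression template fits a response vector.
suggest_template <- function(response) {
  values <- unique(response[!is.na(response)])
  if (length(values) == 2) {
    return("multivariate_logistic_regression.Rmd")   # binary response
  }
  is_count <- is.numeric(values) && all(values >= 0) && all(values == round(values))
  if (is_count) {
    return("multivariate_count_regression.Rmd")      # non-negative whole numbers
  }
  "multivariate regression"                          # anything else
}

suggest_template(c(0, 1, 1, 0))
suggest_template(c(3, 0, 7, 2))
suggest_template(c(41.5, 38.2, 45.0))
```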
Understanding Deviance and the saturated model
library(tidyverse) ## -- Attaching packages --------------------------------------- tidyverse 1.3.1 -- ## v ggplot2 3.3.5 v purrr 0.3.4 ## v tibble 3.1.4 v dplyr 1.0.7 ## v tidyr 1.1.3 v stringr 1.4.0 ## v readr 2.0.1 v forcats 0.5.1 ## -- Conflicts ------------------------------------------ tidyverse_conflicts() -- #...
Bill collector problem
Bill Collector Problem The problem is as stated: We are a bill collector collecting small amounts whose business model is based on collecting delinquent accounts quickly. The marketing department has come up with a slogan “under 60 days or your money back.” We will investigate under which conditions this promotion will be profitable. DATA: ra...
Predictive Project
Predictive Analytics Project 1 Jack Wright Part A Forecasting cash taken out of ATM Machines, May 2010 First, let's load the data using the read.xlsx function from the readxl package and take a look at the summary. ## DATE ATM Cash ## Min. :2009-05-01 ATM1:365 Min. : 0.0 ## 1st Qu.:2009-08-01 ...
Binomial Regression
Binomial Regression Model: for when the response variable is binomially distributed (probability of k successes in n trials). For a binomial response we need 2 pieces of information about the response: y, the number of successes, and either n, the number of trials, or n-y, the number of failures. Example: Challenger Disaster. This is a dataset examining the failure of the O-rings (important com...
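A minimal sketch of passing both pieces of information (successes and failures) to `glm` as a two-column response. The data frame `trials` is hypothetical; its numbers are illustrative and are not the Challenger O-ring records.

```r
# Hypothetical data in the (y, n) layout described above; the numbers are
# illustrative and are not the Challenger O-ring records.
trials <- data.frame(
  temp = c(50, 55, 60, 65, 70, 75, 80),
  y    = c(5, 4, 3, 2, 1, 1, 0),   # successes (events) per row
  n    = c(6, 6, 6, 6, 6, 6, 6)    # trials per row
)

# Two-column response: successes and failures (n - y).
fit_binom <- glm(cbind(y, n - y) ~ temp, family = binomial, data = trials)

# Predicted event probability at a new temperature.
p_new <- predict(fit_binom, newdata = data.frame(temp = 58), type = "response")
p_new
```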
Simple Linear Categorical Predictor
Categorical Variables: also called factor or qualitative variables; they take a limited number of values, or levels. library(car) ## Loading required package: carData Salaries<-Salaries A categorical variable with n levels will be transformed into n-1 variables, each with 2 levels. levels(Salaries$rank) ## [1] "AsstProf" "AssocProf" "Prof" This 3-level variable w...
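The n-1 encoding can be seen directly with `model.matrix`. This sketch builds a small made-up factor that reuses the three level names shown above rather than loading the Salaries data, and it assumes R's default treatment contrasts.

```r
# A 3-level factor using the level names shown above (the values are made up).
rank <- factor(
  c("AsstProf", "AssocProf", "Prof", "Prof", "AsstProf"),
  levels = c("AsstProf", "AssocProf", "Prof")
)

# Default treatment contrasts: intercept plus n - 1 = 2 indicator columns.
design <- model.matrix(~ rank)
colnames(design)
```

The first level, AsstProf, becomes the baseline absorbed into the intercept, so only rankAssocProf and rankProf appear as 0/1 columns.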
Simple Linear Regression
Requirements for simple linear regression Response Requirements: n(response==1) response!=binary & response!=count IF response==binary - use simple_logistic_regression.Rmd IF response==count - use simple_count_regression.Rmd IF response is an average value at a level of predictor, and the standard deviation can be calculated, use wls_given_var...
ARIMA homework
ARIMA Homework Jack Wright 9.1, 9.2, 9.3, 9.5, 9.6, 9.7, 9.8 9.1 Figure 9.32 shows the ACFs for 36 random numbers, 360 random numbers and 1,000 random numbers. a. Explain the differences among these figures. Do they all indicate that the data are white noise? ## -- Attaching packages -------------------------------------------- fpp3 0.4.0 -- ## v...
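For question 9.1, a short sketch of how the approximate 95% white-noise limits for a sample ACF, ±1.96/√T, depend on the series length T. It uses base R's `acf` on simulated data rather than the fpp3 tooling loaded in the homework; the seed and the choice of 20 lags are arbitrary assumptions.

```r
set.seed(1)  # arbitrary seed so the sketch is reproducible

sizes  <- c(36, 360, 1000)
# Approximate 95% limits for the sample ACF of white noise: +/- 1.96 / sqrt(T).
bounds <- qnorm(0.975) / sqrt(sizes)

# Simulate white noise of each length and count spikes outside the limits.
outside <- sapply(seq_along(sizes), function(i) {
  r <- acf(rnorm(sizes[i]), lag.max = 20, plot = FALSE)$acf[-1]  # drop lag 0
  sum(abs(r) > bounds[i])
})

data.frame(n = sizes, bound = round(bounds, 3), spikes_outside = outside)
```

The limits narrow as T grows, so larger autocorrelations in the 36-observation figure can still be consistent with white noise.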