Publications by MSDS 6372: Jacob Turner: Student: Jessica McPhaul link:
Final_Model
# Load libraries library(aplore3) library(caret) ## Loading required package: ggplot2 ## Loading required package: lattice library(pROC) ## Type 'citation("pROC")' for a citation. ## ## Attaching package: 'pROC' ## The following objects are masked from 'package:stats': ## ## cov, smooth, var library(randomForest) ## randomForest ...
802 sym R (92211 sym/93 pcs) 2 img
Project Part 2
Libraries ## PART 2 library(xgboost) library(shapper) ## Warning: package 'shapper' was built under R version 4.3.3 library(ROCR) ## Warning: package 'ROCR' was built under R version 4.3.3 library(ROSE) ## Warning: package 'ROSE' was built under R version 4.3.3 library(DMwR2) ## Warning: package 'DMwR2' was built under R version 4.3.3 library(smo...
458 sym R (193440 sym/252 pcs) 11 img
Applied_Statistics_6372_Jessica_McPhail_Unit_13_HW
Exercise 1: Conceptual Questions True or False? Principle component analysis is a predictive modeling technique such as linear regression, LDA, or knn. True or False? Technically speaking, PCA should not be applied to categorical variables. An analyst conducts PCA on continous variables 1 through 20 and settled on reducing the data down to 4 PC...
19467 sym R (15807 sym/36 pcs) 7 img
Stats2Project2_Objective1_FINAL NEEDS REVIEW
For a detailed exploration and insights from an initial analysis of the Glow_Bonemed dataset, refer to the detailed report on RPubs. Preliminaries Load necessary libraries library(ggplot2) library(dplyr) library(caret) library(pROC) library(car) library(effects) ## Warning: package 'effects' was built under R version 4.3.3 library(lmtest) ...
14615 sym R (34563 sym/185 pcs) 30 img
Jessica McPhaul Unit 12 HW
Exercise 1: Conceptual Questions Name at least two advantages that logistic regression has over LDA. Does logistic regression predict the binary outcome or does it predict something else? Explain. True or false? Multicollinearity is not typically an issue with LDA. However it is still an issue for logistic regression. Briefly explain the separa...
16340 sym R (14054 sym/48 pcs) 6 img
Jessica McPhaul 6372 Unit 13 Pre Live Session
Discussion 1: PCA Basics Review For this discussion we will be utilizing a data set that was a study investigating potential civil rights violations among insurance companies operating in Chicago, Illinois. A description of the data set can be found as well as a description for each variable. Please take the time and read through the descriptio...
16714 sym R (9880 sym/48 pcs) 5 img
Jessica McPhaul - 6372 - Unit-11-Homework
Exercise 1: Conceptual Questions What is the main issue with fitting an MLR on a binary response coded as 0 or 1? What must we do to a logistic regression coefficient to interpret it as an odds ratio? If we wanted to interpret an odds ratio for a 20-unit increase (rather than a one-unit increase) of a continuous predictor, what is the general ...
16161 sym R (8492 sym/44 pcs) 7 img
Jessica McPhaul - Unit 12 Pre Live Session
Discussion 1: Adjustment of a 2x2 table Dr. Turner mentioned in the videos that, like MLR, logistic regression allows for estimating effects while taking into account additional variables. These additional variables could be confounding the effect you wish to estimate most. Consider the following example where two colleges had a science compet...
14128 sym R (11757 sym/67 pcs) 10 img
Jessica McPhaul_Unit_10_HW
Jessica McPhaul - 6372 _Unit_10_Homework Exercise 1: Conceptual Questions State under what circumstance a difference in proportion confidence interval should not be used in favor of an odds ratio metric. Under what sampling schemes can a hypothesis test be generally worded as a “test for association”, rather than a test for difference in ...
6534 sym
Jessica_McPhaul_Unit_11_PLS
Jessica McPhaul - 6372 _Unit_11_PreLive Discussion 1: Continous Predictor Case As discussed in the videos, visualizing trends when the predictor is continuous and the response is binary takes a little care. Loess curves are one way to examine it but the results will vary depending on the data set and sample size. We will illustrate this using ...
12622 sym R (3686 sym/35 pcs) 6 img