Publications by Dennis Pong

Data 621 Discussion #6

28.02.2022

Code Show All Code Hide All Code Ex 6.3 Ex 6.3 Q. For the prostate data, fit a model with lpsa as the response and the other variables as predictors. Answer the questions posed in the first question. 2022-02-28 require(faraway) ## Loading required package: faraway Q. For the prostate data, fit a model with lpsa as the response and the ...

2914 sym R (8332 sym/51 pcs) 12 img

Data 621 Spring 2022 - HW1

16.02.2022

DATA EXPLORATION ## TARGET_WINS TEAM_BATTING_H TEAM_BATTING_2B TEAM_BATTING_3B ## Min. : 0.00 Min. : 891 Min. : 69.0 Min. : 0.00 ## 1st Qu.: 71.00 1st Qu.:1383 1st Qu.:208.0 1st Qu.: 34.00 ## Median : 82.00 Median :1454 Median :238.0 Median : 47.00 ## Mean : 80.79 Mean :1469 Mean :241.2 Mea...

11201 sym R (16676 sym/18 pcs) 8 img

Discussion #4

15.02.2022

The aatemp data come from the U.S. Historical Climatology Network. They are the annual mean temperatures (in degrees F) in Ann Arbor, Michigan going back about 150 years. a. Is there a linear trend? attach(aatemp) The following objects are masked from aatemp (pos = 3): temp, year lmod<-lm(temp~year) summary(lmod) Call: lm(formula = temp ~...

12727 sym R (8337 sym/39 pcs) 18 img

Discussion #6

25.02.2022

Question Using the faithful data, fit a regression of duration on waiting. Assuming that there was a measurement error in waiting of 30 seconds, use the SIMEX method to obtain a better estimate of the slope. Solution Declarations Use the print_formula custom function to print the final equation for the model input print_formula <- function(mode...

953 sym R (1104 sym/11 pcs)

Data 621 Discussion #15

07.05.2022

Data Descriptions This data frame contains the following columns: sr savings rate - personal saving divided by disposable income pop15 percent population under age of 15 pop75 percent population over age of 75 dpi per-capita disposable income in dollars ddpi percent growth rate of dpi The sm package: This package implements nonparametric sm...

1267 sym R (1068 sym/11 pcs) 4 img

Data 621 -- Discussion #12

01.05.2022

Question (from Extending the Linear Model with R [ELMR], p.124) The pneumo data gives the number of coal miners classified by radiological examination into one of three categories of pneumonoconiosis and by the number of years spent working at the coal face divided into eight categories. Loading of libraries library(nnet) # for multinom: Fit Mul...

2374 sym R (4818 sym/34 pcs) 3 img

Data 621 - HW5 (final ver.)

03.05.2022

Code Show All Code Hide All Code Data 621 - HW5 Data 621 - HW5 Loading of Libraries DATA EXPLORATION Data Summary Distribution plots Multi-collinearity DATA PREPARATION Boxplots Data Imputation Data Transformation BUILD MODELS Poisson models Negative Binomial models Multiple Linear Regression Models Model Coefficients Comparison Hurdle ...

8898 sym R (41198 sym/46 pcs) 16 img 1 tbl

Data 621 Final Project v4.2

19.05.2022

Code Show All Code Hide All Code DATA 621 Final Project DATA 621 Final Project 1. Data Source 2. Data Clean Up and Aggregation 3. Feature Engineering 4. Exploratory Data Analysis 4.1 Feature Correlations - Collinearity Check for Numeric Variables 4.2 Relationship with the target variable - churn 5. Modeling and Performance Evaluation 6. Co...

18832 sym R (19501 sym/45 pcs) 9 img

Data 621 HW 5

17.04.2022

Code Show All Code Hide All Code Data 621 - HW5 Data 621 - HW5 Loading of Libraries Loading of files Data Exploration Summary Histograms Boxplots Scatter Plots Missing values & Imputations Data Preparations Feature-Target Correlations Multi-collinearity Model Building Model Selection 2022-04-17 Loading of Libraries Data Dictionary Loadi...

4206 sym R (53608 sym/66 pcs) 7 img 1 tbl