Publications by Shidong Li
ANLY505-2020-Late Fall-Assignment 11 - God Spiked the Integers
Chapter 11 - God Spiked the Integers This chapter described some of the most common generalized linear models, those used to model counts. It is important to never convert counts to proportions before analysis, because doing so destroys information about sample size. A fundamental difficulty with these models is that parameters are on a different...
3157 sym R (4101 sym/21 pcs) 1 img
ANLY505-2020-Late Fall-Assignment 12 - Monsters and Mixtures
Chapter 12 - Monsters and Mixtures This chapter introduced several new types of regression, all of which are generalizations of generalized linear models (GLMs). Ordered logistic models are useful for categorical outcomes with a strict ordering. They are built by attaching a cumulative link function to a categorical outcome distribution. Zero-inf...
3423 sym R (14608 sym/35 pcs) 3 img
525 Term Project - Logistic Regression, Propensity Score
##Step 1. EDA I replaced (manually with ctrl+h in excel) the long strings with shorter categorical descriptions. sum(is.na(data)) ## [1] 0 str(data) ## tibble [113 x 33] (S3: tbl_df/tbl/data.frame) ## $ Id : num [1:113] 1 2 3 4 5 6 7 8 9 10 ... ## $ gender : chr [1:113] "1" "1" "0" "1" ... ## $ age ...
639 sym R (18753 sym/43 pcs) 2 img
525 Term Project - EDA
#knitr::opts_chunk$set(echo = TRUE) library(dplyr) ## ## Attaching package: 'dplyr' ## The following objects are masked from 'package:stats': ## ## filter, lag ## The following objects are masked from 'package:base': ## ## intersect, setdiff, setequal, union library(tidyr) library(ggplot2) library(moments) library(tidyverse) ...
661 sym R (21670 sym/41 pcs) 3 img