Publications by Shidong Li

ANLY505-2020-Late Fall-Assignment 11 - God Spiked the Integers

07.02.2021

Chapter 11 - God Spiked the Integers. This chapter described some of the most common generalized linear models, those used to model counts. It is important to never convert counts to proportions before analysis, because doing so destroys information about sample size. A fundamental difficulty with these models is that parameters are on a different...
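The point about proportions can be illustrated with a minimal sketch. The data frame `d` and its column names are hypothetical and do not come from the assignment. Both rows have an observed proportion of 0.5, but a binomial GLM fitted to the counts gives very different standard errors, and that difference is exactly what is lost if only the proportion is kept.

```r
# Hypothetical data: same observed proportion, very different sample sizes.
d <- data.frame(
  sample    = c("small", "large"),
  successes = c(1, 50),
  trials    = c(2, 100)
)
d$proportion <- d$successes / d$trials  # 0.5 in both rows

# Binomial GLM on the counts (successes, failures), one intercept per sample.
fit <- glm(cbind(successes, trials - successes) ~ 0 + sample,
           family = binomial, data = d)

# Both log-odds estimates are about 0 (a proportion of 0.5), but the
# standard errors reflect how many trials stand behind each estimate.
se <- sqrt(diag(vcov(fit)))
print(round(coef(fit), 3))
print(round(se, 3))
```

Under these assumptions the standard error of the log-odds is 1 / sqrt(n p (1 - p)), so the sample with 2 trials is far less certain than the one with 100.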

3157 sym R (4101 sym/21 pcs) 1 img

ANLY505-2020-Late Fall-Assignment 12 - Monsters and Mixtures

15.02.2021

Chapter 12 - Monsters and Mixtures. This chapter introduced several new types of regression, all of which are generalizations of generalized linear models (GLMs). Ordered logistic models are useful for categorical outcomes with a strict ordering. They are built by attaching a cumulative link function to a categorical outcome distribution. Zero-inf...
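As a rough sketch of the cumulative link idea, the snippet below turns a set of cutpoints and a linear predictor into ordered category probabilities. The values of `cutpoints` and `phi` are made up for illustration and are not taken from the assignment.

```r
# Hypothetical cutpoints on the log-cumulative-odds scale (must be increasing)
# and a hypothetical linear predictor value.
cutpoints <- c(-1, 0.5, 2)
phi <- 0.3

# Cumulative probabilities Pr(y <= k): inverse logit of (cutpoint_k - phi).
# The last category always has cumulative probability 1.
cum_p <- c(plogis(cutpoints - phi), 1)

# Individual category probabilities are successive differences.
p <- diff(c(0, cum_p))
print(round(p, 3))
```

Three cutpoints give four ordered categories. Raising `phi` shifts probability mass toward the higher categories while the ordering of the cutpoints stays fixed.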

3423 sym R (14608 sym/35 pcs) 3 img

525 Term Project - Logistic Regression, Propensity Score

16.02.2021

##Step 1. EDA
I replaced the long strings with shorter categorical descriptions (manually, with Ctrl+H in Excel).
sum(is.na(data))
## [1] 0
str(data)
## tibble [113 x 33] (S3: tbl_df/tbl/data.frame)
## $ Id : num [1:113] 1 2 3 4 5 6 7 8 9 10 ...
## $ gender : chr [1:113] "1" "1" "0" "1" ...
## $ age ...

639 sym R (18753 sym/43 pcs) 2 img

525 Term Project - EDA

17.02.2021

#knitr::opts_chunk$set(echo = TRUE)
library(dplyr)
##
## Attaching package: 'dplyr'
## The following objects are masked from 'package:stats':
##
##     filter, lag
## The following objects are masked from 'package:base':
##
##     intersect, setdiff, setequal, union
library(tidyr)
library(ggplot2)
library(moments)
library(tidyverse) ...

661 sym R (21670 sym/41 pcs) 3 img