Publications by Enwu Liu

Conduct natural spline regression 'by hand'

17.02.2023

The use of Natural (restricted) spline regression model has been very popular to model non-linear effects of continuous covariates. Statistical software such as R, SAS, STATA and SPSS, et al all can be used to perform the natural spline regression. However, the output results by these software sometimes are quite ‘confusing’,therefore, if ...

6953 sym R (6450 sym/14 pcs) 5 img

Notes for limiting distribution

08.02.2023

For i.i.d random variables, we can always say \(X_n\overset{d}{\rightarrow}X\) since \[\begin{align}%\label{eq:union-bound} F_{X_n}(x)=F_X(x), \qquad \textrm{ for all }x. \end{align}\] \[\therefore\] \[\begin{align}%\label{eq:union-bound} \lim_{n \rightarrow \infty} F_{X_n}(x)=F_X(x), \qquad \textrm{ for all }x. \end{align}\] For i.i.d random ...

719 sym

Converge in distribution

01.02.2023

For i.i.d random variables, we can always say \(X_n\overset{d}{\rightarrow}X\) since \[\begin{align}%\label{eq:union-bound} F_{X_n}(x)=F_X(x), \qquad \textrm{ for all }x. \end{align}\] \[\therefore\] \[\begin{align}%\label{eq:union-bound} \lim_{n \rightarrow \infty} F_{X_n}(x)=F_X(x), \qquad \textrm{ for all }x. \end{align}\] ...

336 sym

Power analysis

13.01.2023

The hypothesis tests, sample size calculations(power analyses) and diagnosis tests are all have the same mathematical base but with many different terms in Statistics and Epidemiology. Here I will review their calculations and their relationships. First, we summarize these terms into a table and a figure.(To Be Continued…) ...

333 sym 1 img

Random effect meta-analysis

29.11.2022

The calculations for random effect model meta-analysis are quite straight forward, the followings are steps we need to conduct a random effect model meta analysis using inverse variance weight method. Random effect meta analysis calculations: Suppose we got the coefficient(\(\beta_i\)) for a predictor and its variance (\(v_i\)), \(i=1,2,...n\)...

2134 sym

Variance of mixture normal distributions

30.11.2022

There are several statistical methods to deal competing risk in survival analysis. Mixture model also can be used when there was competing risk in survival analysis. The following paper introduced how to use mixture model to deal with competing risk in survival analysis. Larson, M. G., & Dinse, G. E. (1985). A mixture model for the regressio...

2392 sym

Notes for Student t distribution

13.12.2022

Let \(W\) denote a random variable with standard normal distribution that is \(N(0, 1)\); let \(V\) denote a random variable that is \(\chi^2(r)\); and let \(W\) and \(V\) be independent. Then the joint pdf of \(W\) and \(V\) ,say \(h(w, v)\), is the product of the pdf of \(W\) and that of \(V\) or \[ h(w,v)=\begin{cases} \frac{1}{\sqr...

2993 sym

Notes for student's theorem

20.12.2022

In proving Student’s Theorem, a linear transformation by matrix can be used,i.e \[W=\begin{bmatrix} \mathbf{\bar{X}}\\ \mathbf{Y} \end{bmatrix}=\begin{bmatrix} \mathbf{v'}\\ \mathbf{I-1v'} \end{bmatrix} \] where, \(\mathbf{v'}=(\frac{1}{n},\frac{1}{n},...,\frac{1}{n})=\frac{1}{n}\mathbf{1'}\) The covariance matrix of the multivariate no...

1571 sym

General format of mixture distribution

21.12.2022

The mixture distribution in general is defined as following: Suppose that we have \(k\) distributions with respective pdfs \(f_1(x), f_2(x), . . . , f_k(x)\),with supports \(\mathcal{S_1, S_2, . . . , S_k}\), means \(\mu_1, \mu_2, . . . , \mu_k\), and variances \(\sigma_1^2, \sigma_2^2,...,\sigma_k^2\), with positive mixing probabilities \(...

4859 sym

loggamma distribution

22.12.2022

There are several formats of loggamma distributions, such as in this PhD thesis https://macsphere.mcmaster.ca/bitstream/11375/6816/1/fulltext.pdf or here for online discussions https://stats.stackexchange.com/questions/370880/what-is-the-expected-value-of-the-logarithm-of-gamma-distribution. Here, we derive mean and variance of another format o...

2512 sym