Publications by Joshua Zhong
Williamsburg Bridge with quasi poisson
1 Introduction This data set was gathered by the New York City’s Traffic Information Management System (TIMS), which monitors and records cyclists 24 hours a day. Each entry is an observation of the total bicyclists on that day. This data set is a subset of a larger data set that captures monthly records of bike counts across New York City’...
31717 sym 1 img 6 tbl
Poisson modeling Williamsburg Bridge bike counts + rates
1 Data set Description This data set was gathered by the New York City’s Traffic Information Management System (TIMS), which monitors and records cyclists 24 hours a day. Each entry is an observation of the total bicyclists on that day. This data set is a subset of a larger data set that captures monthly records of bike counts across New Yor...
23351 sym 1 img 3 tbl
Predictive Model for Breast Cancer
1 Data set Description The data set was synthetic data set created for practice purposes in the Book Applied Analytics through Case Studies Using SAS and R by Deepti Gupta and APress. It has 600 observations with 11 total variables - 10 numerical variables and 1 categorical variable. Only 10 of these variables are predictors, however, because ...
21468 sym 6 img 6 tbl
Breast Cancer Case Study Echo
1 Data set Description The data set was synthetic data set created for practice purposes in the Book Applied Analytics through Case Studies Using SAS and R by Deepti Gupta and APress. It has 600 observations with 11 total variables - 10 numerical variables and 1 categorical variable. Only 10 of these variables are predictors, however, because ...
19317 sym 4 img 6 tbl
Case Study: Factors affecting California Home Prices
Abstract This statistical report delves into the intricate dynamics of California house prices, utilizing a modified version of the California Housing data set derived from the 1990 census data. Employing methodologies such as Box-Cox transformations and bootstrapping with cases and residuals, this analysis aims to uncover nuanced patterns and ...
28963 sym 19 img 7 tbl
Multiple Linear Regression - California Houses
1 Data set Description The data set was found on Kaggle. It includes information gathered from various block groups in California during the 1990 Census. The U.S. Census Bureau uses the block group as the smallest geographical unit, typically consisting of 600 to 3000 people. Each observation in the data set represents one block group and compr...
12813 sym 8 img 4 tbl
California Housing Prices - Bootstrap SLR
Introduction The data set used in the project includes information gathered from various block groups in California during the 1990 Census. Each block group comprises an average of 1425.5 individuals in a geographically compact area. The data set comprises of 20640 observations of 9 dependent variables and one independent variable, median house...
6882 sym 3 img
Protein Dietary Trends Amidst Covid-19
Introduction The COVID-19 pandemic created many shifts - from lifestyle changes to global health concerns. A key factor studied during this time is the dietary habits during the pandemic. A data set compiled from data collected by the United Nations Food and Agriculture Organization (FAO), country population counts from the Population Reference...
7615 sym 1 img