Publications by Joshua Zhong

Williamsburg Bridge with quasi poisson

14.01.2024

1 Introduction This data set was gathered by the New York City’s Traffic Information Management System (TIMS), which monitors and records cyclists 24 hours a day. Each entry is an observation of the total bicyclists on that day. This data set is a subset of a larger data set that captures monthly records of bike counts across New York City’...

31717 sym 1 img 6 tbl

Poisson modeling Williamsburg Bridge bike counts + rates

12.01.2024

1 Data set Description This data set was gathered by the New York City’s Traffic Information Management System (TIMS), which monitors and records cyclists 24 hours a day. Each entry is an observation of the total bicyclists on that day. This data set is a subset of a larger data set that captures monthly records of bike counts across New Yor...

23351 sym 1 img 3 tbl

Predictive Model for Breast Cancer

09.01.2024

1 Data set Description The data set was synthetic data set created for practice purposes in the Book Applied Analytics through Case Studies Using SAS and R by Deepti Gupta and APress. It has 600 observations with 11 total variables - 10 numerical variables and 1 categorical variable. Only 10 of these variables are predictors, however, because ...

21468 sym 6 img 6 tbl

Breast Cancer Case Study Echo

05.01.2024

1 Data set Description The data set was synthetic data set created for practice purposes in the Book Applied Analytics through Case Studies Using SAS and R by Deepti Gupta and APress. It has 600 observations with 11 total variables - 10 numerical variables and 1 categorical variable. Only 10 of these variables are predictors, however, because ...

19317 sym 4 img 6 tbl

Case Study: Factors affecting California Home Prices

02.01.2024

Abstract This statistical report delves into the intricate dynamics of California house prices, utilizing a modified version of the California Housing data set derived from the 1990 census data. Employing methodologies such as Box-Cox transformations and bootstrapping with cases and residuals, this analysis aims to uncover nuanced patterns and ...

28963 sym 19 img 7 tbl

Multiple Linear Regression - California Houses

30.12.2023

1 Data set Description The data set was found on Kaggle. It includes information gathered from various block groups in California during the 1990 Census. The U.S. Census Bureau uses the block group as the smallest geographical unit, typically consisting of 600 to 3000 people. Each observation in the data set represents one block group and compr...

12813 sym 8 img 4 tbl

California Housing Prices - Bootstrap SLR

23.12.2023

Introduction The data set used in the project includes information gathered from various block groups in California during the 1990 Census. Each block group comprises an average of 1425.5 individuals in a geographically compact area. The data set comprises of 20640 observations of 9 dependent variables and one independent variable, median house...

6882 sym 3 img

Protein Dietary Trends Amidst Covid-19

20.12.2023

Introduction The COVID-19 pandemic created many shifts - from lifestyle changes to global health concerns. A key factor studied during this time is the dietary habits during the pandemic. A data set compiled from data collected by the United Nations Food and Agriculture Organization (FAO), country population counts from the Population Reference...

7615 sym 1 img