Publications by Shiju Zhang

Document

08.05.2024

Introduction Data visualization is a powerful and essential tool in the realm of data analysis and communication. Its importance stems from its ability to make complex data more accessible, understandable, and insightful. Here are several key reasons why data visualization is crucial: Data Understanding: Visualizations provide a clear and intuitive...

41068 sym R (18977 sym/51 pcs) 26 img

Document

25.03.2024

Chapter 1. Introduction and Overview When tossing a coin twice, we may get a sum of 2, 3, 4, …, or 12 points, so the result is uncertain. What is the most likely sum? Uncertainty refers to the situation where results or outcomes are not completely determined and depend on a number of factors and pure chance. It exists every moment in everyday lif...

105071 sym 30 img 12 tbl

Publish Document

15.02.2024

Machine Learning Topic 1: Dimension Reduction Introduction Data may encompass numerous variables. The inclusion of highly correlated or irrelevant variables with the outcome variable can result in overfitting, compromising the reliability of predictions. During model deployment, unnecessary variables may escalate costs associated with data collect...

54798 sym R (19794 sym/47 pcs) 1 tbl

Document

06.02.2024

Introduction This is an R Markdown document. Markdown is a simple formatting syntax for authoring HTML, PDF, and MS Word documents. For more details on using R Markdown see http://rmarkdown.rstudio.com. When you click the Knit button a document will be generated that includes both content as well as the output of any embedded R code chunks within t...

420 sym

Document

30.01.2024

Introduction This note provides key examples to the book https://bookdown.org/speegled/foundations-of-statistics/RData.html#reading-data-from-files 1 Data in R Section 1.1 Arithmetic and Variable Assignment x <- 4 y <- "Hello World!" z <- TRUE Section 1.2 Vectors p <- c(5, 8, 12, 55) q <- c("cat", "dog", "snake") r <- c(FALSE, TRUE, TRUE, FA...

1954 sym

Publish Document

22.01.2024

Review of Probability and Distributions The properties of mean, variance, and covariance The additivity property of some distributions The moment-generating function as an analytic tool Sampling Distributions and the Central Limit Theorem (Chapter 7) The sampling distribution of the mean of a sample from a finite population The sampling distribu...

3444 sym

Publish Document

13.01.2024

Contact Information My E-Mail: szhang@stcloudstate.edu Textbook/references I will be using the following resources: https://bookdown.org/speegled/foundations-of-statistics/RData.html#data-types (Chapters 1, 2.1, 2.4.1, 3-7, 9, 10) https://sjzhang.shinyapps.io/Statistics/ https://afrozhussain.files.wordpress.com/2015/07/data-science-in-r.pdf (only...

3420 sym R (15905 sym/45 pcs) 2 img

Publish Document

13.01.2024

R reference https://bookdown.org/speegled/foundations-of-statistics/data-manipulation.html#data-manipulation (Chapters 1,5,6,7, and 10) Book Page https://www.stat2.org/ Data manual: https://www.stat2.org/manuals/Stat2DataManual.pdf Book R Code https://www.stat2.org/manuals/Stat2RCompanion.pdf ChatGPT Will Code for Us Register on https://opai.com...

9415 sym R (10361 sym/28 pcs) 6 img

Document

07.01.2024

What is Statistics Statistics is a branch of mathematics and a scientific discipline that involves the collection, analysis, interpretation, presentation, and organization of data. Its primary goal is to extract meaningful information from data, make inferences, and support decision-making processes. Statistics is broadly divided into two main bra...

7263 sym R (92 sym/1 pcs)

Publish Document

07.12.2023

Instructions: Answer all questions. Show your work where applicable. Question 1: One-Sample Hypothesis Testing (5 points) A company claims that their new software reduces the average response time for customer inquiries to less than 15 minutes. You collect a random sample of 20 customer inquiries and record the response times (in minutes) as follow...

4529 sym