Publications by Shiju Zhang

Publish Document

15.02.2024

Machine Learning Topic 1: Dimension Reduction. Introduction: Data may encompass numerous variables. Including variables that are highly correlated with one another or irrelevant to the outcome variable can result in overfitting, compromising the reliability of predictions. During model deployment, unnecessary variables may escalate costs associated with data collect...

54798 sym R (19794 sym/47 pcs) 1 tbl

Document

06.02.2024

Introduction: This is an R Markdown document. Markdown is a simple formatting syntax for authoring HTML, PDF, and MS Word documents. For more details on using R Markdown see http://rmarkdown.rstudio.com. When you click the Knit button, a document will be generated that includes both content and the output of any embedded R code chunks within t...

420 sym

Document

30.01.2024

Introduction: This note provides key examples from the book https://bookdown.org/speegled/foundations-of-statistics/RData.html#reading-data-from-files 1 Data in R. Section 1.1 Arithmetic and Variable Assignment: x <- 4; y <- "Hello World!"; z <- TRUE. Section 1.2 Vectors: p <- c(5, 8, 12, 55); q <- c("cat", "dog", "snake"); r <- c(FALSE, TRUE, TRUE, FA...

1954 sym

Publish Document

22.01.2024

Review of Probability and Distributions: the properties of mean, variance, and covariance; the additivity property of some distributions; the moment-generating function as an analytic tool. Sampling Distributions and the Central Limit Theorem (Chapter 7): the sampling distribution of the mean of a sample from a finite population; the sampling distribu...

3444 sym

Publish Document

13.01.2024

Contact Information: My E-Mail: szhang@stcloudstate.edu. Textbook/references: I will be using the following resources: https://bookdown.org/speegled/foundations-of-statistics/RData.html#data-types (Chapters 1, 2.1, 2.4.1, 3-7, 9, 10); https://sjzhang.shinyapps.io/Statistics/; https://afrozhussain.files.wordpress.com/2015/07/data-science-in-r.pdf (only...

3420 sym R (15905 sym/45 pcs) 2 img

Publish Document

13.01.2024

R reference: https://bookdown.org/speegled/foundations-of-statistics/data-manipulation.html#data-manipulation (Chapters 1, 5, 6, 7, and 10). Book page: https://www.stat2.org/ Data manual: https://www.stat2.org/manuals/Stat2DataManual.pdf Book R code: https://www.stat2.org/manuals/Stat2RCompanion.pdf ChatGPT Will Code for Us: Register on https://opai.com...

9415 sym R (10361 sym/28 pcs) 6 img

Document

07.01.2024

What is Statistics? Statistics is a branch of mathematics and a scientific discipline that involves the collection, analysis, interpretation, presentation, and organization of data. Its primary goal is to extract meaningful information from data, make inferences, and support decision-making processes. Statistics is broadly divided into two main bra...

7263 sym R (92 sym/1 pcs)

Publish Document

07.12.2023

Instructions: Answer all questions. Show your work where applicable. Question 1: One-Sample Hypothesis Testing (5 points). A company claims that its new software reduces the average response time for customer inquiries to less than 15 minutes. You collect a random sample of 20 customer inquiries and record the response times (in minutes) as follow...

4529 sym

Document

05.12.2023

Question 1: One-Sample z-Confidence Interval for a Population Proportion (8 points) You are investigating the proportion of customers satisfied with a product. From a sample of 100 customers, 70 express satisfaction. Calculate a 90% confidence interval for the true proportion of satisfied customers. prop.test(n = 100, x = 70, conf.level = 0.90) ##...

8873 sym
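
The proportion question in the entry above (70 satisfied customers out of 100, 90% confidence) can also be worked by hand. The following is a sketch that assumes the textbook Wald (normal-approximation) z-interval is what the question title intends; the critical value \(z_{0.95} \approx 1.645\) is a standard table value, not taken from the source. R's prop.test reports a Wilson score interval with a continuity correction by default, so its output will differ slightly from this calculation.

\[
\hat{p} = \frac{70}{100} = 0.70, \qquad
\hat{p} \pm z_{0.95}\sqrt{\frac{\hat{p}(1-\hat{p})}{n}}
= 0.70 \pm 1.645\sqrt{\frac{0.70 \times 0.30}{100}}
\approx 0.70 \pm 0.075
\]

Under these assumptions the interval is approximately (0.625, 0.775).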

Publish Document

15.11.2023

A Data Analysis Tool. Basic Math. Arithmetic Skills: 12 - 7×2 + 8/2 = 2; 1/3 + 1/6 + 0.8 = 1.3; 25% = 25/100 = 1/4 = 0.25. Algebraic Skills: 3x - 5 = 19, so x = 8. Exponents and Radicals: \(3^2-\sqrt{36}=3\). Functions: y = 3x + 4. Data analysis: The topics include numerical display of data (e.g. mean, median, mode, range, variance/standard deviation, q...

1743 sym