Publications by Shiju Zhang
Publish Document
PART I: EXPLORING AND COLLECTING DATA 1. Data and Decisions Data are recorded values, whether numbers or labels, together with their context. Data help companies make good decisions when answering business questions. Data are recorded and stored electronically in vast digital repositories called data warehouses. Data can be structured or unstruc...
8489 sym R (5449 sym/14 pcs) 4 img 1 tbl
STAT 242 Lecture Notes
PART I: EXPLORING AND COLLECTING DATA 1. Data and Decisions Data are recorded values, whether numbers or labels, together with their context. Data help companies make good decisions when answering business questions. Data are recorded and stored electronically in vast digital repositories called data warehouses. Data can be structured or unstruc...
8489 sym R (5449 sym/14 pcs) 4 img 1 tbl
Document
Introduction to RStudio. RStudio is an integrated development environment (IDE) for R and Python, with a console, a syntax-highlighting editor that supports direct code execution, and tools for plotting, history, debugging, and workspace management (from https://rstudio.com/). Watch the video: https://www.youtube.com/watch?v=PviVimazpz8 Register on https://rst...
5851 sym R (63134 sym/327 pcs) 31 img
Introduction to R Programming
Introduction to RStudio. RStudio is an integrated development environment (IDE) for R and Python, with a console, a syntax-highlighting editor that supports direct code execution, and tools for plotting, history, debugging, and workspace management (from https://rstudio.com/). Watch the video: https://www.youtube.com/watch?v=PviVimazpz8 Register on https://rst...
5444 sym R (61888 sym/306 pcs) 24 img
Data Visualization & Story Telling
0.1 Data Visualization Data visualization is a visual representation of data that turns chaos into clarity. It uses data in creative ways in order to explain what happened and how the findings (information such as patterns or trends) can be consumed by the audience. It is a process that focuses on an exploratory analysis rather than an explanator...
3560 sym R (1849 sym/8 pcs) 5 img
Stat 242 Assignments
Assignment #2 (Test of and confidence interval for a population proportion) Question (1) A random sample of 200 students is surveyed on a large university campus. They are asked if they use a laptop in class to take notes. Of the 200 students, 70 responded “yes.” Use R to find a 90% confidence interval for ...
3778 sym R (24 sym/1 pcs)
Formula for Introductory Statistics
The general structure of confidence intervals for parameters \[\Large \text{(Point Estimate)} \pm \text{(Margin of Error)}=\text{(Point Estimate)} \pm \text{(Critical Value)}\cdot \text{(Standard Error)}\] The \(z\)-Confidence interval for a single population proportion (\(p\)) \[\color{red} {\Huge \hat{p}\pm z^*\cdot \sqrt{\frac{\hat{p}\cdot (1...
2179 sym
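As a rough illustration of the structure (Point Estimate) ± (Critical Value)·(Standard Error) for a single proportion, the R sketch below uses the survey counts quoted in the assignment entry above (70 "yes" responses out of 200, 90% confidence). The formula in the listing is truncated, so the standard Wald form of the standard error, sqrt(p̂(1 − p̂)/n), is assumed here. The variable names are illustrative and do not come from the original notes.

```r
# Minimal sketch of a z-confidence interval for one proportion.
# Assumption: the standard Wald standard error sqrt(p_hat * (1 - p_hat) / n).
x <- 70            # number of "yes" responses (from the assignment summary)
n <- 200           # sample size (from the assignment summary)
conf_level <- 0.90 # confidence level (from the assignment summary)

p_hat  <- x / n                                # point estimate
z_star <- qnorm(1 - (1 - conf_level) / 2)      # critical value
se     <- sqrt(p_hat * (1 - p_hat) / n)        # standard error
margin <- z_star * se                          # margin of error

ci <- c(lower = p_hat - margin, upper = p_hat + margin)
print(round(ci, 4))
```

Base R's `prop.test(x, n, conf.level = 0.90)` reports a Wilson score interval (with a continuity correction by default), so its endpoints will differ slightly from the interval computed by hand here.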
Document
0.1 The ggplot2 Package We have introduced it. Here we give more examples. 0.1.1 Case Study One The chief financial officer (CFO) of a music download site has just secured the rights to offer downloads of a new album. To see how well it’s selling, she collects the number of downloads per hour for the past 24 hours: Hour = c("12:00 a.m.", "1:00...
45284 sym R (18955 sym/27 pcs)
Document
A Dashboard for AGCO Corporation: Overdue Course Occurrences; Overdue Course Occurrences by Manager; Training Hours Performed vs Needed ...
199 sym 6 img