Publications by Shiju Zhang
Publish Document
Types of Data Mining Problems Data mining is a process of data. We will cover a few types of data mining problems: Classification Prediction Clustering The first two are called supervised learning methods and the third is an unsupervised learning method. Generally, there are 4 steps in a data mining problem: Problem definition: Define the prob...
4634 sym R (26598 sym/28 pcs) 1 img
Publish Document
1 Introduction R is so powerful that it can do almost anything for you. 1.1 Example 1 The following example introduces the use of R. a=8 b=9 c=a+b print(c) ## [1] 17 1.2 A Second Example The following will make a scatterplot. # Data vector x x=c(3,6,8,2,6,7,4,9) # Data vector y y = c(89, 98, 95, 67, 88, 94, 56, 99) # Calculate mean of x mean(...
233 sym 1 img
Publish Document
Which Model to Use? Data Mining Models and Code model response variable (y) type predictors missing value handling dummify predictors needed? code multiple linear regression quantitative any yes yes lm(y~., data=D) binary logistic binary (0/1) any yes yes glm(y~., data = D, family = ‘binomial’) ordinal logistic factor wit...
378 sym 1 tbl
Publish Document
knitr::opts_chunk$set(echo = TRUE) suppressPackageStartupMessages(library(forecast)) # To use the accuracy() function suppressPackageStartupMessages(library(caret)) suppressPackageStartupMessages(library(NHANES)) suppressPackageStartupMessages(library(tidyverse)) suppressPackageStartupMessages(library(yardstick)) # For ROC suppressPackage...
4620 sym R (15650 sym/41 pcs) 7 img
Publish Document
HW1 Solutions We are given \(P(C)=0.6\) and \(P(C \cap J)=0.2\). \(P(J|C)=\frac{P(J\cap C)}{P(C)}=\frac{0.2}{0.6}=\frac{1}{3}\) Given \(P(C|J)=0.4\), want \(P(J)\). Since \[0.4 = P(C|J)=\frac{P(C\cap J)}{P(J)}=\frac{0.2}{P(J)}\] \[P(J)=0.5\] 2. Denote “Left-handed” by “LH”, “Red-Green color-blinded” by “RG”, “Blue-Yellow colo...
6100 sym 3 tbl
Publish Document
The notes start from Chapter 3 Chapter 0 A Change in Dates of the Schedule on the Syllabus Please refer to the new syllabus on D2L. Chapter 1 The Role of Statistics in Engineering 1 1.1 The Engineering Method and Statistical Thinking 2 1.1.1 Variability 3 1.1.2 Populations and Samples 5 1.2 Collecting Engineering Data 5 1.2.1 Basic Princip...
35799 sym 5 img
Publish Document
Instruction of submission Submit your R Markdown to an appropriate dropbox in D2L. When submitting your file, include a link to your app created with the learnr package. (File -> New File -> R Markdown -> From Template -> Interactive Tutorial) Assignment # 1 Register with https://rstudio.cloud/. Then get logged in. Start a new project. Name the...
9797 sym 1 tbl
Document
Instructor’s Contact Dr. Shiju Zhang: szhang@stcloudstate.edu Office Hours There will be no in-office assistance. Students can get help from the instructor through the Zoom link: https://minnstate.zoom.us/j/6499950400 (Passcode: 3.14). When help is needed, students are suggested to make an appointment with the instructor. The instructor’s a...
1929 sym 1 tbl
Document
Click any point inside the frame to start. If the red ball falls into the black whole, you win. Reload ...
110 sym
Publish Document
Chapter 0 Preliminaries and the Coverage Do the following problems to see if you have met the basic mathematics requirements. Solve the equation \(3x-7=26\). Solve the equation \(\frac{12}{25}=\frac{x}{15}\). Solve the equation \(\frac{18}{27}=\frac{14}{x}\). Solve the equation \(\frac{17+1}{27+1}=\frac{8+1}{x+1}\). Solve the equation \(256 = 64...
177555 sym 41 img 27 tbl