Publications by Zhi Ying Chen (Sec#1), Mengqin Cai (Sec#3), Fan Xu (Sec#4), Sin Ying Wong (Sec#4)

DATA624_Project 1

12.04.2021

DATA 608 - PROJECT 1 Introduction Package Part A Data Exploration Data Summary Missing-value Check Timeliness Check Outlier Check Data Manipulation Imputing Missing Values Handling Outliers Data Visualization ATM 1 Observation on Raw Data Time Series Transformation Observation on Transformed Data Build ARIMA Model...

14879 sym R (13895 sym/78 pcs) 27 img

DATA624_Homework 4

08.03.2021

DATA 624 - HOMEWORK 4 1 Question - 3.1 1.1 (a) 1.2 (b) 1.3 (c) 2 Question - 3.2 2.1 (a) 2.2 (b) 2.3 (c) Fan Xu 03/07/2021 library(tidyverse) library(corrplot) library(missForest) library(ggthemes) library(psych) library(naniar) library(DMwR) 1 Question - 3.1 The UC Irvine Machine Learning Repository contains...

4214 sym R (17255 sym/22 pcs) 9 img

DATA624_Homework 5

15.03.2021

DATA 624 - HOMEWORK 5 Question - 7.1 (a) (b) Question - 7.5 (a) (b) (c) Question - 7.6 (a) (b) (c) (d) Question - 7.7 (a) (b) Question - 7.8 (a) (b) (c) (d) (e) Question - 7.9 (a) Fan Xu 03/14/2021 library(tidyverse) library(corrplot) library(missForest) library(ggthemes) library(rio) Question - 7.1 Consid...

9158 sym R (10981 sym/34 pcs) 14 img

DATA624_Homework 6

29.03.2021

DATA 624 - HOMEWORK 6 Question - 8.1 (a) (b) Question - 8.2 (a) (b) Question - 8.3 (a) (b) (c) (d) (e) Question - 8.5 (a) Question - 8.6 (a) (b) (c) (d) (e) (f) (g) Question - 8.7 (a) (b) (c) (d) (e) (f) (g) Fan Xu 03/28/2021 library(tidyverse) library(corrplot) library(missForest) library(ggthemes) library...

12799 sym R (9216 sym/81 pcs) 29 img

DATA608_Final_Project_Proposal

05.04.2021

DAT608 - Final Project Proposal Objective Data Introduction Data Source Data Dictionary Data Exploration Expected Project Deliverable Fan Xu 4/4/2021 library(RSocrata) library(tidyverse) library(plotly) library(blsAPI) library(jsonlite) library(knitr) library(kableExtra) Objective This project is to create a...

2285 sym R (2268 sym/4 pcs) 2 img 1 tbl

DATA624_Project 2

24.05.2021

DATA 624 - PROJECT 2 1 Introduction 2 Load Package 3 Load Data 4 Exploratory data analysis 4.1 Training Data Summary 4.2 Evaluation Data Summary 4.3 Missing Value View 4.4 Numerical Predictor Correlation after Missing Data Imputation 5 Data Preprocess 6 Model building 6.1 Linear Regression Models 6.1.1 PLS Regression 6...

7055 sym R (57858 sym/72 pcs) 1 img 9 tbl

DATA624_Homework 8

26.04.2021

DATA 624 - HOMEWORK 8 Question 7.2 SVM-Linear SVM-Radial SVM-Polynomial MARS Neural Networks Model Comparison Question 7.5 Load Data Data Imputation train_test_split (a) KNN SVM-Linear SVM-Radial SVM-Polynomial MARS Neural Networks Model Comparison (b) Top Important Predictors Optimal linear model in #6.3 Comparison ...

4208 sym R (44987 sym/87 pcs) 2 img

DATA624_Homework 7

18.04.2021

DATA 624 - HOMEWORK 7 Question 6.2 (a) (b) (c) Train-test-split Tune PLS model Variance Important Evaluation (d) (e) Ridge Train Ridge Model Prediction Lasso Train Lasso Model Prediction Elastic Net Train Elastic Net Model Prediction (f) Question 6.3 (a) (b) (c) train_test_split Build Elastic Net Model (d) (e) (f)...

5488 sym R (14225 sym/42 pcs) 2 img

DATA624_Homework 8

04.05.2021

DATA 624 - HOMEWORK 9 Load Package Question 8.1 (a) (b) (c) (d) Question 8.2 Create 10 variables with different granularities build a regression tree Variable Importance Score vs # of Distinct Values Question 8.3 (a) (b) (c) Questions 8.7 Load Data Data Imputation train_test_split (a) Single Tree Random Forest Gradie...

7141 sym R (41795 sym/41 pcs) 3 img

DATA 698 Final Project

17.12.2021

DATA 698 Final Project - Country Code Extractor For Wire Transactions Abstract Introduction Literature Review Methodology Evaluation Process Intro to Data Set Data Population Statistics Data Sample Statistics Load Data Sample Translate data into English Google Cloud Translation API Start translation process Training CN...

17285 sym R (9401 sym/14 pcs) 4 img