Publications by CHUN-LI HOU
Course Work Week 3
Instructions Create a web page presentation using R Markdown that features a plot created with Plotly. Host your webpage on either GitHub Pages, RPubs, or NeoCities. Your webpage must contain the date that you created the document, and it must contain a plot created with Plotly. We would love to see you show off your creativity! Review criteria T...
811 sym R (1779 sym/6 pcs) 1 img
Course Work Week 2
Instructions Create a web page using R Markdown that features a map created with Leaflet. Host your webpage on either GitHub Pages, RPubs, or NeoCities. Your webpage must contain the date that you created the document, and it must contain a map created with Leaflet. We would love to see you show off your creativity! Review criteria The rubric co...
735 sym R (1457 sym/4 pcs)
Regression Course Project
Executive Summary The 1974 Motor Trend US magazine dataset (mtcars) is used to evaluate the effect of transmission design on mpg (miles per gallon) in automobiles. Simply put, we are asking the following questions: Is an automatic or manual transmission better for mpg? How large is the mpg difference between automatic and manual transmissions? Data...
5214 sym R (4797 sym/23 pcs) 6 img
Regression Model Protocol
The regression modeling process consists of theorizing, data collection, data cleaning, splitting into training and test sets, variable selection (feature selection), improving the model by goodness of fit, checking the finalised model's assumptions, evaluating on the test set (neither underfit nor overfit), and causation and prediction with the finalised model. knitr::opts_chunk$set(echo = TRUE...
2993 sym R (3709 sym/22 pcs) 6 img
Practical Assignment Writeup
1. Overview Platform: Coursera Course: Practical Machine Learning Task: Week 4 Final Project Location: Texas, USA Link: www.coursera.org/learn/practical-machine-learning/ 2. Background Using devices such as Jawbone Up, Nike FuelBand, and Fitbit it is now possible to collect a large amount of data about personal activity relatively inexpensively...
2587 sym R (3155 sym/16 pcs) 5 img
Machine Learning Library
ML model Data preprocessing: Feature scaling & dropping (depend) Dimensionality reduction (depend): feature selection (regression model), feature extraction (PCA, LDA, Kernel PCA) Model fitting: ML algorithm Prediction: exception (real-time search) (no need) Evaluation: Supervised learning - regression (RMSE) Supervised learning - classificat...
16126 sym R (51797 sym/70 pcs)
Homework 2
0 This data set provides information on the number of women murdered each year (per 100,000 standard population) in the U.S. # load package & set environment if(!require("pacman")) install.packages("pacman") pacman::p_load(fpp2, fpp3, dplyr, patchwork, purrr) data("wmurders") options(digits = 3) set.seed(42) theme_set(theme_minimal()) 1 By...
1675 sym R (2184 sym/16 pcs) 6 img
Customer Value Analysis
Content 1. Introduction Background 2. Preparation Environment Dataset 3. Exploratory Data Analysis Histogram Boxplot 4. Model Data Analysis Preprocessing Machine Learning Hierarchical Clustering Non-Hierarchical Clustering Decision Tree Regression 5. Conclusion 6. Case Study Non-Hierarchical Clustering Decision Tree Regression 7. Refere...
8433 sym R (13659 sym/44 pcs) 22 img
Telco Customer Churn
Content Introduction Background Preparation Environment Dataset Exploratory Data Analysis Churn Percent in Overall Churn Proportion in Different Attributes 1 Churn Proportion in Different Attributes 2 Churn Proportion in Different Attributes 3 Churn Proportion in Different Attributes 4 Correlation between Numeric Variables Outliers in Numeric...
8757 sym R (21114 sym/30 pcs) 25 img
World Happiness Report
Content Introduction Background Mission Preparation Environment Dataset Visualization Correlation plot Comparing different continents regarding their happiness variables Correlation plot for each continent Happiness score comparison on different continents Scatter plot with regression line Scatter plot colored by Continents 3D Plot Predict...
6494 sym R (25952 sym/42 pcs) 25 img