Publications by Yun Wu
Market Basket Analysis with Association Rules
1.Introduction Association rule learning is a rule-based machine learning method for discovering interesting relations between variables in large databases. It is intended to identify strong rules discovered in databases using some measures of interestingness.In any given transaction with a variety of items, association rules are meant to disco...
7617 sym R (16230 sym/24 pcs) 6 img
Analyzing factors associated with coronary heart disease by PCA Methods
1.Introduction Principal Component Analysis (PCA) is the most common method of data dimensionality reduction. According to Wikipedia, it was first proposed by Karl Pearson (who also invented the chi-square test) in 1901, and has been around for more than a hundred years now. As a method of dimensionality reduction, PCA reduces redundancy and no...
5472 sym R (6512 sym/28 pcs) 8 img
Segmentation of customer using clustering methods
1.Introduction The main idea of this paper is to use clustering methods to refine the classification of members of a shopping mall and to portray user profiles. The dataset of this project is the basic information of the members of a shopping mall,including CustomerID,Gender, Age,Annual Income (k$) and Spending Score(1-100)(Spending Score: a sc...
7803 sym R (5562 sym/37 pcs) 19 img
PCA
1.Introduction Principal Component Analysis (PCA) is the most common method of data dimensionality reduction. According to Wikipedia, it was first proposed by Karl Pearson (who also invented the chi-square test) in 1901, and has been around for more than a hundred years now. As a method of dimensionality reduction, PCA reduces redundancy and no...
5499 sym R (7265 sym/55 pcs) 8 img
Segmentation of customer using clustering methods
1.Introduction The main idea of this paper is to use clustering methods to refine the classification of members of a shopping mall and to portray user profiles. The dataset of this project is the basic information of the members of a shopping mall,including CustomerID,Gender, Age,Annual Income (k$) and Spending Score(1-100)(Spending Score: a sc...
7839 sym R (6650 sym/71 pcs) 19 img
Publish Document
Segmentation of customer groups using clustering methods 1.Introduction Data clustering is one of the basic but very popular and important methods of the unsupervised learning. However, the most well-known clustering algorithms, i.a. k-means or k-medoids, are designed for the datasets which consist of continuous variables. While using count dat...
2350 sym 2 img