Publications by Yun Wu

Market Basket Analysis with Association Rules

02.02.2024

1.Introduction Association rule learning is a rule-based machine learning method for discovering interesting relations between variables in large databases. It is intended to identify strong rules discovered in databases using some measures of interestingness.In any given transaction with a variety of items, association rules are meant to disco...

7617 sym R (16230 sym/24 pcs) 6 img

Analyzing factors associated with coronary heart disease by PCA Methods

02.02.2024

1.Introduction Principal Component Analysis (PCA) is the most common method of data dimensionality reduction. According to Wikipedia, it was first proposed by Karl Pearson (who also invented the chi-square test) in 1901, and has been around for more than a hundred years now. As a method of dimensionality reduction, PCA reduces redundancy and no...

5472 sym R (6512 sym/28 pcs) 8 img

Segmentation of customer using clustering methods

01.02.2024

1.Introduction The main idea of this paper is to use clustering methods to refine the classification of members of a shopping mall and to portray user profiles. The dataset of this project is the basic information of the members of a shopping mall,including CustomerID,Gender, Age,Annual Income (k$) and Spending Score(1-100)(Spending Score: a sc...

7803 sym R (5562 sym/37 pcs) 19 img

PCA

06.01.2024

1.Introduction Principal Component Analysis (PCA) is the most common method of data dimensionality reduction. According to Wikipedia, it was first proposed by Karl Pearson (who also invented the chi-square test) in 1901, and has been around for more than a hundred years now. As a method of dimensionality reduction, PCA reduces redundancy and no...

5499 sym R (7265 sym/55 pcs) 8 img

Segmentation of customer using clustering methods

04.01.2024

1.Introduction The main idea of this paper is to use clustering methods to refine the classification of members of a shopping mall and to portray user profiles. The dataset of this project is the basic information of the members of a shopping mall,including CustomerID,Gender, Age,Annual Income (k$) and Spending Score(1-100)(Spending Score: a sc...

7839 sym R (6650 sym/71 pcs) 19 img

Publish Document

03.01.2024

Segmentation of customer groups using clustering methods 1.Introduction Data clustering is one of the basic but very popular and important methods of the unsupervised learning. However, the most well-known clustering algorithms, i.a. k-means or k-medoids, are designed for the datasets which consist of continuous variables. While using count dat...

2350 sym 2 img