Publications by Aaron Schlegel

Image Compression with Singular Value Decomposition

10.11.2016

As mentioned in a previous post, image compression with singular value decomposition is a frequently occurring application of the method. The image is treated as a matrix of pixels with corresponding color values and is decomposed into smaller ranks that retain only the essential information that comprises the image. In this example, we are inter...

4322 sym R (1574 sym/7 pcs) 18 img

Discriminant Analysis for Group Separation in R

17.11.2016

The term ‘discriminant analysis’ is often used interchangeably to represent two different objectives. These objectives of discriminant analysis are: Description of group separation. Linear combinations of variables, known as discriminant functions, of the dependent variables that maximize the separation between the groups are used to identif...

6868 sym R (1627 sym/11 pcs) 2 img

Multiple Analysis of Variance (MANOVA)

01.12.2016

MANOVA, or Multiple Analysis of Variance, is an extension of Analysis of Variance (ANOVA) to several dependent variables. The approach to MANOVA is similar to ANOVA in many regards and requires the same assumptions (normally distributed dependent variables with equal covariance matrices). This post will explore how MANOVA is performed and interpr...

8687 sym R (4814 sym/10 pcs)

MANOVA Test Statistics with R

08.12.2016

Multiple tests of significance can be employed when performing MANOVA. The most well known and widely used MANOVA test statistics are Wilk’s \(\Lambda\), Pillai, Lawley-Hotelling, and Roy’s test. Unlike ANOVA in which only one dependent variable is examined, several tests are often utilized in MANOVA due to its multidimensional nature. Each M...

5016 sym R (4224 sym/11 pcs)

Discriminant Analysis of Several Groups

15.12.2016

Discriminant analysis is also applicable in the case of more than two groups. In the first post on discriminant analysis, there was only one linear discriminant function as the number of linear discriminant functions is \(s = min(p, k – 1)\), where \(p\) is the number of dependent variables and \(k\) is the number of groups. In the case of more...

6466 sym R (5259 sym/10 pcs)

Classification with Linear Discriminant Analysis

23.12.2016

Classification with linear discriminant analysis is a common approach to predicting class membership of observations. A previous post explored the descriptive aspect of linear discriminant analysis with data collected on two groups of beetles. In this post, we will use the discriminant functions found in the first post to classify the observation...

6600 sym R (2594 sym/12 pcs)

Quadratic Discriminant Analysis of Two Groups

29.12.2016

As mentioned in the post on classification with linear discriminant analysis, LDA assumes the groups in question have equal covariance matrices \((\Sigma_1 = \Sigma_2 = \cdots = \Sigma_k)\). Therefore, often when the groups do not have equal covariance matrices, observations are frequently assigned to groups with large variances on the diagonal o...

4408 sym R (3915 sym/7 pcs)

LDA for Classification into Several Groups

05.01.2017

Similar to the two-group linear discriminant analysis for classification case, LDA for classification into several groups seeks to find the mean vector that the new observation \(y\) is closest to and assign \(y\) accordingly using a distance function. The several group case also assumes equal covariance matrices amongst the groups \((\Sigma_1 = ...

4299 sym R (4599 sym/10 pcs)

Quadratic Discriminant Analysis of Several Groups

12.01.2017

Quadratic discriminant analysis for classification is a modification of linear discriminant analysis that does not assume equal covariance matrices amongst the groups \((\Sigma_1, \Sigma_2, \cdots, \Sigma_k)\). Similar to LDA for several groups, quadratic discriminant analysis of several groups classification seeks to find the group that maximize...

5327 sym R (5181 sym/11 pcs)

Principal Component Analysis

19.01.2017

Often, it is not helpful or informative to only look at all the variables in a dataset for correlations or covariances. A preferable approach is to derive new variables from the original variables that preserve most of the information given by their variances. Principal component analysis is a widely used and popular statistical method for reduci...

14843 sym R (10308 sym/21 pcs) 8 img