Publications by Ralph
R Blogs
There are many blogs on Statistics, R and other related topics scattered around the internet. The R bloggers website provides a central hub where feeds from participating blogs are collated so that they can be viewed from a single website. This resources certainly appears to be a good idea so that people can more easily identify blogs with inform...
852 sym
Codecogs – Open-Source library of numerical components
The Codecogs website provides an Open-source library of functions for numerical analysis. One interesting component available on the website is the LaTeX equation editor which can be used to create graphics files of equations to include on webpages. The webpage describe this component as a A web-based LaTeX equation editor that generates graphica...
1019 sym
One-way Analysis of Variance (ANOVA)
Analysis of Variance (ANOVA) is a commonly used statistical technique for investigating data by comparing the means of subsets of the data. The base case is the one-way ANOVA which is an extension of two-sample t test for independent groups covering situations where there are more than two groups being compared. In one-way ANOVA the data is sub-d...
3756 sym R (1659 sym/8 pcs) 4 img
One-way ANOVA (cont.)
In a previous post we considered using R to fit one-way ANOVA models to data. In this post we consider a few additional ways that we can look at the analysis. In the analysis we made use of the linear model function lm and the analysis could be conducted using the aov function. The code used to fit the model is very similar: > plant.mod2 = aov(we...
1394 sym R (721 sym/3 pcs) 2 img
Two-way Analysis of Variance (ANOVA)
The analysis of variance (ANOVA) model can be extended from making a comparison between multiple groups to take into account additional factors in an experiment. The simplest extension is from one-way to two-way ANOVA where a second factor is included in the model as well as a potential interaction between the two factors. As an example consider ...
5580 sym Python (2415 sym/12 pcs) 12 img
Design of Experiments – Block Designs
In many experiments where the investigator is comparing a set of treatments there is the possibility of one or more sources of variability in the experimental measurements that can be accounted for during the design stage of the experimentation. For example we might be investigating four different pieces of machinery using say two different opera...
2353 sym 4 img
Contingency Tables – Fisher’s Exact Test
A contingency table is used in statistics to provide a tabular summary of categorical data and the cells in the table are the number of occassions that a particular combination of variables occur together in a set of data. The relationship between variables in a contingency table are often investigated using Chi-squared tests. The simplest contin...
1843 sym R (313 sym/2 pcs)
Summarising data using dot plots
A dot plot is a type of display that compares counts, frequencies, totals or other summary measures for a series of categories. The dot plot can be arranged with the categories either on the vertical or horizontal axis of the display to allow comparising between the different categories as well as comparison within categories where there are mult...
3545 sym R (1275 sym/6 pcs) 6 img
Summarising data using histograms
The histogram is a standard type of graphic used to summarise univariate data where the range of values in the data set is divided into regions and a bar (usually vertical) is plotted in each of these regions with height proportional to the frequency of observations in that region. In some cases the proportion of data points in each region is sho...
3827 sym R (429 sym/4 pcs) 6 img
Working with themes in Lattice Graphics
The Trellis graphics approach provides facilities for creating effective graphs with a consistent look and feel and one of the good things about the system is the use of themes to define the colour, size and other features of the components that make up a graph. The lattice package in R is an implementation of the approach and in this post we wil...
2775 sym R (1634 sym/7 pcs) 2 img