Publications by David Smith

Because it’s Friday: Detecting Cylons

11.12.2009

Battlestar Galactica (Ronald D Moore’s reimagined version of the rather cheesy 70’s sci-fi series) has been my favourite TV series (of any genre) of recent years, so I’m especially excited that Chris Bilder has given me the chance to blog about it. Chris, an Associate Professor in the Department of Statistics at the University of Nebraska-L...

3605 sym 2 img

R 2.10.1 released

14.12.2009

The latest update to R, R 2.10.1, is now available for download in source form from your local CRAN mirror. Binary versions (for Mac, Windows, and Linux) will become available over the next few days. As a maintenance release, this update focuses on minor changes and bug fixes. The complete list of changes is available in the NEWS file, but some o...

1166 sym

NYT on breast cancer screening and probability

14.12.2009

The New York Times last weekend looked at the controversy around the recent changes to the mammogram guidelines from a mathematical perspective. Compared to the analysis based on Bayes’ Theorem from the Harvard Social Science Statistics blog (which apparently caused some controversy itself: that post was deleted and later replaced after some er...

2376 sym

According to Microsoft, the fourth paradigm of science is data

16.12.2009

In scientific discovery, the first three paradigms were experimental, theoretical and (more recently) computational science. A new book of essays published by Microsoft (and available for free download — kudos, MS!) argues that a fourth paradigm of scientific discovery is at hand: the analysis of massive data sets. The book is dedicated to the...

1983 sym

Why use plyr?

17.12.2009

The “apply” family of functions in R (apply, sapply, lapply) is a very powerful suite of tools for iterating through structures of data and returning the combined results of each iteration. But with great power comes great responsibility (or something like that): these functions can sometimes be frustratingly difficult to get working exactly ...

1270 sym

Because it’s Friday: The decline of empires

18.12.2009

Here’s a neat visualization of the decline of the British, Spanish, Portugese and French empires from 1800 to present day. It’s definitely more art than stats — judging by the relative size of India and Australia I think the circles are scaled to area, not population — but it definitely does capture the drama and the ebb and flow of colon...

995 sym

Singapore, February 19-20: Computational Topics in Finance

21.12.2009

With all of the winter snows in the US this weekend, a trip to equatorial climes sounds pretty good right about now. That makes this email announcement from Rmetrics leader Diethelm Wuertz all the more tempting: Conference on ‘Computational Topics in Finance’February 19/20, 2010, National University of Singapore Dear R/Rmetrics Community, We...

1854 sym

Forecasting the weather with R

22.12.2009

The US National Centers for Environment Prediction (NCEP) produces weather forecasts for the entire world from a model that’s updated every 6 hours. The data is made freely available, and with a couple of free tools to convert the data and R you can easily produce am unpdated global weather forecast like this (click to enlarge): (Check out the ...

1038 sym 2 img

R in India: The Hindu

23.12.2009

The Hindu, a leading English-language newspaper in India, published an article on December 21 about doing research with open-source tools and R got a prominent mention:  Though commercial statistical packages are popular among researchers, their licensing costs drive people away from them. In this context, R https://www.r-project.org, the open s...

1935 sym

A web-based graphics application based on R

24.12.2009

FlowingData recently took a look at Jeroen Ooms’ latest web-based statistical tool based on R. We’ve looked at his tools for random-effects models and finance visualizations before, but this one is a more general tool for creating graphs from data sets using the ggplot2 package. It’s pretty slick. All you need to do is upload a data set (in...

1935 sym 2 img