Publications by Tony Hirst

Generating Alerts From Guardian University Tables Data

23.06.2013

One of the things I’ve been pondering with respect to the whole data journalism process is how journalists without a lot of statistical training can quickly get a feel for whether there may be interesting story leads in a dataset, or how they might be able to fashion “alerts” that bring attention to data elements that might be worth investi...

8509 sym Python (3305 sym/5 pcs) 10 img

Generating Sankey Diagrams from rCharts

23.07.2013

A couple of weeks or so ago, I picked up an inlink from an OCLC blog post about Visualizing Network Flows: Library Inter-lending. The post made use of Sankey diagrams to represent borrowing flows, and by implication suggested that the creation of such diagrams is not as easy as it could be… Around the same time, @tiemlyportfolio posted a recipe...

6671 sym R (3455 sym/7 pcs) 38 img

Negative Payments in Local Spending Data

17.08.2013

In anticipation of a new R library from School of Data data diva @mihi_tr that will wrap the OpenSpending API and providing access to OpenSpending.org data directly from within R, I thought I’d start doodling around some ideas raised in Identifying Pieces in the Spending Data Jigsaw. In particular, common payment values, repayments/refunds and ...

4059 sym R (6565 sym/8 pcs) 8 img

Generating d3js Motion Charts from rCharts

04.11.2013

Remember Gapminder, the animated motion chart popularised by Hans Rosling in his TED Talks and Joy of Stats TV programme? Well it’s back on TV this week in Don’t Panic – The Truth About Population, a compelling piece of OU/BBC co-produced stats theatre featuring Hans Rosling, and a Pepper’s Ghost illusion brought into the digital age cour...

4258 sym R (5080 sym/7 pcs) 4 img

Local Council Spending Data – Time Series Charts

06.11.2013

In What Role, If Any, Does Spending Data Have to Play in Local Council Budget Consultations? I started wondering about the extent to which local spending transparency data might play a role in supporting consultation around new budgets. As a first pass, I’ve popped up a quick application up at http://glimmer.rstudio.com/psychemedia/iwspend2013_...

1925 sym 14 img

Data Textualisation – Making Human Readable Sense of Data

18.11.2013

A picture may be worth a thousand words, but whilst many of us may get a pre-attentive gut reaction reading from a data set visualised using a chart type we’re familiar with, how many of us actually take the time to read a chart thoroughly and maybe verbalise, even if only to ourselves, what the marks on the chart mean, and how they relate to e...

4862 sym 6 img

Setting Axis Limits on ggplot Charts

03.12.2013

I’ve been doodling some chart in R/ggplot using geom_text() to generate a labelled scatterplot. The chart actually builds up several layers using different datasets, so it’s not obvious how to set the ranges cleanly: I know the lower bound I want for the y-axis (y=0), but I want to let the upper bound float. There’s also an issue with the l...

880 sym R (324 sym/1 pcs) 8 img

Using One Programming Language In the Context of Another – Python and R

22.01.2014

Over the last couple of years, I’ve settled into using R an python as my languages of choice for doing stuff: R, because RStudio is a nice environment, I can blend code and text using R markdown and knitr, ggplot2 and Rcharts make generating graphics easy, and reshapers such as plyr make wrangling with data realtvely easy(?!) once you get into...

3570 sym 4 img

Quoting Tukey on Visual Storytelling with Data

23.01.2014

Time was when I used to be a reasonably competent scholar, digging into the literature chasing down what folk actually said, and chasing forward to see whether claims had been refuted. Then I fell out of love with the academic literature – too many papers that said nothing, too many papers that contained errors, too many papers… …but as we ...

16862 sym 4 img

Recreational Data

21.03.2014

Part of my weekend ritual is to buy the weekend papers and have a go at the recreational maths problems that are Sudoku and Killer. I also look for news stories with a data angle that might prompt a bit of recreational data activity… In a paper that may or may not have been presented at the First European Congress of Mathematics in Paris, July,...

4702 sym 6 img