Publications by Tony Hirst

Experimenting With R – Point to Point Mapping With Great Circles

24.03.2014

I’ve started doodling again… This time, around maps, looking for recipes that make life easier plotting lines to connect points on maps. The most attractive maps seem to use great circles to connect one point with another, these providing the shortest path between two points when you consider the Earth as a sphere. Here’s one quick experime...

4247 sym R (6614 sym/9 pcs) 6 img

Visualising Pandas DataFrames With IPythonBlocks – Proof of Concept

26.03.2014

A few weeks ago I came across IPythonBlocks, a Python library developed to support the teaching of Python programming. The library provides an HTML grid that can be manipulated using simple programming constructs, presenting the outcome of the operations in a visually meaningful way. As part of a new third level OU course we’re putting together...

2931 sym R (64 sym/1 pcs) 8 img

Recreational Data: Data Golf

23.05.2014

I’m still hopeful of working up the idea of recreational data as a popular pastime activity with a regular column somewhere and a stocking filler book each Christmas (?!;-), but haven’t had much time to commit to working up some great examples lately:-( However, here’s a neat idea – data golf – as described in a post by Bogumił Kamińs...

1843 sym 4 img

F1 Doing the Data Visualisation Competition Thing With Tata?

02.07.2014

Sort of via @jottevanger, it seems that Tata Communications announces the first challenge in the F1® Connectivity Innovation Prize to extract and present new information from Formula One Management’s live data feeds. (The F1 site has a post Tata launches F1® Connectivity Innovation Prize dated “10 Jun 2014″? What’s that about then?) Tat...

6783 sym Python (4722 sym/5 pcs) 10 img

Opening Up Access to Data: Why APIs May Not Be Enough…

11.08.2014

Last week, a post on the ONS (Office of National Statistics) Digital Publishing blog caught my eye: Introducing the New Improved ONS API which apparently “mak[es] things much easier to work with”. Ooh… exciting…. maybe I can use this to start hacking together some notebooks?:-) It was followed a few days later by this one – ONS-API, Jus...

7010 sym 4 img

Running “Native” Data Wrangling Applications in the Browser – IPython Notebooks (and R?) in Chrome

22.08.2014

Using browser based data analysis toolkits such as pandas in IPython notebooks, or R in RStudio, means you need to have access to python or R and the corresponding application server either on your own computer, or running on a remote server that you have access to. When running occasional training sessions or workshops, this can cause several he...

3333 sym 4 img

Wrangling F1 Data With R – F1DataJunkie Book

30.10.2014

Earlier this year I started trying to pull together some of my #f1datajunkie R-related ramblings together in a book form. The project stalled, but to try to reboot it I’ve started publishing it as a living book over on Leanpub. Several of the chapters are incomplete – with TO DO items sketched in, others are still unpublished. The beauty of t...

3067 sym 4 img

F1 Championship Race, 2014 – Winning Combinations…

08.11.2014

As we come up to the final two races of the 2014 Formula One season, the double points mechanism for the final race means that two drivers are still in with a shot at the Drivers’ Championship: Lewis Hamilton and Nico Rosberg. As James Allen describes in Hamilton closes in on world title: maths favour him but Abu Dhabi threat remains: Hamilton ...

2480 sym 6 img

Information Density and Custom Chart Designs

21.11.2014

I’ve been doodling today with a some charts for the Wrangling F1 Data With R living book, trying to see how much information I can start trying to pack into a single chart. The initial impetus came simply from thinking about a count of laps led in a particular race by each drive; this morphed into charting the number of laps in each position fo...

2408 sym R (1201 sym/1 pcs) 6 img

Identifying Position Change Groupings in Rank Ordered Lists

09.12.2014

The title says it all, doesn’t it?! Take the following example – it happens to show race positions by driver for each lap of a particular F1 grand prix, but it could be the evolution over time of any rank-based population. The question I had in mind was – how can I identify positions that are being contested during a particular window of t...

3285 sym R (1496 sym/5 pcs) 10 img