Publications by Tony Hirst

New Version of “Wrangling F1 Data With R” Just Released…

05.02.2016

So I finally got round to pushing a revised (and typo-corrected!) version of Wrangling F1 Data With R: A Data Junkie’s Guide, which also includes a handful of new sections and chapters, including descriptions of how to detect undercuts, the new style race history chart that shows the on-track position of each driver for each lap of a race relativ...

1928 sym 4 img

Another Route to Jupyter Notebooks – Azure Machine Learning

31.03.2016

In much the same way that the IBM Data Scientist Workbench seeks to provide some level of integration between analysis tools such as Jupyter notebooks and data access and storage, Azure Machine Learning Studio also provides a suite of tools for accessing and working with data in one location. Microsoft’s offering is new to me, but it crossed my ...

3539 sym 18 img

Visualising F1 Stint Strategies

06.04.2016

With the new F1 season upon us, I’ve started tinkering with bits of code from the Wrangling F1 Data With R book and looking at the data in some new ways. For example, I started wondering whether we might be able to learn something interesting about the race strategies by looking at laptimes on a stint by stint basis. To begin with, we need some ...

3626 sym Python (2222 sym/7 pcs) 12 img

Accessing a Neo4j Graph Database Server from RStudio and Jupyter R Notebooks Using Docker Containers

12.04.2016

In Getting Started With the Neo4j Graph Database – Linking Neo4j and Jupyter SciPy Docker Containers Using Docker Compose I posted a recipe demonstrating how to link a Jupyter notebook container with a neo4j container to provide a quick way to get up and running with neo4j from a Python environment. It struck me that it should be just as easy to...

2006 sym R (386 sym/1 pcs) 10 img

First Thoughts on Automatically Generating Accessible Text Descriptions of ggplot Charts in R

29.04.2016

In a course team accessibility briefing last week, Richard Walker briefly mentioned a tool for automatically generating text descriptions of Statistics Canada charts to support accessibility. On further probing, the tool, created by Leo Ferres, turned out to be called iGraph-Lite: … an extensible system that generates natural language descripti...

6595 sym R (1129 sym/1 pcs) 6 img

When Documents Become Databases – Tabulizer R Wrapper for Tabula PDF Table Extractor

02.05.2016

Although not necessarily the best way of publishing data, data tables in PDF documents can often be extracted quite easily, particularly if the tables are regular and the cell contents reasonably spaced. For example, official timing sheets for F1 races are published by the FIA as event and timing information in a set of PDF documents containing ta...

3219 sym 16 img

First Thoughts on Detecting Motorsport Safety Car Periods from Laptimes

02.05.2016

Prompted by Markku Hänninen, I thought I’d have a quick look at estimating motorsport safety car laps from a set of laptime data. For the uninitiated, if there is a dangerous hazard on track, the race cars are kept out while the hazard is cleared, but led around by a safety car that limits the pace. No overtaking is allowed for race position,...

3394 sym R (857 sym/6 pcs) 4 img

Using Docker as a Personal Productivity Tool – Running Command Line Apps Bundled in Docker Containers

03.05.2016

With its focus on enterprise use, it’s probably with good reason that the Docker folk aren’t that interested in exploring the role that Docker may have to play as a technology that supports the execution of desktop applications, or at least, applications for desktop users. (The lack of significant love for Kitematic seems to be representative...

4473 sym 4 img

DH Box – Digital Humanities Virtual Workbench

26.05.2016

As well as offering digital application shelves, should libraries offer, or act as institutional sponsors of, digital workbenches? I’ve previously blogged about things like SageMathCloud, an application-based learning environment, and the IBM Data Scientist Workbench, and today came across another example: DHBox, CUNY’s digital humanities lab...

4046 sym 18 img

Running the Numbers – How Can Hamilton Still Take the 2016 F1 Drivers’ Championship?

31.10.2016

Way back in 2012, I posted a simple R script for trying to work out the finishing combinations in the last two races of that year’s F1 season for Fernando Alonso and Sebastian Vettel to explore the circumstances under which Alonso could take the championship (Paths to the F1 2012 Championship Based on How They Might Finish in the US Grand Prix)...

1359 sym 6 img