Publications by R | datawookie
Private Security and the Pareto Principle
Private Security is a big industry in South Africa. Most Private Security companies promise to provide a rapid response to every callout generated by any of their customers. There is a delicate balance between the number of response vehicles and the number of customers (and the frequency of their callouts!), which determines whether or not they a...
4966 sym 10 img
Installing Prophet on CentOS
I’ve been struggling to get the {prophet} package to install on two RHEL 7 systems. Since I only have intermittent access to those systems I opted to use a local install of CentOS as a proxy environment. These instructions work for CentOS and should generalise to RHEL too. Setting Up the Environment The first thing that I needed to do was crea...
3049 sym R (2177 sym/18 pcs) 2 img
Installing Prophet on CentOS
I’ve been struggling to get the {prophet} package to install on two RHEL 7 systems. Since I only have intermittent access to those systems I opted to use a local install of CentOS as a proxy environment. These instructions work for CentOS and should generalise to RHEL too. Setting Up the Environment The first thing that I needed to do was creat...
2680 sym R (1807 sym/14 pcs)
Scraping Machinery Parts
I’ve been exploring the feasibility of aggregating data on prices of replacement parts for heavy machinery. There are a number of websites which list this sort of data. I’m focusing on the static sites for the moment. I’m using are R with {rvest} (and a few other Tidyverse packages thrown in for good measure). library(glue) library(dplyr) l...
1896 sym R (2366 sym/7 pcs) 2 img
Scraping Machinery Parts
I’ve been exploring the feasibility of aggregating data on prices of replacement parts for heavy machinery. There are a number of websites which list this sort of data. I’m focusing on the static sites for the moment. I’m using are R with {rvest} (and a few other Tidyverse packages thrown in for good measure). library(glue) library(dplyr) l...
1903 sym R (2359 sym/7 pcs) 2 img
An API for @racently
@racently is a side project that I have been nursing along for a couple of years. It addresses a problem that I have as a runner: my race results are distributed across a variety of web sites. This makes it difficult to create a single view on my running performance (or lack thereof) over time. I suspect that I am not alone in this. Anyway, @race...
3806 sym R (2691 sym/6 pcs) 6 img
An API for @racently
@racently is a side project that I have been nursing along for a couple of years. It addresses a problem that I have as a runner: my race results are distributed across a variety of web sites. This makes it difficult to create a single view on my running performance (or lack thereof) over time. I suspect that I am not alone in this. Anyway, @race...
3812 sym R (2685 sym/6 pcs) 6 img
Durban EDGE DataQuest
The Durban EDGE (Economic Development and Growth in eThekwini) DataQuest was held at UKZN (Westville Campus) on 13 November 2019. Participants were tasked with creating something interesting and useful with the civic data on the new Durban EDGE Open Data Portal developed by Open Data Durban. These datasets were available: EThekwini Water and San...
2945 sym 10 img
Durban EDGE DataQuest
The Durban EDGE (Economic Development and Growth in eThekwini) DataQuest was held at UKZN (Westville Campus) on 13 November 2019. Participants were tasked with creating something interesting and useful with the civic data on the new Durban EDGE Open Data Portal developed by Open Data Durban. These datasets were available: EThekwini Water and San...
2878 sym 10 img
R Package for @racently
I recently wrote about an API for @racently. The next logical step was to build a package which wraps the API so that the data can easily be pulled into R. The package is available here. It is still very much a work in progress: the API only exposes two endpoints, but both of them are wrapped in the package. Installation Install using {devtools}...
1697 sym R (4825 sym/3 pcs) 2 img