Publications by benjaminlmoore

9 reasons to use RStudio

16.10.2012

In no particular order, here are nine reasons why I really like the RStudio IDE for the R statistical programming language. 1) R benefits from an IDE – I accept that in some languages an IDE is unnecessary—Perl is the first example that comes to mind—and in some languages it’s near-essential (Java). A good case can be made that R is at ...

4541 sym 8 img

Analyse your bank statements using R

04.01.2014

Online banking has made reviewing statements and transferring money more convenient than ever before, but most still rely on external methods for looking at their personal finances. However, many banks will happily give you access to long-term transaction logs, and these provide a great opportunity to take a DIY approach. I’ll be walking throug...

4412 sym R (1590 sym/7 pcs) 10 img

Meticulously recreating bitmap plots in R

03.02.2014

There’s a hard-fought drive on Wikimedia commons to convert those images that should be in vector format (i.e. graphs, diagrams) from their current bitmap form. At the time of writing, the relevant category has over 7000 images in the category “Images that should use vector graphics”. The usual way people move between the two is by tracing ...

3489 sym 12 img

Slidify: Modern, simple presentations written in R Markdown

24.02.2014

As a LaTeX fan I’m used to using Beamer for presentations, but the built-in themes are definitely starting to show their age — and writing a custom .sty file looks like a nightmare — so for a while I’ve been looking at trying out an HTML5 framework. Reveal.js is a great looking HTML presentation framework from Hakim El Hattab. The first ...

6855 sym R (442 sym/2 pcs) 10 img

What are the most common RNG seeds used in R scripts on Github?

06.03.2014

In the R programming language, the random number generator (RNG) is seeded each session using the current time and process ID. Via the magic of the popular Mersenne Twister PRNG, the values stored in .Random.seed are used sequentially each time “randomness” is invoked in a function. This means, of course, that the same function run in differe...

4546 sym R (644 sym/2 pcs) 10 img

Guardian data blog — UK general election analysis in R

18.03.2014

The Guardian newspaper has for a few years been running a data blog and has built up a massive repository of (often) well-curated datasets on a huge number of topics. They even have an indexed list of all data sets they’ve put together or reused in their articles. It’s a great repository of interesting data for exploratory analysis, and the...

3257 sym R (372 sym/1 pcs) 10 img

Author inflation in academic literature

06.04.2014

There seems to be a general consensus that author lists in academic articles are growing. Wikipedia says so, and I’ve also come across a published letter and short Nature article which accept this is the case and discuss ways of mitigating the issue. Recently there was an interesting discussion on academia.stackexchange on the subject but again...

7158 sym R (461 sym/2 pcs) 16 img

What are the most overrated films?

05.05.2014

“Overrated” and “underrated” are slippery terms to try to quantify. An interesting way of looking at this, I thought, would be to compare the reviews of film critics with those of Joe Public, reasoning that a film which is roundly-lauded by the Hollywood press but proved disappointing for the real audience would be “overrated” and vi...

6293 sym 14 img

Celebrity twitter followers by gender

25.05.2014

The most popular accounts on twitter have millions of followers, but what are their demographics like? Twitter doesn’t collect or release this kind of information, and even things like name and location are only voluntarily added to people’s profiles. Unlike Google+ and Facebook, twitter has no real name policy, they don’t care what you c...

6573 sym R (953 sym/3 pcs) 10 img

EdinbR: A new R usergroup for Edinburgh

11.02.2015

Inspired by succesful RUGs like LondonR and CambR, I’m pleased to announce a new R usergroup for those in and around Edinburgh: EdinbR! Edinburgh has a large research community using R, spread across different campuses and even universities so a centralised discussion group is long overdue. Many R packages have been developed by Edinburgh rese...

1678 sym 6 img