Publications by R on Publishable Stuff

A template for creating card sorting games in R

03.11.2024

Last week I made the small card sorting game called The Climate Impact Sorting Challenge where the challenge is to sort cards with different foods in the order of their climate impact. But then the thought hit me: Any time you find yourself with a dataset with labels (say, types of foods) mapped to numbers (say, climate impact in CO2e) you could...

2488 sym R (2005 sym/3 pcs) 4 img

The Climate Impact Sorting Challenge

18.10.2024

Try out The Climate Impact Sorting Challenge! A quick game I just made that teaches you about the climate impact of different kinds of food. This game can be played in two different ways. You can play it by yourself and try to beat your own high score (mine is 9). If you’re in a group, you can play “last man standing” style, where you tak...

2441 sym 2 img

A Bayesian Plackett-Luce model in Stan applied to pinball championship data

21.09.2024

Sometimes it feels a bit silly when a simple statistical model has a fancy-sounding name. But it also feels good to drop the following in casual conversation: “Ah, then I recommend a Plackett-Luce model, a straightforward generalization of the Bradley–Terry model, you know”, when a friend wonders how they could model their, say, pinball champ...

9886 sym R (10586 sym/13 pcs) 10 img

CopenhagenR, the 2024 spring season

03.08.2024

This is just a post to brag about that the CopenhagenR useR group is alive and kicking, again. After COVID-19, the group (like so many other meetups) was on hiatus for a couple of years and without an organizer. In 2023, I thought I would try starting it again and, while it took a little while, I’m happy that I got together five great meetups...

3290 sym 12 img

Public Pinball Machines per Capita: A new global indicator

06.06.2024

There are tons of well-known global indicators. We’ve all heard of gross domestic product, life expectancy, rate of literacy, etc. But, ever since I discovered pinballmap.com, possibly the world’s most comprehensive database of public pinball locations, I’ve been thinking about a potential new global indicator: Public Pinball Machines per ...

4482 sym R (8627 sym/19 pcs) 14 img

Why pandas feels clunky when coming from R

19.02.2024

Five years ago I started a new role and I suddenly found myself, a staunch R fan, having to code in Python on a daily basis. Working with data, most of my Python work involved using pandas, the Python data frame library, and initially I found it quite hard and clunky to use, being used to the silky smooth API of R’s tidyverse. And you know ...

8321 sym R (6121 sym/21 pcs)

Baking the cake dataset cake

11.02.2024

Now that I’ve got my hands on the source of the cake dataset I knew I had to attempt to bake the cake too. Here, the emphasis is on attempt, as there’s no way I would be able to actually replicate the elaborate and cake-scientifically rigorous recipe that Cook followed in her thesis. Skipping things like beating the eggs exactly “125 st...

3641 sym Python (1297 sym/2 pcs) 10 img

The source of the cake dataset

27.01.2024

In statistics, there are a number of classic datasets that pop up in examples, tutorials, etc. There’s the infamous iris dataset (just type iris in your nearest R prompt), the Palmer penguins (the modern iris replacement), the titanic dataset(s) (I hope you’re not a guy in 3rd class!), etc. While looking for a dataset to illustrate a s...

7459 sym 34 img 1 tbl

Get a Git repo where your team can stow their throwaway data science code!

01.12.2023

When I started working as a Data Scientist nearly ten years ago, the data science team I joined did something I found really strange at first: They had a single GitHub repo where they put all their “throwaway” code. An R script to produce some plots for a presentation, a Python notebook with a machine learning proof-of-concept, a bash script fo...

2293 sym 2 img

officialcocktails.com – An AI-generated cocktails site

20.03.2023

The first thing I thought when I tried all the cool tools of the Year of the AI Revolution (aka 2022) was: OMG this is amazing, it’s the AI future that I never thought I would see. The second thing I thought was: OMG this is going to be used to spam the internet with so much bland auto-generated content. I hate bland auto-generated content as muc...

4744 sym R (3944 sym/2 pcs) 6 img