Publications by Learning Machines

Euro 2020: Will Switzerland kick out Spain too?

29.06.2021

One of the big sensations of the UEFA Euro 2020 is that Switzerland kicked out world champion France. We take this as an opportunity to share with you a simple statistical model to predict football (soccer) results with R, so read on! Football is a highly stochastic game, which is one of the reasons for its appeal: anything can happen! But there...

3086 sym R (966 sym/1 pcs) 6 img

Euler Coding Challenge: Build Maths’ Most Beautiful Formula in R

14.07.2021

In this post, we will first give some intuition for and then demonstrate what is often called the most beautiful formula in mathematics, Euler’s identity, in R – first numerically with base R and then also symbolically, so read on! Euler’s identity (also known as Euler’s equation) is the equality $e^{i\pi} + 1 = 0$, where $e$ is Euler’s number, the base ...

4144 sym R (332 sym/2 pcs) 50 img

The Small Data Rule: Infer the Big Picture from only Five Values!

27.07.2021

Everybody is talking about big data but the real skill lies in the art of inferring useful information from only a handful of values! If you want to learn how to determine the range of the typical value of a dataset (i.e. the median) with just five values and why this works, read on! This blog post is inspired by a chapter from the wonderful boo...

4472 sym R (315 sym/4 pcs) 2 img

Learning Path for “Data Science with R” – Part I

17.08.2021

Over the course of the last two and a half years, I have written over one hundred posts for my blog “Learning Machines” on the topics of data science, i.e. statistics, artificial intelligence, machine learning, and deep learning. I use many of those in my university classes, and in this post I will give you the first part of a learning path f...

6518 sym 56 img

The “Youth Bulge” of Afghanistan: The Hidden Force behind Political Instability

20.08.2021

In view of the current dramatic events in Afghanistan, many wonder why the extensive international efforts to bring some stability to the country have failed so miserably. In this post, we will present and analytically examine a fascinating theory that seems to be able to explain political (in-)stability almost mono-causally, so read on! The theo...

5726 sym R (4161 sym/4 pcs) 4 img

New Bundesliga Forecasting Tool: Can Underdog Hertha Berlin beat Bayern Munich?

25.08.2021

The Bundesliga is Germany’s primary football league. It is one of the most important football leagues in the world, broadcast on television in over 200 countries. If you want to get your hands on a tool to forecast the result of any game (and perform some more statistical analyses), read on! The basis of our forecasting tool was laid in this b...

3071 sym R (1757 sym/3 pcs) 6 img

The Pólya Urn Model: A simple Simulation of “The Rich get Richer”

07.09.2021

What is the “opposite” of sampling without replacement? In a classical urn model, sampling without replacement means that you don’t replace the ball that you have drawn. Therefore, the probability of drawing that colour becomes smaller. How about the opposite, i.e. that the probability becomes bigger? Then you have a so-called Pólya urn mode...

3054 sym R (1046 sym/6 pcs) 14 img

Is the Stock Market Efficient? Let your ZIP Compression Tool give an Answer!

14.09.2021

One of the most fiercely fought debates in quantitative finance is whether financial markets, and the stock market in particular, are efficient, i.e. whether or not they contain patterns that can be exploited profitably. If you want to learn about an ingenious method (that is already present on practically every computer) to approach that question, read o...

5662 sym R (4237 sym/6 pcs) 2 img

The Most Dangerous Equation, or Why Small is Not Beautiful!

22.09.2021

Over one billion dollars have been spent in the US to split up big schools into smaller ones because small schools regularly show up in rankings as top performers. In this post, I will show you why that money was wasted because of a widespread (but not so well known) statistical artifact, so read on! Why do small schools perform better? Many are...

4810 sym R (3915 sym/5 pcs) 12 img

Solving Einstein’s Puzzle with Constraint Programming

30.09.2021

The following puzzle is a well-known meme in social networks. It is said to have been invented by young Einstein, and back in the day I was ambitious enough to solve it by hand (you should try too!). Yet it is even simpler to use Constraint Programming (CP). An excellent choice for doing that is MiniZinc, a free and open-source constraint modelling...

2737 sym R (3673 sym/2 pcs) 2 img