Publications by Econometrics and Free Software

Manipulate dates easily with {lubridate}

14.12.2018

This blog post is an excerpt of my ebook Modern R with the tidyverse that you can read for free here. This is taken from Chapter 5, which presents the {tidyverse} packages and how to use them to compute descriptive statistics and manipulate data. In the text below, I scrape a table from Wikipedia, which shows when African countries gained indepen...

4473 sym R (6715 sym/10 pcs) 4 img

Using the tidyverse for more than data manipulation: estimating pi with Monte Carlo methods

20.12.2018

This blog post is an excerpt of my ebook Modern R with the tidyverse that you can read for free here. This is taken from Chapter 5, which presents the {tidyverse} packages and how to use them to compute descriptive statistics and manipulate data. In the text below, I show how you can use the {tidyverse} functions and principles for the estimation...

3653 sym R (907 sym/5 pcs) 6 img

Objects types and some useful R functions for beginners

23.12.2018

This blog post is an excerpt of my ebook Modern R with the tidyverse that you can read for free here. This is taken from Chapter 2, which explains the different R objects you can manipulate as well as some functions to get you started. Objects, types and useful R functions to get started All objects in R have a given type. You already know most ...

14296 sym R (4419 sym/72 pcs) 8 img

Some fun with {gganimate}

26.12.2018

Your browser does not support the video tag. In this short blog post I show you how you can use the {gganimate} package to create animations from {ggplot2} graphs with data from UNU-WIDER. WIID data Just before Christmas, UNU-WIDER released a new edition of their World Income Inequality Database: *NEW #DATA*We’ve just released a new version o...

5990 sym R (2062 sym/10 pcs) 6 img

R or Python? Why not both? Using Anaconda Python within R with {reticulate}

29.12.2018

This short blog post illustrates how easy it is to use R and Python in the same R Notebook thanks to the {reticulate} package. For this to work, you might need to upgrade RStudio to the current preview version. Let’s start by importing {reticulate}: library(reticulate) {reticulate} is an RStudio package that provides “a comprehensive set of t...

3321 sym R (4459 sym/12 pcs) 6 img

Looking into 19th century ads from a Luxembourguish newspaper with R

03.01.2019

The national library of Luxembourg published some very interesting data sets; scans of historical newspapers! There are several data sets that you can download, from 250mb up to 257gb. I decided to take a look at the 32gb “ML Starter Pack”. It contains high quality scans of one year of the L’indépendence Luxembourgeoise (Luxembourguish ind...

9086 sym R (5688 sym/9 pcs) 16 img

Making sense of the METS and ALTO XML standards

12.01.2019

Last week I wrote a blog post where I analyzed one year of newspapers ads from 19th century newspapers. The data is made available by the national library of Luxembourg. In this blog post, which is part 1 of a 2 part series, I extract data from the 257gb archive, which contains 10 years of publications of the L’Union, another 19th century Luxem...

5606 sym R (6055 sym/13 pcs) 12 img

Using Data Science to read 10 years of Luxembourguish newspapers from the 19th century

30.01.2019

I have been playing around with historical newspaper data (see here and here). I have extracted the data from the largest archive available, as described in the previous blog post, and now created a shiny dashboard where it is possible to visualize the most common words per article, as well as read a summary of each article. The summary was made ...

1798 sym 4 img

Building a shiny app to explore historical newspapers: a step-by-step guide

03.02.2019

Introduction I started off this year by exploring a world that was unknown to me, the world of historical newspapers. I did not know that historical newspapers data was a thing, and have been thoroughly enjoying myself exploring the different datasets published by the National Library of Luxembourg. You can find the data here. In my first blog po...

9029 sym R (14593 sym/9 pcs) 12 img

Manipulating strings with the {stringr} package

09.02.2019

This blog post is an excerpt of my ebook Modern R with the tidyverse that you can read for free here. This is taken from Chapter 4, in which I introduce the {stringr} package. Manipulate strings with {stringr} {stringr} contains functions to manipulate strings. In Chapter 10, I will teach you about regular expressions, but the functions containe...

11329 sym R (18328 sym/45 pcs) 4 img