Publications by Roel M. Hogervorst

Creating One Unified Calendar of all Data Science Events in the Netherlands

01.12.2022

I enjoy learning new things about machine learning, and I enjoy meeting like minded people too. That is why I go to meetups and conferences. But not everyone I meet becomes a member of every group. So I keep sending my coworkers new events that I hear about here in the Netherlands. And it is easy to overlook a new event that comes in over email. ...

2425 sym R (848 sym/2 pcs) 6 img

WTF is Kubernetes and Should I Care as R User?

16.04.2022

I’m going to give you a high overview of kubernetes and how you can make your R work shine in kubernetes. Are you, an R-user in a company that uses kubernetes? building R applications (models that do predictions, shiny applications, APIs)? curious about this whole kubernetes thing that your coworkers are talking about? somewhat afraid? Then I...

12604 sym 10 img

Don’t Panic! a Scientific Approach to Debugging Production Failure

24.04.2022

Your production system just broke down. What should you do now? Can you imagine your shiny application / flask app, or your API service breaking down? As a beginning programmer, or operations (or devops) person it can be overwhelming to deal with logs, messages, metrics and other possible relevant information that is coming at you at such a point...

7459 sym 6 img

Portioning projects

13.02.2016

Often we write programs to automate things. The programs range from simple to complex. But in essence, you always do the same thing: You are trying to solve a problem. A common pitfall, at least for me, is that you start out to big. What you need to do is start simple and small, and only if your simple thing works, increase the complexity. Separa...

3707 sym 2 img

From spss to R, part 1

19.02.2016

Introduction This whole blog is devoted to R and clean coding in R. But what if you want to start with R? There are millions of websites devoted to learning R. just look at the number of hits on a certain search machine. Most of these hits start with the basics and slowly work your way up to more advanced examples. There is often one reason to...

6341 sym R (2131 sym/6 pcs) 12 img

From spss to R, part 2

21.02.2016

Introduction In this lesson we will open a .sav file in Rstudio and manipulate the data.frame. We will select parts of the file and create some simple overviews. First time with R? No problem, see lesson 1 (https://rmhogervorst.github.io/cleancode/blog/2016/02/20/from-spss-to-r-part1.html#introduction “From spss to R, part 1”) Introduction ...

9457 sym R (3622 sym/21 pcs) 2 img

Tidying your data

23.02.2016

Introduction To make analyses work we often need to change the way files look. Sometimes information is recorded in a way that was very efficient for input but not workable for your analyses. In other words, the data is messy and we need to make it tidy. Tidy data means 1: Each variable forms a column. Each observation forms a row. Each type of ...

4785 sym R (954 sym/5 pcs) 4 img

Version control with Git

29.02.2016

Keeping track of versions You work on a project and would like to keep track of what you did. That is why keep old versions of your files. That way you can go back if you messed up beyond recognition. Usually that looks like this: Or you use dropbox or something like it: Other people use email. Emailing to themselves or to collaborators when ...

9333 sym R (136 sym/1 pcs) 30 img

From spss to R, part 3

01.03.2016

In this post we will start with a build-in dataset and some basic ggplot graphics. In the next post we will combine dplyr and ggplot to do awesome stuff with the Dutch University student counts from the previous lessons. We will work with the build-in dataset mtcars. There are many datasets in r library(help = "datasets") but in many examples onl...

8288 sym R (2703 sym/13 pcs) 20 img

Creating a package for your data set

06.03.2016

Turning your dataset into a package is very useful for reproducable research. This tutorial is for you, even if you’ve never created a package in r. Why would you turn your dataset into a package? very easy to share easy to load (library(name) is easier then load("path/to/file") or data<-read.csv("path/to/file") etc.) documentation is part of ...

7572 sym R (399 sym/5 pcs) 20 img