Publications by Florian Privé

R package primefactr

09.08.2016

In this post, I will present my first R package, available on CRAN. It makes use of Prime Factorization for computations. This small R package was initially developed to compute hypergeometric probabilities which are used in Fisher’s exact test, for instance. It was also a way to get introduced with CRAN submission :’). Installation and Atta...

2351 sym R (420 sym/4 pcs)

A website and blog for R users

18.08.2016

In this post, I will show you how to quickly create your website, blog and first posts. This is designed for R users only. Philosophy This had to be free. This had to be easy. This had to only need RStudio and GitHub. Every content had to be previewable from RStudio. The website Follow this tutorial to create your own website in minutes. I cho...

2105 sym

Showing a different approach to making statistical tests

05.09.2016

In this post, I will talk about an alternative way to choose quantiles (and more broadly, decision boundaries) for statistical tests, the ones you choose in order to have a 95% confidence interval (5% of type-I error). I will then show that this idea can be used to combine tests. I will use some illustrations in R to make this clearer. I got this...

5091 sym R (1846 sym/3 pcs) 14 img

Making a team survey to get my colleagues hooked on R

06.10.2016

In this post, I will talk about the presentation of R that I did today, in the first week of my PhD. Usually, it is a team-only presentation. Yet, other people came because they were interested in learning more about R. How I get this idea? I get the idea of doing an R presentation while reading Getting Your Colleagues Hooked on R on R-bloggers....

2473 sym R (863 sym/4 pcs)

On the ifelse function

14.10.2016

In this post, I will talk about the ifelse function, which behaviour can be easily misunderstood, as pointed out in my latest question on SO. I will try to show how it can be used, and misued. We will also check if it is as fast as we could expect from a vectorized base function of R. How can it be used? The first example comes directly from the...

2052 sym R (2840 sym/9 pcs)

Tip: Optimize your Rcpp loops

28.12.2016

In this post, I will show you how to optimize your Rcpp loops so that they are 2 to 3 times faster than a standard implementation. Context Real data example For this post, I will use a big.matrix which represents genotypes for 15,283 individuals, corresponding to the number of mutations (0, 1 or 2) at 287,155 different loci. Here, I will use on...

2661 sym R (4393 sym/6 pcs) 2 img

Tip: Optimize your Rcpp loops

28.12.2016

In this post, I will show you how to optimize your Rcpp loops so that they are 2 to 3 times faster than a standard implementation. Context Real data example For this post, I will use a big.matrix which represents genotypes for 15,283 individuals, corresponding to the number of mutations (0, 1 or 2) at 287,155 different loci. Here, I will use on...

2670 sym R (4393 sym/6 pcs) 2 img

(Linear Algebra) Do not scale your matrix

02.06.2017

In this post, I will show you that you generally don’t need to explicitly scale a matrix. Maybe you wanted to know more about WHY matrices should be scaled when doing linear algebra. I will remind about that in the beginning but the rest will focus on HOW to not explicitly scale matrices. We will apply our findings to the computation of Princip...

4530 sym R (2704 sym/8 pcs) 4 img

Package bigstatsr: Statistics with matrices on disk (useR 2017)

20.07.2017

In this post, I will talk about my package bigstatsr, which I’ve just presented in a lightning talk of 5 minutes at useR!2017. You can listen to me in action there. I should have chosen a longer talk to explain more about this package, maybe next time. I will use this post to give you a more detailed version of the talk I gave in Brussels. Mot...

7133 sym R (5760 sym/14 pcs) 14 img

One month as a procrastinator on Stack Overflow

26.07.2017

Hello everyone, I’m 6103040 aka F. Privé. In this post, I will give some insights about answering questions on Stack Overflow (SO) for a month. One of the reason I’ve began frenetically answering questions on Stack Overflow was to procrastinate while finishing a scientific manuscript. My activity on Stack Overflow We’ll use David Robinson...

3063 sym R (2714 sym/7 pcs) 4 img