Publications by Random R Ramblings

poorman: Select helpers, bug fixes and tests, tests, tests!

13.04.2020

Introduction Hello all and welcome to another edition of the poorman series of blog posts. In this series I am discussing my progress in writing a base R equivalent of dplyr. What’s nice about this series is that if you’re not into poorman and would prefer just to use dplyr, then that’s absolutely OK! By highlighting poorman functionality, ...

poorman: The Selectificator 2000!

17.05.2020

Introduction Welcome to my series of blog posts about my data manipulation package, {poorman}. For those of you that don’t know, {poorman} is aiming to be a replication of {dplyr} but using only {base} R, and therefore be completely dependency free. What’s nice about this series is that if you’re not into {poorman} and would prefer just to ...

poorman: Version 0.2.1 Release

01.07.2020

Introduction Welcome to my series of blog posts about my data manipulation package, {poorman}. For those of you that don’t know, {poorman} is aiming to be a replication of {dplyr} but using only {base} R, and therefore be completely dependency free. What’s nice about this series is that if you would rather just use {dplyr}, then that’s abso...

poorman: Versions 0.2.2 and 0.2.3 Releases

14.11.2020

Introduction Welcome to my series of blog posts about my data manipulation package, {poorman}. For those of you that don’t know, {poorman} is aiming to be a replication of {dplyr} but using only {base} R, and therefore be completely dependency free. What’s nice about this series is that if you would rather just use {dplyr}, then that’s abso...

Using Functions As An Input To Functions With {dbplyr}

18.02.2021

The Problem I wish to write a function that can mutate a column of my data using a function which is passed as an input. For example, I want to write a function which can modify an existing tbl_spark using dplyr::mutate() by adding a column that is the mean() of a column. To do this, we will see that we require a different solution depending on w...

Filling NA Values Using the Last Observation Carried Forward with {dplyr}, {dbplyr} and {sparklyr}

21.03.2021

The Problem Real-life data is often riddled with missing values – or NAs – where no data value is stored for the variable in an observation. Missing data such as this can have a significant effect on the conclusions which can be drawn from the data. It can arise, for example, from individuals dropping out of a study or subjects not properly reporting responses. A ...
