Publications by Random R Ramblings
poorman: Select helpers, bug fixes and tests, tests, tests!
Introduction Hello all and welcome to another edition of the poorman series of blog posts. In this series I am discussing my progress in writing a base R equivalent of dplyr. What’s nice about this series is that if you’re not into poorman and would prefer just to use dplyr, then that’s absolutely OK! By highlighting poorman functionality, ...
poorman: The Selectificator 2000!
Introduction Welcome to my series of blog posts about my data manipulation package, {poorman}. For those of you that don’t know, {poorman} is aiming to be a replication of {dplyr} but using only {base} R, and therefore be completely dependency free. What’s nice about this series is that if you’re not into {poorman} and would prefer just to ...
poorman: Version 0.2.1 Release
Introduction Welcome to my series of blog posts about my data manipulation package, {poorman}. For those of you that don’t know, {poorman} is aiming to be a replication of {dplyr} but using only {base} R, and therefore be completely dependency free. What’s nice about this series is that if you would rather just use {dplyr}, then that’s abso...
poorman: Versions 0.2.2 and 0.2.3 Releases
Introduction Welcome to my series of blog posts about my data manipulation package, {poorman}. For those of you that don’t know, {poorman} is aiming to be a replication of {dplyr} but using only {base} R, and therefore be completely dependency free. What’s nice about this series is that if you would rather just use {dplyr}, then that’s abso...
Using Functions As An Input To Functions With {dbplyr}
The Problem I wish to write a function that can mutate a column of my data using a function which is passed as an input. For example, I want to write a function which can modify an existing tbl_spark using dplyr::mutate() by adding a column that is the mean() of a column. To do this, we will see that we require a different solution depending on w...
Filling NA Values Using the Last Observation Carried Forward with {dplyr}, {dbplyr} and {sparklyr}
The Problem Real-life data is often riddled with missing values – or NAs – where no data value is stored for the variable in an observation. Missing data such as this can have a significant effect on the conclusions which can be drawn from the data. For example, individuals may drop out of a study, or subjects may not properly report their responses. A ...