Publications by pacha.dev/blog
The REDATAM format and its challenges for data access and information creation in public policy
I am glad to share my 1st PhD article in collaboration with Lital Barkai: http://dx.doi.org/10.1017/dap.2025.4. Abstract The REDATAM (retrieval of data for small areas by microcomputer) statistical package and format, developed by ECLAC, has been a critical tool for disseminating census data across Latin America since the 1990s. However, significan...
1572 sym
cpp11armadillo: An R package to use the Armadillo C++ library
I am glad to share my 2nd PhD article in collaboration with Jonathan Schneider Malamud: https://doi.org/10.1016/j.softx.2025.102087. Abstract This article introduces ‘cpp11armadillo’, an R package that integrates the highly efficient Armadillo C++ linear algebra library with R through the ‘cpp11’ interface. Designed to offer significant per...
1110 sym
Cpp11 pull requests to improve the integration of R and C++
About From cpp11 description: “Provides a header only, C++11 interface to R’s C interface. Compared to other approaches ‘cpp11’ strives to be safe against long jumps from the C API as well as C++ exceptions, conform to normal R function semantics and supports interaction with ‘ALTREP’ vectors.” I have used cpp11 for two years right af...
2698 sym R (2234 sym/6 pcs)
Reading REDATAM databases in R
REDATAM REDATAM (Retrieval of Data for Small Areas by Microcomputer) is a data storage and retrieval system created by ECLAC and it is widely used by national statistics offices to store and manipulate census and survey data. However, conducting statistical analysis with REDATAM databases, such as Poisson or Negative Binomial regression, can be tri...
2737 sym R (138 sym/2 pcs) 2 img
armadillo 0.3.0 is available on CRAN
The goal of armadillo (previously cpp11armadillo) is to provide a novel approach to use the Armadillo C++ library by using the header-only cpp11 R package and to simplify things for the end-user. New features Completely rewritten documentation, which now aims to cover all the essentials to get started with C++ and Armadillo. New examples, which a...
1332 sym R (82 sym/2 pcs) 2 img
Kendallknight: an R package for Kendall’s correlation coefficient
Motivation Existing R packages, such as pcaPP, provide efficient implementations of the Kendall correlation coefficient. However, I wanted to create my own package exclusively for this purpose, without additional functions, and that it also allows to test hypothesis about the correlation coefficient. Installation You can install the development ve...
920 sym R (355 sym/2 pcs)
CP 1919 / PSR B1919+21 Dataset
If you like these contributions, please consider buying me a coffee. About I added the CP 1919 / PSR B1919+21 Dataset to my GitHub. This dataset, found in one of my old external drives, corresponds to the famous plot from Radio Observations of the Pulse Profiles and Dispersion Measures of Twelve Pulsars (Craft, 1970). This is broadly known as the ...
1436 sym R (1881 sym/7 pcs) 6 img
Advanced International Trade in R
About The goal of these solutions is to provide a reference for those who come from Stata and want to learn R. I prioritized readability and simplicity over performance and elegance. There were parts of the code were it was challenging to stick to a literal code translation, and I had to use R idioms to make the code more readable. You can access ...
790 sym 2 img
The Hitchhiker’s Guide to Linear Models with codes and datasets sale
About Last week I wrote that the 2nd edition of The Hitchhiker’s Guide to Linear Models can be downloaded for free (or for a suggested price of 10 USD) from Leanpub. The 2nd edition now also features a sale featuring an extra with all the datasets and codes used in the book. These are presented as RStudio projects with R scripts to enhance the h...
12659 sym 2 img
The Hitchhiker’s Guide to Linear Models
About The Hitchhiker’s Guide to Linear Models can be downloaded for free (or for a suggested price of 10 USD) from Leanpub. For every exercise I did my best to connect the specific statistical concepts with R code, and every time I use linear algebra I connect it with a concrete R example. In this book you will not find something such as “this ...
12299 sym