Publications by pacha.dev/blog

The REDATAM format and its challenges for data access and information creation in public policy

16.02.2025

I am glad to share my first PhD article, written in collaboration with Lital Barkai: http://dx.doi.org/10.1017/dap.2025.4. Abstract: The REDATAM (retrieval of data for small areas by microcomputer) statistical package and format, developed by ECLAC, has been a critical tool for disseminating census data across Latin America since the 1990s. However, significan...

cpp11armadillo: An R package to use the Armadillo C++ library

18.02.2025

I am glad to share my second PhD article, written in collaboration with Jonathan Schneider Malamud: https://doi.org/10.1016/j.softx.2025.102087. Abstract: This article introduces ‘cpp11armadillo’, an R package that integrates the highly efficient Armadillo C++ linear algebra library with R through the ‘cpp11’ interface. Designed to offer significant per...

Cpp11 pull requests to improve the integration of R and C++

24.01.2025

About: From the cpp11 description: “Provides a header only, C++11 interface to R’s C interface. Compared to other approaches ‘cpp11’ strives to be safe against long jumps from the C API as well as C++ exceptions, conform to normal R function semantics and supports interaction with ‘ALTREP’ vectors.” I have used cpp11 for two years right af...

Reading REDATAM databases in R

02.10.2024

REDATAM: REDATAM (Retrieval of Data for Small Areas by Microcomputer) is a data storage and retrieval system created by ECLAC and widely used by national statistics offices to store and manipulate census and survey data. However, conducting statistical analysis with REDATAM databases, such as Poisson or Negative Binomial regression, can be tri...

armadillo 0.3.0 is available on CRAN

21.08.2024

The goal of armadillo (previously cpp11armadillo) is to provide a novel approach to using the Armadillo C++ library through the header-only cpp11 R package, and to simplify things for the end user. New features: Completely rewritten documentation, which now aims to cover all the essentials to get started with C++ and Armadillo. New examples, which a...

Kendallknight: an R package for Kendall’s correlation coefficient

15.08.2024

Motivation: Existing R packages, such as pcaPP, provide efficient implementations of the Kendall correlation coefficient. However, I wanted to create my own package exclusively for this purpose, without additional functions, that also allows testing hypotheses about the correlation coefficient. Installation: You can install the development ve...

CP 1919 / PSR B1919+21 Dataset

21.06.2024

If you like these contributions, please consider buying me a coffee. About: I added the CP 1919 / PSR B1919+21 Dataset to my GitHub. This dataset, found on one of my old external drives, corresponds to the famous plot from Radio Observations of the Pulse Profiles and Dispersion Measures of Twelve Pulsars (Craft, 1970). This is broadly known as the ...

Advanced International Trade in R

17.06.2024

About: The goal of these solutions is to provide a reference for those who come from Stata and want to learn R. I prioritized readability and simplicity over performance and elegance. There were parts of the code where it was challenging to stick to a literal translation, and I had to use R idioms to make the code more readable. You can access ...

The Hitchhiker’s Guide to Linear Models with codes and datasets sale

16.06.2024

About: Last week I wrote that the 2nd edition of The Hitchhiker’s Guide to Linear Models can be downloaded for free (or for a suggested price of 10 USD) from Leanpub. The 2nd edition is now also on sale with an extra that includes all the datasets and code used in the book. These are presented as RStudio projects with R scripts to enhance the h...

The Hitchhiker’s Guide to Linear Models

07.06.2024

About: The Hitchhiker’s Guide to Linear Models can be downloaded for free (or for a suggested price of 10 USD) from Leanpub. For every exercise I did my best to connect the specific statistical concepts with R code, and every time I use linear algebra I connect it with a concrete R example. In this book you will not find anything such as “this ...
