Publications by pacha.dev/blog
Reading REDATAM databases in R
REDATAM REDATAM (Retrieval of Data for Small Areas by Microcomputer) is a data storage and retrieval system created by ECLAC and it is widely used by national statistics offices to store and manipulate census and survey data. However, conducting statistical analysis with REDATAM databases, such as Poisson or Negative Binomial regression, can be tri...
2737 sym R (138 sym/2 pcs) 2 img
armadillo 0.3.0 is available on CRAN
The goal of armadillo (previously cpp11armadillo) is to provide a novel approach to use the Armadillo C++ library by using the header-only cpp11 R package and to simplify things for the end-user. New features Completely rewritten documentation, which now aims to cover all the essentials to get started with C++ and Armadillo. New examples, which a...
1332 sym R (82 sym/2 pcs) 2 img
Kendallknight: an R package for Kendall’s correlation coefficient
Motivation Existing R packages, such as pcaPP, provide efficient implementations of the Kendall correlation coefficient. However, I wanted to create my own package exclusively for this purpose, without additional functions, and that it also allows to test hypothesis about the correlation coefficient. Installation You can install the development ve...
920 sym R (355 sym/2 pcs)
CP 1919 / PSR B1919+21 Dataset
If you like these contributions, please consider buying me a coffee. About I added the CP 1919 / PSR B1919+21 Dataset to my GitHub. This dataset, found in one of my old external drives, corresponds to the famous plot from Radio Observations of the Pulse Profiles and Dispersion Measures of Twelve Pulsars (Craft, 1970). This is broadly known as the ...
1436 sym R (1881 sym/7 pcs) 6 img
Advanced International Trade in R
About The goal of these solutions is to provide a reference for those who come from Stata and want to learn R. I prioritized readability and simplicity over performance and elegance. There were parts of the code were it was challenging to stick to a literal code translation, and I had to use R idioms to make the code more readable. You can access ...
790 sym 2 img
The Hitchhiker’s Guide to Linear Models with codes and datasets sale
About Last week I wrote that the 2nd edition of The Hitchhiker’s Guide to Linear Models can be downloaded for free (or for a suggested price of 10 USD) from Leanpub. The 2nd edition now also features a sale featuring an extra with all the datasets and codes used in the book. These are presented as RStudio projects with R scripts to enhance the h...
12659 sym 2 img
The Hitchhiker’s Guide to Linear Models
About The Hitchhiker’s Guide to Linear Models can be downloaded for free (or for a suggested price of 10 USD) from Leanpub. For every exercise I did my best to connect the specific statistical concepts with R code, and every time I use linear algebra I connect it with a concrete R example. In this book you will not find something such as “this ...
12299 sym
Install PostGIS on Red Hat Enterprise Linux (RHEL) 8
Motivation I was ready to copy a table with a geometry column from R to PostgreSQL, and I saw this error that wasn’t there yesterday: Error: Failed to fetch row : ERROR: type "geometry" does not exist at character 108 I ran sudo -i -u postgres; psql -d mydatabase -c "CREATE EXTENSION postgis;", which means to re-activate PostGIS for my particula...
3096 sym R (3414 sym/16 pcs)
If Doom runs everywhere, it must run on Shiny
Motivation I’ve been playing with the idea of running Doom on Shiny for a while. If Doom runs “everywhere”, it must run on Shiny, right? Doom was released in 1993 and since then it has been ported to many different platforms including: A Kodak digital camera from 1998 An ATM A pregnancy test The result I was able to run Doom on Shiny! I hav...
1213 sym R (969 sym/1 pcs) 2 img
Logistic regression is not advanced ‘machine learning’ or ‘artificial intelligence’
Motivation One of the most common comments I hear is that logistic regression (also called Binomial regression) is some kind of “advanced magic”, “machine learning”, “artificial intelligence” or “big data”. This is not true. In this post, I will show you how logistic regression works and why it is not as complex as some people think...
3938 sym R (1426 sym/12 pcs)