Publications by Pachá

A Crash Course on PostgreSQL for R Users

08.08.2020

Updated on 2020-09-19: I changed the “Creating tables” section. My original script used copy_to(), which creates (unique) indexes, but when I adapted the code for this post I wrote dbWriteTable(), which doesn’t create indexes and only adds records. Updated on 2020-08-11: @jbkunst suggested a more concise approach to write large tables by us...

4828 sym R (4260 sym/10 pcs) 2 img

SQL Databases for Students and Educators

17.09.2020

I’ve created the site db-edu.pacha.dev to offer free access to SQL databases. Publicly accessible databases often impose query limits or require registration. Even though I maintain public, limit-free APIs, I never wanted to host a public database because I tend to think that connection strings are a problem for the user. Hosting different...

1661 sym

The Hitchhiker’s Guide to Ggplot2 and Plotnine Flash Sale

03.01.2021

Flash sale: Jodie Burchell and yours truly have published two books, The Hitchhiker’s Guide to Ggplot2 and The Hitchhiker’s Guide to Plotnine, on graphing in both R and Python using ggplot2 and plotnine, which are cross-language equivalents. You can get each book for $10 or the two in a bundle for $20 (the previous price was $30 per book). All...

858 sym

Hyperplane Separation Theorem

20.02.2021

A bit of context to put this on my stats blog: I’m reading Real Analysis books again as part of my studies. I used to visit Kim C. Border’s site from time to time to read his excellent materials, and now I read that he passed away. I never audited one of his courses nor studied at Caltech, but we exchanged several emails from 2012 to 2019, most...

4045 sym

Microeconomic Theory and Linear Regression (Part 1)

01.04.2021

Introduction: This is a summary of a very old collection of materials that I used as a teaching assistant (before 2017). Among those materials I found some class notes from Arne Henningsen, the author of the micEcon R package. I’ll use that package and some others to show some concepts from Microeconomic Theory. Package installation: # install.packag...

9471 sym R (15620 sym/23 pcs) 14 img 3 tbl

Wes McKinney InteRview

05.06.2021

I interviewed Wes McKinney, the creator of pandas and author of Python for Data Analysis. This interview was conducted as a way to provide something illustrative about Open Source Software now that I am committing actively to the arrow R package, of which he’s a very active collaborator and a co-creator of the multi-platform library Apache Arr...

7857 sym

analogsea: Using Arrow, S3 and DigitalOcean for efficient model fitting in RStudio

16.07.2021

Introduction: This tutorial explains how to use arrow with analogsea to take full advantage of S3 filesystems and parallel computing. Analogsea is a community project created by and for statisticians from the R community: Scott Chamberlain, Hadley Wickham, Winston Chang, Bob Rudis, Bryce Mecum, and yours truly. This package provides an interfac...

5544 sym R (5290 sym/13 pcs) 2 img