Publications by Jan Górecki - R
Scaling data.table using index
R can handle fairly big data working on a single machine, 2B (2E9) rows and couple of columns require about 100 GB of memory. This is already well enough to care about performance. With this post I'm going discuss scalability of filter queries. The index has been introduced to data.table in 1.9.4. It is also known as secondary keys. Unlike wit...
2954 sym R (4789 sym/7 pcs) 2 img
Solve common R problems efficiently with data.table
I was recently browsing stackoverflow.com (often called SO) for the most voted questions under R tag. To my surprise, many questions on the first page were quite well addressed with the data.table package. I found a few other questions that could benefit from a data.table answer, therefore went ahead and answered them. In this post, I’d like to...
5996 sym R (4321 sym/15 pcs) 1 tbl
Solve common R problems efficiently with data.table
I was recently browsing stackoverflow.com (often called SO) for the most voted questions under R tag. To my surprise, many questions on the first page were quite well addressed with the data.table package. I found a few other questions that could benefit from a data.table answer, therefore went ahead and answered them. In this post, I’d like to...
5996 sym R (4321 sym/15 pcs) 1 tbl
Boost Your Data Munging with R
This article was first published on the toptal.com blog. Additionally be noticed that my blog is migrating to new host due to GitHub Pages drops support for RDiscount, Redcarpet, and RedCloth (Textile) markup engines. Old host will be still available but new posts will be published on jangorecki.gitlab.io, drop-in replacement after changing fro...
12974 sym R (16764 sym/18 pcs) 2 img
Boost Your Data Munging with R
This article was first published on the toptal.com blog. Additionally be noticed that my blog is migrating to new host due to GitHub Pages drops support for RDiscount, Redcarpet, and RedCloth (Textile) markup engines. Old host will be still available but new posts will be published on jangorecki.gitlab.io, drop-in replacement after changing fro...
12974 sym R (16764 sym/18 pcs) 2 img