Publications by Jan Górecki - R

Scaling data.table using index

22.11.2015

R can handle fairly big data on a single machine: 2B (2E9) rows and a couple of columns require about 100 GB of memory. At that scale, performance already matters. In this post I'm going to discuss the scalability of filter queries. The index was introduced to data.table in 1.9.4. It is also known as secondary keys. Unlike wit...

2954 sym R (4789 sym/7 pcs) 2 img

Solve common R problems efficiently with data.table

10.12.2015

I was recently browsing stackoverflow.com (often called SO) for the most voted questions under the R tag. To my surprise, many questions on the first page were quite well addressed with the data.table package. I found a few other questions that could benefit from a data.table answer, so I went ahead and answered them. In this post, I’d like to...

5996 sym R (4321 sym/15 pcs) 1 tbl

Boost Your Data Munging with R

29.06.2016

This article was first published on the toptal.com blog. Also note that my blog is migrating to a new host because GitHub Pages is dropping support for the RDiscount, Redcarpet, and RedCloth (Textile) markup engines. The old host will remain available, but new posts will be published on jangorecki.gitlab.io, a drop-in replacement after changing fro...

12974 sym R (16764 sym/18 pcs) 2 img
