Publications by The Jumping Rivers Blog
Job vacancies at Jumping Rivers!
In line with the continuous growth at Jumping Rivers, we are looking to expand our team of dedicated professionals working in our teams. If you are enthusiastic and keen to develop your skills in cutting edge data science or infrastructure please read on! Who are we and what do we do? Jumping Rivers is an analytics company whose passion is data a...
2242 sym 2 img
Cleaning up forked GitHub repositories with {gh}
One great thing about using GitHub is the ability to view and contribute to others’ code. Even the code underlying many of our favourite packages is available for us to examine and play around with. Forking a repository is a great way to create an exact replica of someone else’s project in our own user space. We can then freely make changes t...
6137 sym R (782 sym/11 pcs) 14 img
Webinars: Practical Advice for R in Production
Many organisations have a robust infrastructure that allows their data science teams to provide, fast and reliable insights. But for many groups, they are just starting down this path. We, Jumping Rivers, have partnered with RStudio to launch a two-part webinar series which examines and explores the usage of R in production environments. The firs...
1634 sym 2 img
Understanding the Parquet file format
Apache Parquet is a popular column storage file format used by Hadoop systems, such as Pig, Spark, and Hive. The file format is language independent and has a binary representation. Parquet is used to efficiently store large data sets and has the extension .parquet. This blog post aims to understand how parquet works and the tricks it uses to eff...
6325 sym R (1590 sym/16 pcs) 4 img
Your first D3 visualisation with {r2d3} and Scooby-Doo
Get the code for this blog on GitHub What is this tutorial and who is it for? This tutorial is aimed mainly at R users who want to learn a bit of D3, and specifically those who are interested in how you can incorporate D3 into your existing workflows in RStudio. It will gloss over a lot of the fundamentals of D3 and related topics (JavaScript...
16103 sym R (5848 sym/17 pcs) 12 img
Git: Moving from Master to Main
In June 2020, GitHub announced that is was moving the default branch name from master to the more neutral name, main. GitLab followed suit in a few months later. Tobie Langel makes the salient point on why changing the name is a good thing: So master is not only racist, it’s also a silly name in the first place. The purpose of this post is sum...
10175 sym R (474 sym/2 pcs) 6 img
2020 Training Review
This blog post was written by our intern Osheen Macoscar. 2020 is a year most of us would like to leave behind. But not all change is bad, and many interesting developments, especially in education, happened due to the constraints imposed by COVID. Like many other training providers, we had to pivot to online learning, which brought with it chall...
5127 sym 18 img
Job vacancies at Jumping Rivers!
In line with the continuous growth at Jumping Rivers, we are looking to expand our team of dedicated professionals working in our teams. If you are enthusiastic and keen to develop your skills in cutting edge data science or infrastructure please read on! Who are we and what do we do? Jumping Rivers is an analytics company whose passion is data a...
2225 sym 2 img
Thinking about maps and ice cream
In November 2021, I took part in the third edition of the 30 Day Map Challenge created by Topi Tjukanov. Participants are given a theme for each day of November, and are tasked with creating a map within that theme. Details of the challenge can be found here. My own contributions can be found on GitHub. Creating thirty maps was indeed a challenge...
8216 sym 14 img
Forgotten features of R 4.0.0
R version 4.0.0 was released almost two years ago. The change in the major version, 3.x.y to 4.0.0, represented significant and potentially breaking changes. For an organisation to start using these new features, everyone in the company must have access to that version; otherwise code isn’t shareable. This naturally slows down adoption. We move...
3575 sym R (203 sym/2 pcs) 4 img