Publications by rstats | Emily Riederer

Python Rgonomics

28.12.2023

Interoperability was a key theme in open-source data languages in 2023. Ongoing innovations in Arrow (a language-agnostic in-memory standard for data storage), growing adoption of Quarto (the language-agnostic heir apparent to R Markdown), and even pandas creator Wes McKinney joining Posit (the language-agnostic rebranding of RStudio) all illust...

13739 sym

Oh, I’m sure it’s probably nothing

04.09.2022

Language interoperability and different ways of enabling “polyglot” workflows have seemed to take centerstage in the data world recently: Apache Arrow promises a language-independent memory format for interoperability, – RStudio its rebranding as Posit to cement their place as a leader in language-agnostic data tooling, RStudio simultaneo...

9551 sym R (3540 sym/25 pcs) 4 img 20 tbl

RMarkdown Driven Development (RmdDD)

03.05.2019

Introduction RMarkdown is an excellent platform for capturing narrative analysis and code to create reproducible reports, blogs, slides, books, and more. One benefit of RMarkdown is its abilities to keep an analyst in the “flow” of their work and to capture their thought process along the way. However, thought processes are rarely linear;...

16795 sym 10 img

Rtistic: A package-by-numbers repo

24.05.2019

Last winter, I attended a holiday party at a “paint-and-sip” venue. For those unfamiliar, “paint-and-sip” is a semi-trendy cottage industry offering evenings of music, wine, and a guided painting activity. For example, my group painted sasquatch on a snowy winter’s eve: As often happens, this completely unrelated thing set me thinking ...

9875 sym R (3941 sym/5 pcs) 8 img

Resource Round-Up: R in Industry Edition

29.08.2019

One of the ways that practices of reproducible research can be brought into industry is through the development of custom R packages and data tools for one’s company / organization. Not only can these tools deliver large efficiency gains and standardization, they ideally infuse corporate culture with the shared passion and mission found in open...

10084 sym

RMarkdown Driven Development: the Technical Appendix

31.01.2020

Motivation My initial post on RMarkdown Driven Development focuses on major concepts in the process of evolving a one-time, single-file analysis into a sustainable analytical tool. In the spirit of Etsy’s immutable documentation, I intentionally minimized references to specific tools or packages. After all, software is transient; principles a...

31124 sym R (1545 sym/14 pcs) 6 img

projmgr: Managing the human dependencies of your projects

13.05.2020

Many tools and packages aim to eliminate the pain and uncertainty of technical project management. For example, git, make, Docker, renv, and drake are just a few existing tools that enable collaboration, manage softwatre dependencies, and promote reproducibility. However, there is no analogous gold standard for managing the most time-consuming an...

9450 sym R (5799 sym/15 pcs) 2 img

RMarkdown CSS Selector Tips

29.06.2020

When working with R Markdown’s HTML output type, it’s possible to add a custom style to your output by passing in a CSS style sheet to the YAML header like this: output: html_document: css: "my-style-sheet.css" To use CSS effectively, it’s critical to understand how to specificy which selectors one wishes to style. This requires a mi...

11275 sym R (55 sym/1 pcs)

Resource Round-Up: Latent and Lasting Documentation

02.07.2020

The importance of documentation is uncontroversial. For many data and analytical products, documentation is the user interface and key to promoting user success and future reuse. However, when project timelines get tight, too many data products are considered complete without appropriate documentation. Even when these resources initially exist, t...

9186 sym

A beginner’s guide to Shiny modules

25.07.2020

Recently, I argued the case on Twitter that Shiny modules are not an advanced topic and can actually be a great way for novice Shiny developers to start building more complex applications. My Shiny hot take is that modules are **not** an advanced topic. IMHO it’s so much easier and more natural for \#rstats users to write small, modular functi...

10011 sym R (4058 sym/7 pcs) 6 img