Publications by hrbrmstr

Removing Personal Bias From Flu Severity Estimation (a.k.a. Misery Loves Data)

02.01.2017

The family got hit pretty hard with the flu right as the Christmas festivities started and we were all pretty much bed-ridden zombies up until today (2017-01-02). When in the throes of a very bad ILI it’s easy to imagine that you’re a victim of a severe outbreak, especially with ancillary data from others that they, too, either just had/have ...

2200 sym R (3693 sym/5 pcs) 8 img

The Most Important Commodity in 2017 is Data

05.01.2017

Despite being in cybersecurity nigh forever (a career that quickly turns one into a determined skeptic if you’re doing your job correctly) I have often trusted various (not to be named) news sources, reports and data sources to provide honest and as-unbiased-as-possible information. The debacle in the U.S. in late 2016 has proven (to me) that w...

3268 sym R (1797 sym/1 pcs) 4 img

2017-01 Authored Package Updates

08.01.2017

The rest of the month is going to be super-hectic and it’s unlikely I’ll be able to do any more to help the push to CRAN 10K, so here’s a breakdown of CRAN and GitHub new packages & package updates that I felt were worth raising awareness on: epidata I mentioned this one last week but it wasn’t really a package announcement post. epidata ...

3885 sym R (2678 sym/1 pcs) 4 img

Knit directly to jupyter notebooks from RStudio

10.01.2017

Did you know that you can completely replace the “knitting” engine in R Markdown documents? Well, you can! Why would you want to do this? Well, in the case of this post, to commit the unpardonable sin of creating a clunky jupyter notebook from a pristine Rmd file. I’m definitely not “a fan” of “notebook-style” interactive data scien...

2997 sym R (415 sym/1 pcs)

The Devil’s in the [Davos] Details — A quick look at this year’s WEF Global Risks Report

16.01.2017

It’s Davos time again. Each year the World Economic Forum (WEF) gathers the global elite together to discuss how they’re going to shape our collective future. WEF also releases their annual Global Risks Report at the same time. I read it every year and have, in the past, borrowed some risk communication visualization idioms from it since — ...

5376 sym 16 img

Workout Wednesday Redux (2017 Week 3)

18.01.2017

I had started a “52 Vis” initiative back in 2016 to encourage folks to get practice making visualizations since that’s the only way to get better at virtually anything. Life got crazy, 52 Vis fell to the wayside and now there are more visible alternatives such as Makeover Monday and Workout Wednesday. They’re geared towards the “T” cr...

3135 sym R (2409 sym/1 pcs) 2 img

Create Parquet Files From R Data Frames With sergeant & Apache Drill (a.k.a. Make Parquet Files Great Again in R)

22.01.2017

Apache Drill is a nice tool to have in the toolbox as it provides a SQL front-end to a wide array of database and file back-ends and runs in standalone/embedded mode on every modern operating system (i.e. you can get started with or play locally with Drill w/o needing a Hadoop cluster but scale up almost effortlessly). It’s also a bit more ligh...

5367 sym R (2792 sym/2 pcs)

One View of the Impact of the New Immigration Ban (+ freeing PDF data with tabulizer)

26.01.2017

Dear Leader has made good on his campaign promise to “crack down” on immigration from “dangerous” countries. I wanted to both see one side of the impact of that decree — how many potential immigrants per year might this be impacting — and show toss up some code that shows how to free data from PDF documents using the @rOpenSci tabuliz...

2704 sym R (1909 sym/3 pcs) 4 img

Exploring News Coverage With newsflash

01.02.2017

I was enthused to see a mention of this on the GDELT blog since I’ve been working on an R package dubbed newsflash to work with the API that the form front-ends. Given the current climate, I feel compelled to note that I’m neither a Clinton supporter/defender/advocate nor a ? supporter/defender/advocate) in any way, shape or form. I’m only ...

2500 sym R (2238 sym/2 pcs) 2 img

Candy Coated Confidence Intervals

03.02.2017

@mrshrbrmstr hinted that she would like this post by @RickWicklin translated into R for her stats class. She’s quite capable of cranking out the translation of the core component of that post — a call to chisq.test — but she wanted to show the entire post (in R) and really didn’t have time (she’s teaching… Continue reading...

731 sym