Publications by hrbrmstr
Removing Personal Bias From Flu Severity Estimation (a.k.a. Misery Loves Data)
The family got hit pretty hard with the flu right as the Christmas festivities started and we were all pretty much bed-ridden zombies up until today (2017-01-02). When in the throes of a very bad ILI it’s easy to imagine that you’re a victim of a severe outbreak, especially with ancillary data from others that they, too, either just had/have ...
2200 sym R (3693 sym/5 pcs) 8 img
The Most Important Commodity in 2017 is Data
Despite being in cybersecurity nigh forever (a career that quickly turns one into a determined skeptic if you’re doing your job correctly) I have often trusted various (not to be named) news sources, reports and data sources to provide honest and as-unbiased-as-possible information. The debacle in the U.S. in late 2016 has proven (to me) that w...
3268 sym R (1797 sym/1 pcs) 4 img
2017-01 Authored Package Updates
The rest of the month is going to be super-hectic and it’s unlikely I’ll be able to do any more to help the push to CRAN 10K, so here’s a breakdown of CRAN and GitHub new packages & package updates that I felt were worth raising awareness on: epidata I mentioned this one last week but it wasn’t really a package announcement post. epidata ...
3885 sym R (2678 sym/1 pcs) 4 img
Knit directly to jupyter notebooks from RStudio
Did you know that you can completely replace the “knitting” engine in R Markdown documents? Well, you can! Why would you want to do this? Well, in the case of this post, to commit the unpardonable sin of creating a clunky jupyter notebook from a pristine Rmd file. I’m definitely not “a fan” of “notebook-style” interactive data scien...
2997 sym R (415 sym/1 pcs)
The Devil’s in the [Davos] Details — A quick look at this year’s WEF Global Risks Report
It’s Davos time again. Each year the World Economic Forum (WEF) gathers the global elite together to discuss how they’re going to shape our collective future. WEF also releases their annual Global Risks Report at the same time. I read it every year and have, in the past, borrowed some risk communication visualization idioms from it since — ...
5376 sym 16 img
Workout Wednesday Redux (2017 Week 3)
I had started a “52 Vis” initiative back in 2016 to encourage folks to get practice making visualizations since that’s the only way to get better at virtually anything. Life got crazy, 52 Vis fell to the wayside and now there are more visible alternatives such as Makeover Monday and Workout Wednesday. They’re geared towards the “T” cr...
3135 sym R (2409 sym/1 pcs) 2 img
Create Parquet Files From R Data Frames With sergeant & Apache Drill (a.k.a. Make Parquet Files Great Again in R)
Apache Drill is a nice tool to have in the toolbox as it provides a SQL front-end to a wide array of database and file back-ends and runs in standalone/embedded mode on every modern operating system (i.e. you can get started with or play locally with Drill w/o needing a Hadoop cluster but scale up almost effortlessly). It’s also a bit more ligh...
5367 sym R (2792 sym/2 pcs)
One View of the Impact of the New Immigration Ban (+ freeing PDF data with tabulizer)
Dear Leader has made good on his campaign promise to “crack down” on immigration from “dangerous” countries. I wanted to both see one side of the impact of that decree — how many potential immigrants per year might this be impacting — and show toss up some code that shows how to free data from PDF documents using the @rOpenSci tabuliz...
2704 sym R (1909 sym/3 pcs) 4 img
Exploring News Coverage With newsflash
I was enthused to see a mention of this on the GDELT blog since I’ve been working on an R package dubbed newsflash to work with the API that the form front-ends. Given the current climate, I feel compelled to note that I’m neither a Clinton supporter/defender/advocate nor a ? supporter/defender/advocate) in any way, shape or form. I’m only ...
2500 sym R (2238 sym/2 pcs) 2 img
Candy Coated Confidence Intervals
@mrshrbrmstr hinted that she would like this post by @RickWicklin translated into R for her stats class. She’s quite capable of cranking out the translation of the core component of that post — a call to chisq.test — but she wanted to show the entire post (in R) and really didn’t have time (she’s teaching… Continue reading...
731 sym