Publications by hrbrmstr

R⁶ — Tracking WannaCry Bitcoin Wallet Payments with R

15.05.2017

If you follow me on Twitter or monitor @Rapid7’s Community Blog you know I’ve been involved a bit in the WannaCry ransomworm triage. One thing I’ve been doing is making charts of the hourly contribution to the Bitcoin addresses that the current/main attackers are using to accept ransom payments (which you really shouldn’t pay, now, even i...

1453 sym R (1741 sym/1 pcs) 2 img

R⁶ — Using R With Amazon Athena & AWS Temporary Security Credentials

16.05.2017

Most of the examples of working with most of the AWS services show basic username & password authentication. That’s all well-and-good, but many shops use the AWS Security Token Service to provide temporary credentials and session tokens to limit exposure and provide more uniform multi-factor authentication. At my workplace, Frank Mitchell creat...

2304 sym R (1033 sym/1 pcs)

A Very Palette-able Post

21.05.2017

Many of my posts seem to begin with a link to a tweet, and this one falls into that pattern: And @_inundata is already working on a #rstats palette. https://t.co/bNfpL7OmVl— Timothée Poisot (@tpoi) May 21, 2017 I’d seen the Ars Tech post about the named color palette derived from some training data. I could tell at a glance of the resultant...

4830 sym R (7029 sym/10 pcs) 6 img

R⁶ — Idiomatic (for the People)

23.05.2017

NOTE: I’ll do my best to ensure the next post will have nothing to do with Twitter, and this post might not completely meet my R⁶ criteria. A single, altruistic, nigh exuberant R tweet about slurping up a directory of CSVs devolved quickly — at least in my opinion, and partly (sadly) with my aid — into a thread that ultimately strayed fr...

3653 sym R (900 sym/2 pcs)

L.A. Unconf-idential : a.k.a. an rOpenSci #runconf17 Retrospective

28.05.2017

Last year, I was able to sit back and lazily “RT” Julia Silge’s excellent retrospective on her 2016 @rOpenSci “unconference” experience. Since Julia was not there this year, and the unconference experience is still in primary storage (LMD v2.0 was a success!) I thought this would be the perfect time for a mindful look-back. And Now, A W...

5981 sym 10 img

Drilling Into CSVs — Teaser Trailer

31.05.2017

I used reading a directory of CSVs as the foundational example in my recent post on idioms. During my exchange with Matt, Hadley and a few others — in the crazy Twitter thread that spawned said post — I mentioned that I’d personally “just use Drill”. I’ll use this post as a bit of a teaser trailer for the actual post (or, more likely,...

7828 sym R (2981 sym/6 pcs)

R⁶ — Scraping Images To PDFs

05.06.2017

I’ve been doing intermittent prep work for a follow-up to an earlier post on store closings and came across this CNN Money “article” on it. Said “article” is a deliberately obfuscated or lazily crafted series of GIF images that contain all the Radio Shack impending store closings. It’s the most comprehensive list I’ve found, but the...

1952 sym R (698 sym/1 pcs)

Engaging the tidyverse Clean Slate Protocol

10.06.2017

I caught the 0.7.0 release of dplyr on my home CRAN server early Friday morning and immediately set out to install it since I’m eager to finish up my sergeant package and get it on CRAN. “Tidyverse” upgrades aren’t trivial for me as I tinker quite a bit with the tidyverse and create packages that depend on various components. The sergeant...

3769 sym 2 img

Keeping Users Safe While Collecting Data

13.06.2017

I caught a mention of this project by Pete Warden on Four Short Links today. If his name sounds familiar, he’s the creator of the DSTK, an O’Reilly author, and now works at Google. A decidedly clever and decent chap. The project goal is noble: crowdsource and make a repository of open speech data for researchers to make a better world. Said s...

5312 sym 6 img

Replicating the Apache Drill ‘Yelp’ Academic Dataset Analysis with sergeant

17.06.2017

The Apache Drill folks have a nice walk-through tutorial on how to analyze the Yelp Academic Dataset with Drill. It’s a bit out of date (the current Yelp data set structure is different enough that the tutorial will error out at various points), but it’s a great example of how to work with large, nested JSON files as a SQL data source. By ‘...

1769 sym