Publications by Jeffrey Horner

Oracle R, Hash Table Results, And VIM To The Rescue

27.04.2015

Downloaded and installed Solaris 11.2 on my laptop, and WOW! That was a throwback to the late 90’s! Old version of GNOME, no truetype fonts so the whole visual experience was very pixelly. Firefox was installed but every website I visited yelled, “YOU’RE RUNNING AN OLD BROWSER!!! UPGRADE NOW!” The purpose was to benchmark hash table perfo...

1968 sym 8 img

TidyR Challenge: Help Me Do My Job

06.05.2015

Last week I was handed a drug prescription data set and asked to create some interesting graphics. But before I can even get to the fun part, I was faced with actually transforming the set into something that ggplot2 could read. Obviously I can’t share the data, but Tyler Rinker has created a fantastic package called wakefield that creates “r...

1482 sym R (3463 sym/2 pcs)

TidyR Challenge: Update

08.05.2015

In my last post I described a data set that was a bit cumbersome to parse and I wanted to tidy it up before I could begin visually exploring. You know, the fun part. I wasn’t 100% happy with my solution so I candidly asked the internet to Help Me Do My Job. One solution involved a development version of data.table; it had some issues so I’ll ...

1155 sym R (3665 sym/4 pcs)

TidyR Challenge: Data.Table Solution

19.05.2015

Arun Srinivasan is the man! Once he saw that his data.table solution to the TidyR Challenge had an issue, he fixed it! His solution is below along with a quick equivalence test to my original solution, and check out this stackOverflow question for a more engaging discussion of the strengths and weaknesses of both dplyr/tidyr and data.table. Fak...

861 sym R (3020 sym/4 pcs)

Old is New: XML and rvest

22.05.2015

Huh… I didn’t realize just how similar rvest was to XML until I did a bit of digging. After my wonderful experience using dplyr and tidyr recently, I decided to revisit some of my old RUNNING code and see if it could use an upgrade by swapping out the XML dependency with rvest. Ultra Signup: Treasure Trove of Ultra Data If you’re into ultra...

2673 sym R (1405 sym/12 pcs)

Is This How You Dplyr?

27.05.2015

Yesterday I ran into a fairly complex issue regarding dplyr mutation and I wanted to get your take on my solution. I have two data frames with the same identifiers and two different date columns which I need to merge into one date column, with the value of the earlier of the two dates if both are present, or any valid date when one or the other i...

1230 sym R (2850 sym/5 pcs)

No THIS Is How You Dplyr and Data.Table!

28.05.2015

So, I got some great solutions to my dplyr mutation problem to share. Just wait until you see these things! Remember, I was having trouble reconciling two date columns into a minimum value in the presence of NA values. Here’s the fake data again: library(wakefield) library(tidyr) library(dplyr) library(data.table) x <- r_data_frame(n=10,id,dat...

1026 sym R (1403 sym/3 pcs)

This is one of my favorite ggplot2 plots I’ve ever made, but it…

21.07.2015

This is one of my favorite ggplot2 plots I’ve ever made, but it makes me sad. Can you deduce what this plot conveys? Explain the sporadically dashed colored horizontal bands. Explain the red vertical bars. Explain the black vertical bars. If you answer all three correctly, and you can explain the rest of the plot, I’ll give you the code. Rel...

736 sym 2 img