Publications by Bob Rudis (@hrbrmstr)

Short is the new Long with longurl for R (plus working with weblogs & URLs in R)

23.06.2015

Necessity is the mother of invention and I had the opportunity today to take an R package from development to CRAN in less than the span of 24 hours. Despite being on vacation, I answered an R question on StackOverflow pertaining to the use of decode_short_url from the twitteR package. That function has no option for validity checking (i.e. whet...

4042 sym R (2515 sym/6 pcs)

iptools 0.2.0 is now on CRAN

01.07.2015

We are happy to announce that the iptools package is now on CRAN. Formerly only available on GitHub, iptools now compiles under Debian/Ubuntu, Fedora/CentOS/RedHat and Mac OS X (we’re still working on that other operating system). Oliver (the package co-author and on-CRAN instigator) wrote some excellent vignettes that cover the functionality...

1734 sym

Chart makeover – Unisys Security Insights Survey

02.07.2015

It seems that not a day goes by without some information security vendor releasing a report based on a survey. Thankfully, this post is not about the efficacy of survey-based reports or their findings. Today, we’re doing a makeover for Unisys, who just released their Findings from the 2015 Unisys Security Insights Survey. I started reading thei...

3784 sym R (3647 sym/7 pcs) 4 img

“Just the text ma’am” – Web Site Content Extraction with XSLT & R

09.07.2015

Sometimes you just need the salient text from a web site, often as a first step towards natural language processing (NLP) or classification. There are many ways to achieve this, but XSLT (eXtensible Stylesheet Language) was purpose-built for slicing, dicing and transforming XML (and, hence, HTML) so, it can make more sense and even be speedier us...

3296 sym R (16723 sym/2 pcs)

R 101 – Aggregate By Quarter

14.07.2015

We were asked a question on how to (in R) aggregate quarterly data from what I believe was a daily time series. This is a pretty common task and there are many ways to do this in R, but we’ll focus on one method using the zoo and dplyr packages. Let’s get those imports out of the way: library(dplyr) library(zoo) library(ggplot2) Now, we nee...

1539 sym R (1642 sym/5 pcs) 2 img

Introducing the cymruservices R Package

22.07.2015

The R world has come a long way since Jay & I wrote Data-Driven Security. We had to make a conscious decision to stick with R 2.14.0 (R is at version 3.2.1 now) and packages such as knitr and dplyr either didn’t exist or were in their infancy. In Chapter 4, we showed some very basic exploratory data analysis and visualization. One of those exa...

2230 sym R (1071 sym/1 pcs)

RBerkeley Was Just Pining For The Fjords

27.07.2015

UPDATE: RBerkeley is now on CRAN. If you made it to Chapter 8 of Data-Driven Security after ~October 2014 and tried to run the BerkeleyDB R example, you were greeted with: Warning in install.packages : package ‘RBerkely’ is not available (for R version [YOUR_R_VERSION]) That’s due to the fact that it was removed from CRAN at the end of ...

1745 sym R (101 sym/1 pcs)
