Publications by Jeffrey Breen
googleVis-0.2.4 requires older version of RJSONIO (0.5-0) #rstats
[Update: the new release of googleVis accounts for changes in RJSONIO’s handling of backslashes, so you probably won’t need the older version.] Something has apparently changed in the way RJSON’s toJSON() function works which is causing all sorts of extra escape characters (backslashes) to appear in the googleVis-generated JavaScript, at le...
1487 sym R (2792 sym/3 pcs) 4 img
slides from my R tutorial on Twitter text mining #rstats
Update: An expanded version of this tutorial will appear in the new Elsevier book Practical Text Mining and Statistical Analysis for Non-structured Text Data Applications by Gary Miner et. al which is now available for pre-order from Amazon. In conjunction with the book, I have cleaned up the tutorial code and published it on github. Last month ...
2034 sym R (1401 sym/1 pcs) 4 img
installing R 2.13.1 on Amazon EC2′s “Amazon Linux” AMI #rstats
Condensed from this post (and comments) on David Chudzicki’s blog, tweaked, and updated for R-2.13.1. Assumes you’re starting with a virgin “Amazon Linux” AMI. I picked “Basic 64-bit Amazon Linux AMI 2011.02.1 Beta” (AMI Id: ami-8e1fece7) because it was marked as free tier eligible on the “Quick Start” tab of AWS’s “Launch In...
974 sym R (875 sym/2 pcs) 4 img
One-liners which make me love R: Make your data dance (Hans Rosling style) with googleVis #rstats
It may be a cliché, but much of R’s utility comes from its amazing community. And by community, I am specifically referring to the bright, hard-working people who are willing to share their knowledge and code with the rest of us. Because of their contributions, we can do some amazingly cool and useful things with very little code of our own. I...
3064 sym R (178 sym/4 pcs) 8 img
One-liners which make me love R: twitteR’s searchTwitter() #rstats
R reminds me a lot of English. It’s easy to get started, but very difficult to master. So for all those times I’ve spent… well, forever… trying to figure out the “R way” of doing something, I’m glad to share these quick wins. My recent R tutorial on mining Twitter for consumer sentiment wouldn’t have been possible without Jeff Gen...
1827 sym R (1950 sym/4 pcs) 6 img
Use geom_rect() to add recession bars to your time series plots #rstats #ggplot
Zach Mayer’s work reproducing John Hussman’s Recession Warning Composite prompted me to dig this trick out of my (Evernote) notebook. First, let’s grab some data to plot using the very handy getSymbols() function from Jeffrey Ryan’s quantmod package. We’ll load the U.S. unemployment rate (UNRATE) from the St. Loius Fed’s Federal Reser...
1720 sym R (1174 sym/3 pcs) 6 img
Slides from today’s Big Data Step-by-Step Tutorials: Infrastructure series and Intro to R+Hadoop with RHadoop’s rmr
Here are my presentations from today’s Boston Predictive Analytics Big Data Workshop. All code and config files are available at github: https://github.com/jeffreybreen/tutorial-201203-big-data My portion of the workshop was divided into four parts, three focusing on different infrastructure scenarios and ending with a deep dive into the rmr R ...
3493 sym 4 img
Slides from “Tapping the Data Deluge with R” lightning talk #rstats #PAWCon
Here is my presentation from last night’s Boston Predictive Analytics Meetup graciously hosted by Predictive Analytics World Boston. The talk is meant to provide an overview of (some) of the different ways to get data into R, especially supplementary data sets to assist with your analysis. All code and data files are available at github: http:/...
972 sym 4 img
Slides and replay of my “Using R with Hadoop” webinar now available #rstats #hadoop
I owe a big “thank you” to all of you who attended my webinar yesterday “Using R with Hadoop”. Revolution Analytics partnered with us at Think Big Analytics to produce the webinar, and I owe them thanks as well. For those of you who missed it, the slides and replay are now available from Revolution Analytics. Related To leave a comment ...
726 sym 4 img