Publications by David Smith
An introduction to Econometrics, using R
If your econometrics is a bit rusty and you're also looking to learn the R language, you can kill two birds with one stone with Introductory Econometrics using Quandl and R. The first three parts of this seven-part tutorial introduce the basics of regression analysis, while the remaining sections provide R code you can try yourself to reprod...
901 sym
Customize your R session with .Rprofile
The .Rprofile file is a great way to customize your R session every time you start it up. You can use it to change R's defaults, define handy command-line functions, automatically load your favourite packages — anything you like! The Getting Genetics Blog has a nice example .Rprofile file to give you some inspiration on what to do. One popular ...
1295 sym
How to switch from spreadsheets to R for data analysis
To take a spreadsheet beyond what it's designed for — data presentation, summarization and simple calculations — into the world of complex data analysis can be an alluring prospect. But it can also be dangerous: consider these examples of spreadsheet errors that led to monumental financial losses, mistaken government policies, and even the wr...
1201 sym
Introducing Revolution R Enterprise 7
We're very excited to formally announce that Revolution R Enterprise 7 is here! This release includes the latest release of Open Source R (R 3.0.2). It brings R and the massively-parallel R functions from Revolution Analytics to Cloudera and Hortonworks in-Hadoop, and in-database on Teradata. It also brings a new drag-and-drop user interface via ...
1287 sym
Video: Revolution R Enterprise 7 interview on theCUBE
I'm in New York City for the Strata + Hadoop World conference, and last night I got the chance to stop by theCUBE for a live interview about Revolution R Enterprise 7. You can watch the full interview below, or click the links on the topics to skip ahead. Many thanks to Dave Vellante of Wikibon and John Furrier of SiliconAngle for the insigh...
1125 sym
What Hadley Wickham uses
You know Hadley Wickham as the inventor of the ggplot2 visualization phenomenon, the creator of time-saving R packages like plyr and lubridate, and the Chief Scientist at RStudio. But do you know what laptop Hadley uses, what software he uses (besides R, of course), or his favourite kitchen appliance? Find out in Hadley's interview with The Setup. ...
445 sym
What’s new in Revolution R Enterprise? Webinar November 5
Revolution R Enterprise 7 is a major update to Revolution Analytics' flagship predictive analytics software. Not only does it include the latest release of Open Source R (providing compatibility with more than 6,000 user-contributed packages), it also adds in-Hadoop and in-database predictive modeling with big data, new statistical capabilities,...
1965 sym
Strata + Hadoop World 2013 Recap
We're back from NYC after a very busy Strata + Hadoop World 2013 conference. Many thanks to all the friendly folks who dropped by the Revolution Analytics booth, attended Joe and Antonio's R and Hadoop tutorial, or simply came up to say hello during the event. It was a jam-packed conference, literally standing room only at many times, so it's gre...
3955 sym 2 img
Webinar replay: What’s new in Revolution R Enterprise 7
In case you missed yesterday's webinar, the slides and replay are now available for Introducing Revolution R Enterprise 7: The Big Data Big Analytics Platform. I've embedded the slides below — if you're already familiar with Revolution R Enterprise the new features start at Slide 21. You can also download the slides or watch the video replay ...
887 sym
Was 2013 a record year for strikeouts in World Series Baseball?
The new book Analyzing Baseball Data with R by Max Marchi and Jim Albert is now available, and the authors have also launched a companion blog to share some of the analyses from the book. For example, they used the Lahman package in R to look at the strikeout rate in World Series baseball games over the last century and found (after a little nonp...
1305 sym 2 img