Publications by David Smith

Revolution Analytics and Teradata bring R into the Database

18.09.2013

Today, Teradata announced the new Teradata Database 14.10 and with it some exciting news for R programmers: the first next-generation in-database R analytics that are fully parallel and scalable. In conjunction with Revolution R Enterprise, R users will soon be able to use the power of the Teradata Database as a massively-parallel R platform, and...

3127 sym

Rrrrr! It’s Talk Like a Pirate Day!

19.09.2013

Arrr, Mateys! Today be “Talk Like a Pirate Day“, t' unofficial day o' R users everywhere. (Rrrrr!) T' celebrate this swashbucklin' day, Revolution Analytics has a bunch o' pirate bandanas, eaypatches and “I love R” T-shirts t' give away t' t' best pirate pictures posted t' t' #RPirate2013 hashtag on Twitter. (If you're not on Twitter, fee...

1032 sym 2 img

Real Pirate Attacks, Charted with R

19.09.2013

To mark Talk Like a Pirate Day, Bob Rudis uses R to animate a map of the cumulative real-world pirate attacks since 1978: Looks like the Carribean and the West Indes, traditional pirate haunts, are still active. But the real hot-spot in modern times is Africa. Find the R code behind the animation at the blog post linked below. rud.is: Animated ...

764 sym 2 img

Hortonworks Hadoop, Big Data and Data Science

20.09.2013

Over at the Hortonworks blog, I've shared some resources for getting started with Data Science and R. And if you'd like to learn more about how Hadoop and R fit into the modern data artchitecture, I'll be participating in a joint webinar with Hortonworks' John Kreisa. If you'd like to join, register at the link below. The Modern Data Architectur...

861 sym

Big Data Bytes: How Open Source is Changing Business

23.09.2013

I had a fun time on Friday in a Google Hangout chat with David Pittman (IBM), Eric Kavanagh (Bloor Group) and Tom Deutsch (IBM), where we talked about how open source is changing business. The conversation covered several open source projects including R and Hadoop, and ranged from the impact of open source on total cost of ownership, finding ta...

1008 sym 2 img

R as a command-line tool for data science

24.09.2013

Data Scientist Jeroen Janssens recently published a useful list of 7 data science tools that you can use from the command line. This doesn't just mean they're convenient tools for command-line junkies: it also means you can easily chain them together with data sources for offline, automated processes. Included in the list are JSON processing too...

1751 sym

R 3.0.2 "Frisbee Sailing" now available

25.09.2013

The latest update to the open source R project, R 3.0.2, is now available. This incremental update, codenamed “Frisbee Sailing”, mainly fixes some minor bugs, improves the organization of the distribution, improves performance in some areas, and adds a few small features (see the NEWS file for a complete list). There's also new documentation ...

1476 sym

Forecasting Using R: A new online course from Rob Hyndman

27.09.2013

Statistical forecasting is a critical component of every modern business, and Rob J Hyndman, Professor of Statistics at Monash University, is an expert in the field. He's the co-author of several books on forecasting, including Forecasting: Principles and Practice, a free on-line book that provides a comprehensive introduction to forecasting me...

1947 sym

Which is the best "Flyover" state?

30.09.2013

If you were to hop into your personal aircraft, and plotted a straight line course taking off in one state and landing in the SAME state, how many other states might you fly over? On other words, what's the best state for “flyovers” of other states? Todd Schnieder from the Rapgenius engineering team answered that question using the R language...

1731 sym 4 img

R with Big Data on Hortonworks

01.10.2013

If you missed last week's webinar with John Kreisa from Hortonworks (hosted by Data Science Central), we described how R fits into the Modern Data Architecture. Not only can you extract and distil data in Hadoop with the open-source RHadoop project, but with the forthcoming release of Revolution R Enterprise 7 you will be able to run the high-p...

1164 sym