Publications by David Smith

Revolution Analytics joins Microsoft

23.01.2015

by David Smith, Chief Community Officer. On behalf of the entire Revolution Analytics team, I am excited to announce that Revolution Analytics is joining forces with Microsoft to bring R to even more enterprises. Microsoft announced today that it will acquire Revolution Analytics. Now, Microsoft might seem like a strange bedfellow for an open-sourc...

4050 sym 2 img

xkcd on P-values

26.01.2015

From the “statistician humour” department, today's xkcd cartoon will ring a bell for anyone who's ever published (or read!) a scientific article including a P-value for a statistical test: If finding P-value excuses is a common activity for you (and let's hope not!) then R has you covered with the Significantly Improved Significance Test. ...

1035 sym 2 img

Microsoft acquires Revolution Analytics – news roundup

26.01.2015

There was a lot of coverage on Friday and over the weekend of the news that Microsoft will acquire Revolution Analytics. Here are some links to just a few of the articles published. Wired: Microsoft is “heavily embracing the R programming language”; “the move deepens Microsoft's investments in open source”. TechCrunch: Microsoft'...

1940 sym

Now’s a great time to learn R. Here’s how.

28.01.2015

In a recent article at opensource.com, I offer up some reasons why now is the time to learn R: data scientists are in high demand, R is the natural language for data scientists, and companies around the world are using R (and hiring R programmers) to make sense of new data sources. Sharp Sight Labs also offers some excellent reasons why you sho...

1570 sym

Reproducibility with Revolution R Open and the checkpoint package

30.01.2015

Thanks to everyone at the Chicago R User Group for giving me such a warm welcome for my presentation last night. In my talk, I gave an introduction to Revolution R Open, with a focus on how the checkpoint package makes sharing R code in a reproducible way easy: If you'd like to try out the checkpoint package, it's available on CRAN now. You can...

1089 sym

Paris’s history, captured in its streets

02.02.2015

The following image by Mathieu Rajerison has been doing the rounds of French media recently. It shows the streets of Paris, color-coded by their compass direction. It's been featured in an article in Telerama magazine, and even on French TV channel LCI (skip ahead to 8:20 in the linked video, which also features an interview with Mathieu). Ma...

2019 sym 2 img

Quickcheck: Randomized unit testing for R

04.02.2015

Hadley Wickham's testthat package has been a boon for R package authors, making it easy to write tests to verify that your code is working correctly, and alerting you when you make changes to your code that inadvertently break things. For the RHadoop project, though, developer Antonio Piccolboni needed a different testing framework, that includ...

1832 sym

Finding the dramatic arc of novels with sentiment analysis

06.02.2015

Sentiment analysis has been widely used to infer the mood of customers in emails, tweets and other short communications. The base assumption is that the sentiment is a fixed value: the email is either angry or happy; positive or negative. But in longer writings like a novel, we naturally expect the sentiment to vary over time. Can we apply sentim...

2138 sym 2 img

In case you missed it: January 2015 roundup

09.02.2015

In case you missed them, here are some articles from January of particular interest to R users. Slides on reproducible data analysis with Revolution R Open and the checkpoint package. A review of a recent Bay Area R User Group meetup, featuring Hadley Wickham, Ryan Hafen and Nick Elprin. In an article at opensource.com, I explain why now is a...

2833 sym

R among top languages on GitHub

11.02.2015

The site githut.info provides quarterly statistics on programming language activity on GitHub, by number of repositories, pushes, forks, etc. Ranked by number of active repositories on GitHub, R is the 12th most popular programming language as of Q4 2014. JavaScript, Java and Python appear as the top 3 in the same list. As I've said before, suc...

1055 sym 2 img