Publications by Joseph Rickert

Mini-tutorial for Quandl: How to access financial data with R

12.06.2013

by Joseph Rickert Quandl.com, the open source website for financial data, made rapid progress earlier this year in becoming an R friendly source for financial time series data. Tammer Kamel, Quandl’s founder introduced the site on Revolutions blog in late February as a “search engine” for numerical data and explained how Quandl’s “Q-bo...

3668 sym R (2370 sym/2 pcs) 6 img

Draw nicer Classification and Regression Trees with the rpart.plot package

19.06.2013

by Joseph Rickert The basic way to plot a classification or regression tree built with R’s rpart() function is just to call plot. However, in general, the results just aren’t pretty. As it turns out, for some time now there has been a better way to plot rpart() trees: the prp() function in Stephen Milborrow’s rpart.plot package. This funct...

4054 sym R (1759 sym/1 pcs) 4 img

Learning Time Series with R

27.06.2013

by Joseph Rickert Late last Saturday afternoon I was reading in my usual spot at the Dana Street Coffee House in Mt. View. A stranger walking by my table noticed my copy of Madsen’s Time Series Analysis (sitting there untouched again) said he needed to learn something about time series and asked if I could recommend a book. He looked serious so...

6427 sym R (737 sym/1 pcs)

rxDTree(): a new type of tree algorithm for big data

11.07.2013

by Joseph Rickert The rxDTree() function included in the RevoScaleR package distributed with Revolution R Enterprise is an an example of a new class of algorithms that are being developed to deal with very large data sets. Although the particulars differ, what these algorithms have in common is the use of approximations, methods of summarizing or...

5395 sym R (2398 sym/1 pcs)

Deepen your R experience with Rcpp

17.07.2013

by Joseph Rickert It is very likely that even a very casual observer what is happening in the world of R these past few months would have come across some mention of Rcpp, the R package that greatly facilitates R and C++ integration. Rcpp is hot! Over 130 R packages now depend on Rcpp and it is likely to keep growing. The following plot built usi...

4777 sym R (723 sym/1 pcs) 2 img

Revolution Analytics Supports the R Community

25.07.2013

by Joseph Rickert Early on, Revolution Analytics realized that R is more than just a tool for statistical computing — it is also the culture that has grown up around the use of the tool. The R culture is open and inclusive, competitive but also nourishing. There is a strong sense of community that encourages contribution and growth. We very muc...

2627 sym

Statisticians: an endangered species?

01.08.2013

by Joseph Rickert Every month I look forward to getting my copy of AMSTATNEWS, the monthly magazine of the American Statistical Association, in the mail. This July, I was both pleased and bemused by ASA President Marie Davidian’s article Aren’t We Data Science?. I was pleased to see a follow up to last month’s article The ASA and Big Data, ...

4730 sym

R, drug development and the FDA

15.08.2013

by Joseph Rickert When you not directly working in an industry it is often extremely difficult to get any real insight into common practices that may be blindly transparent to people who are. With some persistence though, every once in awhile you can stumble into an opportunity to see why things are the way they are. Last week, at the JSM in Mont...

4776 sym

Big Data Sets you can use with R

22.08.2013

by Joseph Rickert The world may indeed be awash with data, however, it is not always easy to find a suitable data set when you need one. As the number of people becoming involved with R and data science increases so does the need for interesting data sets for creating examples, showcasing machine learning algorithms and developing statistical ana...

4658 sym R (2169 sym/1 pcs) 2 img

R and Linear Algebra

29.08.2013

by Joseph Rickert I was recently looking through upcoming Coursera offerings and came across the course Coding the Matrix: Linear Algebra through Computer Science Applications taught by Philip Klein from Brown University. This looks like a fine course; but why use Python to teach linear algebra? I suppose this is a blind spot of mine: MATLAB I ca...

6682 sym R (1803 sym/1 pcs)