Publications by Joseph Rickert

Highlights of the Milwaukee Workshop on R and Bioinformatics

23.05.2013

by Joseph Rickert On May 10th and 11th, in honor of this being the International Year of Statistics, the Milwaukee Chapter of the American Statistical Association (MILWASA) held a workshop on cutting-edge uses of R in Bioinformatics. One objective of the workshop was to show the “nuts and bolts” details of how R with C++ integration and the s...

2171 sym R (1219 sym/2 pcs) 2 img

Ryan Sheftel: "R on the Trading Desk"

30.05.2013

by Joseph Rickert In a post last week, I offered some first impressions about R/Finance 2013. Apparently, I was way off in estimating that 30% of the attendees were academics. The R/Finance organizers were quick to point out that the percentage of academics attending the conference has been a constant 10% over the years, and this year was no differen...

4988 sym

Intro to Parallel Random Number Generation with RevoScaleR

06.06.2013

by Joseph Rickert Random number generation is fundamental to doing computational statistics. As you might expect, R is very rich in random number resources. The R base code provides several high-quality random number generators, including Wichmann-Hill, Marsaglia-Multicarry, Super-Duper, Mersenne-Twister, Knuth-TAOCP-2002 and L’Ecuyer-CMRG. (Se...

4754 sym R (1911 sym/2 pcs) 2 img

Mini-tutorial for Quandl: How to access financial data with R

12.06.2013

by Joseph Rickert Quandl.com, the open source website for financial data, made rapid progress earlier this year in becoming an R-friendly source for financial time series data. Tammer Kamel, Quandl’s founder, introduced the site on the Revolutions blog in late February as a “search engine” for numerical data and explained how Quandl’s “Q-bo...

3668 sym R (2370 sym/2 pcs) 6 img

Draw nicer Classification and Regression Trees with the rpart.plot package

19.06.2013

by Joseph Rickert The basic way to plot a classification or regression tree built with R’s rpart() function is just to call plot. However, in general, the results just aren’t pretty. As it turns out, for some time now there has been a better way to plot rpart() trees: the prp() function in Stephen Milborrow’s rpart.plot package. This funct...

4054 sym R (1759 sym/1 pcs) 4 img

Learning Time Series with R

27.06.2013

by Joseph Rickert Late last Saturday afternoon I was reading in my usual spot at the Dana Street Coffee House in Mt. View. A stranger walking by my table noticed my copy of Madsen’s Time Series Analysis (sitting there untouched again), said he needed to learn something about time series, and asked if I could recommend a book. He looked serious so...

6427 sym R (737 sym/1 pcs)

rxDTree(): a new type of tree algorithm for big data

11.07.2013

by Joseph Rickert The rxDTree() function included in the RevoScaleR package distributed with Revolution R Enterprise is an example of a new class of algorithms that are being developed to deal with very large data sets. Although the particulars differ, what these algorithms have in common is the use of approximations, methods of summarizing or...

5395 sym R (2398 sym/1 pcs)

Deepen your R experience with Rcpp

17.07.2013

by Joseph Rickert It is very likely that even a very casual observer of what has been happening in the world of R these past few months would have come across some mention of Rcpp, the R package that greatly facilitates R and C++ integration. Rcpp is hot! Over 130 R packages now depend on Rcpp, and that number is likely to keep growing. The following plot built usi...

4777 sym R (723 sym/1 pcs) 2 img

Revolution Analytics Supports the R Community

25.07.2013

by Joseph Rickert Early on, Revolution Analytics realized that R is more than just a tool for statistical computing — it is also the culture that has grown up around the use of the tool. The R culture is open and inclusive, competitive but also nourishing. There is a strong sense of community that encourages contribution and growth. We very muc...

2627 sym

Statisticians: an endangered species?

01.08.2013

by Joseph Rickert Every month I look forward to getting my copy of AMSTATNEWS, the monthly magazine of the American Statistical Association, in the mail. This July, I was both pleased and bemused by ASA President Marie Davidian’s article Aren’t We Data Science?. I was pleased to see a follow-up to last month’s article The ASA and Big Data, ...

4730 sym