Publications by Joseph Rickert

A Machine Learning Result

05.02.2015

by Joseph Rickert Learning to effectively use any of the dozens of popular machine learning algorithms requires mastering many details and dealing with all kinds of practical issues. With all of this to consider, it might not be apparent to a person coming to machine learning from a background other than computer science or applied math that ther...

5038 sym R (797 sym/1 pcs) 2 img

rcrunchbase – An API Interface to CrunchBase

10.02.2015

James PeruvankalSr. Program Manager, Revolution Analytics Information about the technology business ecosystems is valuable to both established companies as well as startups. Fortunately CrunchBase – the world’s most comprehensive dataset of startup activity, captures quite a bit of such information. Founded in 2007 by Mike Arrington, CrunchBa...

3161 sym R (612 sym/3 pcs)

The HP Workshop on Distributed Computing in R

12.02.2015

by Joseph Rickert In the last week of January, HP Labs in Palo Alto hosted a workshop on distributed computing in R that was organized by Indrajit Roy (Principal Researcher, HP) and Michael Lawrence (Genentech and R-core member). The goal was to bring together a small group of R developers with significant experience in parallel and distributed c...

7917 sym R (184 sym/1 pcs) 2 img

Some R Conferences in 2015

19.02.2015

by Joseph Rickert For the past few years, the Strata + Hadoop World Conference in San Jose has kicked off my personal conference season. With its focus on Data Science, Strata always seems to present some interesting R related talks, and I am looking forward to the various events over the next couple of days. But, Strata and other large conferenc...

6558 sym

Collaborative Computing with distcomp

26.02.2015

by Joseph Rickert Distcomp, a new R package available on GitHub from a group of Stanford researchers has the potential to significantly advance the practice of collaborative computing with large data sets distributed over separate sites that may be unwilling to explicitly share data. The fundamental idea is to be able to rapidly set up a web serv...

4099 sym 2 img

Plotly Graphs with Domino’s New R Notebook

03.03.2015

by Matt Sundquistco-founder of Plotly Domino's new R Notebook and Plotly's R API let you code, make interactive R and ggplot2 graphs, and collaborate entirely online. Here is the Notebook in action: Published R Notebook To execute this Notebook, or to build your own, head to Domino's Plotly Project. The GIF below shows how to get started: cho...

3037 sym 8 img

R User Group Activity

05.03.2015

by Joseph Rickert R user group activity is still on the rise. The following plot of the number of R user group meetings listed on Revolution Analytics' Community Calendar over the most recent 114 weeks shows a slight  to upward trend along with a couple of annual cycles. Predictably, meetings trail off in the summer months and again late in the...

3105 sym 2 img

A Monte Carlo Simulation for Pi Day

12.03.2015

by Joseph Rickert What will you be doing at 26 minutes and 53 seconds past 9 this coming Saturday morning? I will probably be running simulations. I have become obsessed with an astounding result from number theory and have been trying to devise Monte Carlo simulations to get at it. The result, well known to number theorists says: choose two inte...

3551 sym R (1080 sym/3 pcs) 2 img

Some thoughts on Vim

17.03.2015

by Gary R. MoserDirector of Institutional Research and PlanningThe California Maritime Academy I recently contacted Joseph Rickert about inviting Vim guru Drew Niel (web: vimcasts.org, book: “Practical Vim: Edit Text at the Speed of Thought”) to speak at the Bay Area R User Group group. Due to Drew's living in Great Britain that might not be...

4895 sym 4 img

A first look at rxBTrees

19.03.2015

by Joseph Rickert The gradient boosting machine as developed by Friedman, Hastie, Tibshirani and others, has become an extremely successful algorithm for dealing with both classification and regression problems and is now an essential feature of any machine learning toolbox. R’s gbm() function (gbm package) is a particularly well crafted implem...

5281 sym R (2462 sym/2 pcs) 4 img