Publications by Joseph Rickert
Coarse Grain Parallelism with foreach and rxExec
by Joseph Rickert I have written a several posts about the Parallel External Memory Algorithms (PEMAs) in Revolution Analytics’ RevoScaleR package, most recently about rxBTrees(), but I haven’t said much about rxExec(). rxExec() is not itself a PEMA, but it can be used to write parallel algorithms. Pre-built PEMAs such as rxBTrees(), rxLinMod...
5129 sym R (3254 sym/5 pcs)
Exploring San Francisco with choroplethrZip
by Ari Lamstein Introduction Today I will walk through an analysis of San Francisco Zip Code Demographics using my new R package choroplethrZip. This package creates choropleth maps of US Zip Codes and connects to the US Census Bureau. A choropleth is a map that shows boundaries of regions (such as zip codes) and colors those regions according to...
4604 sym 10 img
Where are the R users?
by Joseph Rickert A recent post by David Smith included a map that shows the locations of R user groups around the world. While is exhilarating to see how R user groups span the globe, the map does not give any idea about the size of the community at each location. The following plot, constructed from information on the websites of the groups li...
2708 sym 2 img
RPowerLabs: Electric power system virtual laboratories online
by Ben UbahFounder, RPowerLabs No disregard to R's colleagues, R is pioneering the creation of online virtual electric power system laboratories via RPowerLABS. RPowerLABS is a project, with the vision of deploying online, a vast array of highly demanded power system simulations for teaching and research using R. It started as an attempt to a...
4140 sym 10 img
R User Group Meetings this week in the Bay Area and around the world
by Joseph Rickert Tracking R user group meetings is a good way to stay informed about what's happening in the R world. On Tuesday the Bay Area useR Group (BARUG) met at AdRoll in San Francisco. It was a mini-conference with 6 talks: Bryan Galvin our host at AdRoll (many thanks for the pizza and beer) kicked off the evening by showing how his com...
3770 sym 2 img
R for more powerful clustering
by Vidisha VachharajaniFreelance Statistical Consultant R showcases several useful clustering tools, but the one that seems particularly powerful is the marriage of hierarchical clustering with a visual display of its results in a heatmap. The term “heatmap” is often confusing, making most wonder – which is it? A “colorful visual represen...
4455 sym R (1374 sym/2 pcs) 4 img
The new science journalism and open science
by Joseph Rickert The New York Times is quietly changing the practice of science journalism. The Tuesday April 21, 2015 article: Ebola Lying in Wait, reports on “A growing body of scientific clues – some ambiguous, other substantive” that the Ebola virus may have lain dormant in West African rain forest for years before igniting last year's...
2848 sym R (2369 sym/1 pcs)
The First NY R Conference
by Joseph Rickert Last Friday and Saturday the NY R Conference briefly lit up Manhattan's Union Square neighborhood as the center of the R world. You may have caught some of the glow on twitter. Jared Lander, volunteers from the New York Open Statistical Programming Meetup along with the staff at Workbench (the conference venue) set the bar pre...
3333 sym 4 img
Data Science in HR
by Joseph Rickert Last year in a post on interesting R topics presented at the JSM I described how data scientists in Google's human resources department were using R and predictive analytics to better understand the characteristics of its workforce. Google may very well have done the pioneering work, but predictive analytics for HR application...
1808 sym 2 img
Digging up embedded plots
by Joseph Rickert The following multi-panel graph, which graces the cover of the most recent issue of the Journal of Computational and Graphical Statistics ,JCGS, (Vol 24, Num 1, March 2015) is from the paper by Grolemund and Wickham entitled Visualizing Complex Data With Embedded Plots. The four plots are noteworthy for a couple or reasons: T...
4671 sym R (4244 sym/2 pcs) 4 img