Publications by R Views
Multiple Versions of R
Data scientists prefer using the latest R packages to analyze their data. To ensure a good user experience, you will need a recent version of R running on a modern operating system. If you run R on an production server – and especially if you use RStudio Connect – plan to support multiple versions of R side by side so that your code, reports,...
5389 sym Python (375 sym/1 pcs) 2 img
Feb 2018: “Top 40” New Package Picks
Here are my picks for the “Top 40” packages of the 171 new packages that made it to CRAN (and stuck) in February, organized into the following categories: Computational Methods, Data, Finance, Science, Statistics, Time Series, and Utilities. Computational Methods adnuts v1.0.0: Provides an implementation of the no-U-turn (NUTS) algorithm by H...
10691 sym 38 img
R and TensorFlow Presentations
In early March, the Bay Area useR Group was able to hold an R and TensorFlow mini-conference on Google’s new Sunnyvale campus. Pete Mohanty, a Stanford researcher and frequent BARUG speaker, lead off with a talk on his recent kerasformula package, which allows R users to call a keras-based neural net with R formula objects. Pete’s slides show...
1947 sym 1 tbl
Introduction to Fama French
In two previous posts, we calculated and then visualized the CAPM beta of a portfolio by fitting a simple linear model. Today, we move beyond CAPM’s simple linear regression and explore the Fama French (FF) multi-factor model of equity risk/return. For more background, have a look at the original article published in The Journal Financial Econo...
8721 sym R (7881 sym/18 pcs) 2 img
Reticulated Shiny
RStudio recently announced the reticulate package, which is designed to help R users inter-operate with Python code. I was immediately excited by this announcement. In a past life, I worked with a team at the National Renewable Energy Lab (NREL) on vehicle simulations. Their models could predict MPG for vehicles based on driving routes. At the ti...
5732 sym R (802 sym/5 pcs) 2 img
An Introduction to Greta
I was surprised by greta. I had assumed that the tensorflow and reticulate packages would eventually enable R developers to look beyond deep learning applications and exploit the TensorFlow platform to create all manner of production-grade statistical applications. But I wasn’t thinking Bayesian. After all, Stan is probably everything a Bayesia...
4490 sym R (3830 sym/9 pcs) 8 img
March 2018: “Top 40” New Package Picks
By my count, just over 200 new packages made it to CRAN and stuck during March. The trend for specialized, and sometimes downright esoteric science packages continues. I counted 40 new packages in this class. Most, but not all of these, are focused on bio-science applications. For example, the foreSIGHT package profiled below focuses on climate s...
10824 sym 40 img
Rolling Fama French
In a previous post, we reviewed how to import the Fama French 3-Factor data, wrangle that data, and then regress our portfolio returns on the factors. Please have a look at that previous post, as the following work builds upon it. For more background on Fama French, see the original article published in The Journal of Financial Economics, Common ...
3966 sym R (5444 sym/11 pcs) 2 img
2018 R Conferences
rstudio::conf 2018 and the New York R Conference are both behind us, but we are rushing headlong into the season for conferences focused on the R Language and its applications. The European R Users Meeting (eRum) begins this coming Monday, May 14th, in Budapest with three days of workshops and talks. Headlined by R Core member Martin Mächler an...
5626 sym 10 img
Enterprise Dashboards with R Markdown
This is a second post in a series on enterprise dashboards. See our previous post, Enterprise-ready dashboards with Shiny Databases. We have been living with spreadsheets for so long that most office workers think it is obvious that spreadsheets generated with programs like Microsoft Excel make it easy to understand data and communicate insights....
7968 sym R (593 sym/3 pcs) 10 img