Publications by R Views

Communicating results with R Markdown

31.10.2018

In my training as a consultant, I learned that long hours of analysis were typically followed by equally long hours of preparing for presentations. I had to turn my complex analyses into recommendations, and my success as a consultant depended on my ability to influence decision makers. I used a variety of tools to convey my insights, but over ti...

5603 sym 2 img 2 tbl

In-database xgboost predictions with R

06.11.2018

Moving predictive machine learning algorithms into large-scale production environments can present many challenges. For example, problems arise when attempting to calculate prediction probabilities (“scores”) for many thousands of subjects using many thousands of features located on remote databases. xgboost, a popular algorithm for c...

5009 sym R (2425 sym/5 pcs) 4 img

A Mathematician’s Perspective on Topological Data Analysis and R

13.11.2018

A few years ago, when I first became aware of Topological Data Analysis (TDA), I was really excited by the possibility that the elegant theorems of Algebraic Topology could provide some new insights into the practical problems of data analysis. But time has passed, and the sober assessment of Larry Wasserman seems to describe where things stand. ...

15403 sym 2 img

Many Factor Models

18.11.2018

Today, we will return to the Fama French (FF) model of asset returns and use it as a proxy for fitting and evaluating multiple linear models. In a previous post, we reviewed how to run the FF three-factor model on the returns of a portfolio. That is, we ran one model on one set of returns. Today, we will run multiple models on multiple streams of...

7255 sym R (13827 sym/18 pcs) 6 img

Slack and Plumber, Part Two

26.11.2018

This is the final entry in a three-part series about the plumber package. The first post introduces plumber as an R package for building REST API endpoints in R. The second post builds a working example of a plumber API that powers a Slack slash command. In this final entry, we will secure the API created in the previous post so that it only resp...

9501 sym R (4331 sym/5 pcs) 4 img

October 2018: “Top 40” New Packages

28.11.2018

One hundred eighty-five new packages made it to CRAN in October. Here are my picks for the “Top 40” in eight categories: Computational Methods, Data, Machine Learning, Medicine, Science, Statistics, Utilities, and Visualization. Computational Methods compboost v0.1.0: Provides a C++ implementation of component-wise boosting written to obtain ...

9810 sym 30 img

Statistics in Glaucoma: Part I

02.12.2018

Samuel Berchuck is a Postdoctoral Associate in Duke University’s Department of Statistical Science and Forge-Duke’s Center for Actionable Health Data Science. Joshua L. Warren is an Assistant Professor of Biostatistics at Yale University. Introduction: Glaucoma is a leading cause of blindness worldwide, with a prevalence of 4% in the populati...

8437 sym R (1305 sym/7 pcs) 6 img

Statistics in Glaucoma: Part II

06.12.2018

Samuel Berchuck is a Postdoctoral Associate in Duke University’s Department of Statistical Science and Forge-Duke’s Center for Actionable Health Data Science. Joshua L. Warren is an Assistant Professor of Biostatistics at Yale University. Analyzing Visual Field Data: In Part I of this series on statistics in glaucoma, we detailed the use of vi...

10035 sym R (3495 sym/10 pcs) 4 img

Rsampling Fama French

12.12.2018

Today we will continue our work on Fama French factor models, but more as a vehicle to explore some of the awesome stuff happening in the world of tidy models. For new readers who want to get familiar with Fama French before diving into this post, see here where we covered importing and wrangling the data, here where we covered rolling models and vi...

10301 sym R (7601 sym/19 pcs)

Statistics in Glaucoma: Part III

17.12.2018

Samuel Berchuck is a Postdoctoral Associate in Duke University’s Department of Statistical Science and Forge-Duke’s Center for Actionable Health Data Science. Joshua L. Warren is an Assistant Professor of Biostatistics at Yale University. Looking Forward in Glaucoma Progression Research: The contribution of the womblR package and correspondin...

8420 sym R (3744 sym/6 pcs) 4 img