Publications by R Views
January 2019: “Top 40” New CRAN Packages
One hundred and fifty-three new packages made it to CRAN in January. Here are my “Top 40” picks in eight categories: Computational Methods, Data, Machine Learning, Medicine, Science, Statistics, Utilities, and Visualization. Computational Methods cPCG v1.0: Provides a function to solve systems of linear equations using a (preconditioned) conj...
9007 sym 42 img
Some R Packages for ROC Curves
In a recent post, I presented some of the theory underlying ROC curves, and outlined the history leading up to their present popularity for characterizing the performance of machine learning models. In this post, I describe how to search CRAN for packages to plot ROC curves, and highlight six useful packages. Although I began with a few ideas abo...
5023 sym R (2936 sym/12 pcs) 18 img
Graph analysis using the tidyverse
It is because I am not a graph analysis expert that I thought it important to write this article. For someone who thinks in terms of single rectangular data sets, it is a bit of a mental leap to understand how to apply tidy principles to a more robust object, such as a graph table. Thankfully, there are two packages that make this work much easier...
8643 sym R (10778 sym/16 pcs) 8 img
Paid in Books: An Interview with Christian Westergaard
R is greatly benefiting from new users coming from disciplines that traditionally did not provoke much serious computation. Journalists1 and humanist scholars2, for example, are embracing R. But, does the avenue from the Humanities go both ways? In a recent conversation with Christian Westergaard, proprietor of Sophia Rare Books in Copenhagen, I ...
7582 sym 4 img
Parsnipping Fama French
Today, we will continue our exploration of developments in the world of tidy models, and we will stick with our usual Fama French modeling flow to do so. For new readers who want to get familiar with Fama French before diving into this post, see here where we covered importing and wrangling the data, here where we covered rolling models and visualiz...
5657 sym R (5537 sym/13 pcs) 2 img
The reticulate package solves the hardest problem in data science: people
Andrew Mangano is the Director of eCommerce Analytics at Albertsons Companies. Part I – Modelling The reticulate package integrates Python within R and, when used with RStudio 1.2, brings the two languages together like never before. Much more important than the technical details of how it all works is the impact that it has on both individu...
6341 sym R (2755 sym/10 pcs) 10 img
How to Avoid Publishing Credentials in Your Code
Roland Stevenson is a data scientist and consultant who may be reached on LinkedIn. When accessing an API or database in R, it is often necessary to provide credentials such as a login name and password. You may find yourself being prompted with something like this: When writing an R script that requires a user to provide credentials, you will w...
4365 sym R (1517 sym/3 pcs) 2 img
February 2019: “Top 40” New CRAN Packages
One hundred and fifty-one new packages arrived at CRAN in February. Here are my “Top 40” picks organized into eight categories: Bioinformatics, Data, Machine Learning, Medicine, Statistics, Time Series, Utilities and Visualization. Bioinformatics Cascade v1.7: Implements a modeling tool allowing gene selection, reverse engineering, and predict...
9063 sym 46 img
RInside Help in Testing
A problem that arises when building R interfaces to C/C++ libraries involves testing: how to go about replicating the existing C/C++ tests in R without undue effort. If the C/C++ tests are simple and small enough, they can be manually translated. However, when there are many tests, and each test initializes its own large data structures, the task beco...
5195 sym R (8225 sym/7 pcs)
How to share R visualizations in Microsoft PowerPoint
Hadrien Dykiel is an RStudio Customer Success Engineer. Microsoft PowerPoint is often the de facto choice for creating presentation slides, especially at larger companies. In many organizations, it comes pre-installed on workstations and pretty much everybody knows how to use it. This can make it an effective medium for sharing information, since ...
3473 sym 2 img