Publications by Joseph Rickert

Bike Rental Demand Estimation with Microsoft R Server

10.05.2016

by Katherine Zhao, Hong Lu, Zhongmou Li, Data Scientists at Microsoft Bicycle rental has become popular as a convenient and environmentally friendly transportation option. Accurate estimation of bike demand at different locations and different times would help bicycle-sharing systems better meet rental demand and allocate bikes to locations. In ...

7724 sym R (8840 sym/8 pcs) 4 img 1 tbl

Good R Packages

12.05.2016

by Joseph Rickert What makes for a good R package? With over 8,000 packages up on CRAN the quantity of packages is clearly not an issue for R users. Developing an instinct to recognize quality, however, both requires and deserves some effort. I regularly spend time on Dirk Eddelbuettel’s CRANberries site investigating new packages and monitorin...

5482 sym R (1149 sym/1 pcs) 4 img

Principal Components Regression in R, an operational tutorial

17.05.2016

John Mount Ph. D.Data Scientist at Win-Vector LLC Win-Vector LLC's Dr. Nina Zumel has just started a two part series on Principal Components Regression that we think is well worth your time. You can read her article here. Principal Components Regression (PCR) is the use of Principal Components Analysis (PCA) as a dimension reduction step prior to...

2887 sym 2 img

User Groups and R Awareness

19.05.2016

by Joseph Rickert For quite a few years now we have attempted to maintain the Revolution Analytics' Local R User Group Directory as the complete and authoritative list of R user groups. Meetup groups make this list in one of two ways: we discover the group because they have a web page of some sort proclaiming the group to be focused on the R lan...

2348 sym 2 img

Principal Components Regression in R: Part 2

24.05.2016

by John Mount Ph. D.Data Scientist at Win-Vector LLC In part 2 of her series on Principal Components Regression Dr. Nina Zumel illustrates so-called y-aware techniques. These often neglected methods use the fact that for predictive modeling problems we know the dependent variable, outcome or y, so we can use this during data preparation in additi...

1967 sym 4 img

Some Impressions from R Finance 2016

27.05.2016

by Joseph Rickert R / Finance 2016 lived up to expectations and provided the quality networking and learning experience that longtime participants have come to value.  Eight years is a long time for a conference to keep its sparkle and pizzazz.  But, the conference organizers and the UIC have managed to create a vibe that keeps people coming ba...

5528 sym 6 img

Principal Components Regression in R: Part 3

31.05.2016

by John Mount Ph. D.Data Scientist at Win-Vector LLC In her series on principal components analysis for regression in R, Win-Vector LLC's Dr. Nina Zumel broke the demonstration down into the following pieces: Part 1: the proper preparation of data and use of principal components analysis (particularly for supervised learning or regression). Part...

3474 sym 4 img

Using caret to compare models

02.06.2016

by Joseph Rickert The model table on the caret package website lists more that 200 variations of predictive analytics models that are available withing the caret framework. All of these models may be prepared, tuned, fit and evaluated with a common set of caret functions. All on its own, the table is an impressive testament to the utility and sco...

2643 sym R (4940 sym/1 pcs) 6 img

Bayesian Optimization of Machine Learning Models

07.06.2016

by Max Kuhn: Director, Nonclinical Statistics, Pfizer Many predictive and machine learning models have structural or tuning parameters that cannot be directly estimated from the data. For example, when using K-nearest neighbor model, there is no analytical estimator for K (the number of neighbors). Typically, resampling is used to get good perfo...

6148 sym R (8109 sym/5 pcs) 10 img

R Consortium and User! 2016 News

09.06.2016

by Joseph Rickert IBM Joins the R Consortium This past Monday at the Spark Summit in San Francisco IBM announced that it had joined the R Consortium as a “Platinum” member. This is very good news with respect to the development and growth of the R language, the health of the R Community and the position of opensource software in the corporate...

6236 sym