Publications by Joseph Rickert

Book Review: Graphical Data Analysis with R

07.04.2016

by Joseph Rickert Basically, there are two kinds of graphics or plots you can make from a data set: (1) those that allow you to see what is going on with the data, and (2) those you make to communicate what you have found to someone else. When making the first kind, you want to select plots that will enable you to see as much as possible while ta...

8246 sym 6 img

Get Involved with the R Consortium

14.04.2016

by Joseph Rickert The R Consortium, the non-profit trade organization formed under the Linux Foundation to support the R language and the R Community, is beginning to build real momentum. First of all, two new companies recently joined the Consortium: Avant, which provides online personal and auto loans, and Procogia, a consulting firm that helps c...

7798 sym 2 img

Get ready for R/Finance 2016

21.04.2016

by Joseph Rickert R/Finance 2016 is less than a month away and, as always, I am very much looking forward to it. In past years, I have elaborated on what puts it among my favorite conferences even though I am not a finance guy. R/Finance is small, single track and intense with almost no fluff. And scattered among the esoterica of finance and trad...

5414 sym 2 img

A Data Scientist’s Perspective on Microsoft R

26.04.2016

by Lixun Zhang, Data Scientist at Microsoft As a data scientist, I have experience with R. Naturally, when I was first exposed to Microsoft R Open (MRO, formerly Revolution R Open) and Microsoft R Server (MRS, formerly Revolution R Enterprise), I wanted to know the answers for 3 questions: What do R, MRO, and MRS have in common? What’s new in...

4856 sym 4 img

R Conferences: Europe 2016

28.04.2016

by Joseph Rickert Answering email queries from friends and acquaintances from around the world wanting to attend useR! 2016 has been painful. It is amazing that the conference sold out a full two months before its start, but upon reflection, not unbelievable. From its inception useR! has been an “academic” conference both in spirit and locati...

5023 sym 2 img

Reading Efron with R

02.05.2016

by Joseph Rickert When I first went to grad school, the mathematicians advised me to cultivate the habit of reading with a pencil. This turned into a lifelong habit and a useful skill for reading all sorts of things: literature, reports and newspapers, for example, not just technical papers. However, reading statistics and data science papers, or reall...

5457 sym R (812 sym/1 pcs) 4 img

Build a Gradient Boosted Trees Model with Microsoft R Server

03.05.2016

by Yuzhou Song, Microsoft Data Scientist R is an open source, statistical programming language with millions of users in its community. However, a well-known weakness of R is that it is both single threaded and memory bound, which limits its ability to process big data. With Microsoft R Server (MRS), the enterprise grade distribution of R for adv...

9743 sym 8 img

Bike Rental Demand Estimation with Microsoft R Server

10.05.2016

by Katherine Zhao, Hong Lu, Zhongmou Li, Data Scientists at Microsoft Bicycle rental has become popular as a convenient and environmentally friendly transportation option. Accurate estimation of bike demand at different locations and different times would help bicycle-sharing systems better meet rental demand and allocate bikes to locations. In ...

7724 sym R (8840 sym/8 pcs) 4 img 1 tbl

Good R Packages

12.05.2016

by Joseph Rickert What makes for a good R package? With over 8,000 packages up on CRAN, the quantity of packages is clearly not an issue for R users. Developing an instinct to recognize quality, however, both requires and deserves some effort. I regularly spend time on Dirk Eddelbuettel’s CRANberries site investigating new packages and monitorin...

5482 sym R (1149 sym/1 pcs) 4 img

Principal Components Regression in R, an operational tutorial

17.05.2016

John Mount, Ph.D., Data Scientist at Win-Vector LLC Win-Vector LLC's Dr. Nina Zumel has just started a two-part series on Principal Components Regression that we think is well worth your time. You can read her article here. Principal Components Regression (PCR) is the use of Principal Components Analysis (PCA) as a dimension reduction step prior to...

2887 sym 2 img