Publications by John Mount

Does replyr::let work with data.table?

24.12.2016

I’ve been asked if the adapter “let” from our R package replyr works with data.table. My answer is: it does work. I am not a data.table user so I am not the one to ask if data.table benefits a from a non-standard evaluation to standard evaluation adapter such as replyr::let. Using replyr::let with data.table looks like the following: l...

953 sym R (354 sym/1 pcs) 2 img

Data Preparation, Long Form and tl;dr Form

26.12.2016

Data preparation and cleaning are some of the most important steps of predictive analytic and data science tasks. They are laborious, where most of the errors are made, your last line of defense against a wild data, and hold the biggest opportunities for outcome improvement. No matter how much time you spend on then, they still seem like a negl...

2053 sym R (834 sym/1 pcs) 2 img

A Theory of Nested Cross Simulation

01.01.2017

[Reader’s Note. Some of our articles are applied and some of our articles are more theoretical. The following article is more theoretical, and requires fairly formal notation to even work through. However, it should be of interest as it touches on some of the fine points of cross-validation that are quite hard to perceive or discuss without the...

2525 sym 2 img

Why do Decision Trees Work?

06.01.2017

In this article we will discuss the machine learning method called “decision trees”, moving quickly over the usual “how decision trees work” and spending time on “why decision trees work.” We will write from a computational learning theory perspective, and hope this helps make both decision trees and computational learning theory mor...

22229 sym R (985 sym/1 pcs) 4 img

Upgrading to macOS Sierra (nee OSX) for R users

26.01.2017

A good fraction of R users use Apple computers. Apple machines historically have sat at a sweet spot of convenience, power, and utility: Convenience: Apple machines are available at retail stores, come with purchasable support, and can run a lot of common commercial software. Power: R packages such as parallel and Rcpp work better on top of a P...

5210 sym R (614 sym/3 pcs) 2 img

Upcoming Win-Vector LLC public speaking engagements

26.01.2017

I am happy to announce a couple of exciting upcoming Win-Vector LLC public speaking engagements. BARUG Meetup Tuesday, Tuesday February 7, 2017 ~7:50pm, Intuit, Building 20, 2600 Marine Way, Mountain View, CA. Win-Vector LLC’s John Mount will be giving a “lightning talk” (15 minutes) on R calling conventions (standard versus non-standard)...

1236 sym

Using the Bizarro Pipe to Debug magrittr Pipelines in R

29.01.2017

I have just finished and released a new R video lecture demonstrating how to use the “Bizarro pipe” to debug magrittr pipelines. I think R dplyr users will really enjoy it. Please read on for the link to the video lecture. In this video lecture I use the “Bizarro pipe” to debug the example pipeline from RStudio’s purrr announcement. ...

744 sym 4 img

Going to Strata / Hadoop World 2017 San Jose?

02.02.2017

Are you attending or considering attending Strata / Hadoop World 2017 San Jose? Are you interested in learning to use R to work with Spark and h2o? Then please consider signing up for my 3 1/2 hour workshop soon. We are about half full now, but I really want to fill the room, while making sure that people who really want to go get in. Win-Ve...

1360 sym

Evolving R Tools and Practices

05.02.2017

One of the distinctive features of the R platform is how explicit and user controllable everything is. This allows the style of use of R to evolve fairly rapidly. I will discuss this and end with some new notations, methods, and tools I am nominating for inclusion into your view of the evolving “current best practice style” of working with ...

6861 sym R (83 sym/1 pcs) 8 img

My recent BARUG talk: Parametric Programming in R with replyr

09.02.2017

I want to share an edited screencast of my rehearsal for my recent San Francisco Bay Area R Users Group talk: Related To leave a comment for the author, please follow the link and comment on their blog: R – Win-Vector Blog. R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click he...

514 sym