Publications by Christopher Bare

The Dream 8 Challenges

25.06.2013

The 8th iteration of the DREAM Challenges is underway. DREAM is something like the Kaggle of computational biology with an open science bent. Participating teams apply machine learning and statistical modeling methods to biological problems, competing to achieve the best predictive accuracy. This year’s three challenges focus on reverse engine...

3065 sym 2 img

Generate UUIDs in R

11.07.2013

Here's a snippet of R to generate a Version 4 UUID. Dunno why there wouldn’t be an official function for that in the standard libraries, but if there is, I couldn’t find it.

## Version 4 UUIDs have the form:
## xxxxxxxx-xxxx-4xxx-yxxx-xxxxxxxxxxxx
## where x is any hexadecimal digit and
## y is one of 8, 9, A, or B
## f47ac10b-58cc-...

633 sym R (695 sym/1 pcs)
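The format described in the excerpt above can be sketched in a few lines of base R. This is a minimal illustration, not the author's original snippet (which is truncated here): the function name `uuid_v4` is made up for this example, and it relies on `sample()`, which uses R's ordinary pseudo-random generator and is not cryptographically secure.

```r
# Hypothetical helper (not from the original post): build a Version 4 UUID
# of the form xxxxxxxx-xxxx-4xxx-yxxx-xxxxxxxxxxxx from random hex digits.
uuid_v4 <- function() {
  hex <- c(0:9, letters[1:6])            # "0".."9", "a".."f"
  x <- sample(hex, 30, replace = TRUE)   # 30 digits fill the x positions
  y <- sample(c("8", "9", "a", "b"), 1)  # y is one of 8, 9, a, or b
  paste0(
    paste(x[1:8], collapse = ""), "-",
    paste(x[9:12], collapse = ""), "-",
    "4", paste(x[13:15], collapse = ""), "-",   # version digit is fixed at 4
    y, paste(x[16:18], collapse = ""), "-",
    paste(x[19:30], collapse = "")
  )
}

id <- uuid_v4()
print(id)
```

Each call returns a 36-character string; calling `set.seed()` first makes the output reproducible, which is handy for tests but defeats the purpose of a unique ID in real use.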

Online class on Statistical Learning

24.01.2014

Trevor Hastie and Robert Tibshirani are teaching an online class on Statistical Learning starting this week. The first week is introduction and overview, so it’s not too late to join up. They’ve also published a new book, An Introduction to Statistical Learning, as a more accessible companion to their widely revered The Elements of Statistica...

2067 sym 4 img

Linear Models

07.02.2014


406 sym

Regression with multiple predictors

18.02.2014

Now that I'm ridiculously behind in the Stanford Online Statistical Learning class, I thought it would be fun to try to reproduce the figure on page 36 of the slides from chapter 3 or page 81 of the book. The result is a curvaceous surface that slices neatly through the data set. I’m not sure what incurable chemical imbalance explains why such ...

2703 sym R (1566 sym/7 pcs) 2 img

Lee Edlefsen on Big Data in R

03.12.2014

Lee Edlefsen, Chief Scientist at Revolution Analytics, spoke about Big Data in R at the FHCRC a week or two back. He introduced the parallel external memory algorithm (PEMA). “Parallel external memory algorithms (PEMA’s) allow solution of both capacity and speed problems, and can deal with distributed and streaming data.” When a problem i...

1612 sym 2 img
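The general idea behind an external memory algorithm is to process data one chunk at a time, keep only a small intermediate result per chunk, and combine those results at the end. The sketch below illustrates that pattern for a mean in base R. It is an assumption-laden toy, not Revolution Analytics' implementation: the name `chunked_mean` and the `chunk_size` parameter are invented for this example, and the chunks come from an in-memory vector rather than from disk or a stream.

```r
# Hypothetical helper: mean of a numeric vector computed chunk by chunk.
chunked_mean <- function(x, chunk_size = 1000) {
  stopifnot(length(x) > 0, chunk_size >= 1)
  starts <- seq(1, length(x), by = chunk_size)
  # Step 1: each chunk yields a small intermediate result (sum and count).
  # The chunks are independent, so this lapply() could be swapped for a
  # parallel equivalent such as parallel::mclapply().
  partials <- lapply(starts, function(s) {
    chunk <- x[s:min(s + chunk_size - 1, length(x))]
    c(sum = sum(chunk), n = length(chunk))
  })
  # Step 2: combine the intermediate results into the final answer.
  totals <- Reduce(`+`, partials)
  unname(totals["sum"] / totals["n"])
}

x <- c(1, 2, 3, 4, 5, 6, 7)
print(chunked_mean(x, chunk_size = 3))
```

Because only the per-chunk sums and counts are retained, memory use depends on the chunk size rather than on the size of the full data set, and the same combine step works whether the chunks arrive from one file, several machines, or a stream.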