Publications by Joseph Rickert

The Collatz Conjecture Continued

09.09.2014

by Seth Mottaghinejad Let's review the Collatz conjecture, which says that given a positive integer n, the following recursive algorithm will always terminate: if n is 1, stop, otherwise recurse on the following if n is even, then divide it by 2 if n is odd, then multiply it by 3 and add 1 In our last post, we created a function called 'cpp_col...

8876 sym R (8934 sym/12 pcs)

R at Conferences this Fall

11.09.2014

by Joseph Rickert The days are getting shorter here in California and the summer R conferences UseR!2014 and JSM are behind us, but there are still some very fine conferences for R users to look forward to before the year ends.  DataWeek starts in San Francisco on September 15th. I will be conducting a bootcamp for new R users, and on Wedneday t...

3159 sym

Comparing machine learning models in R

18.09.2014

by Joseph Rickert While preparing for the DataWeek R Bootcamp that I conducted this week I came across the following gem. This code, based directly on a Max Kuhn presentation of a couple years back, compares the efficacy of two machine learning models on a training data set. #----------------------------------------- # SET UP THE PARAMETER SPAC...

2045 sym R (1882 sym/1 pcs) 2 img

How to publish R and ggplot2 to the web

23.09.2014

by Matt Sundquist, Plotly Co-founder It's delightfully smooth to publish R code, plots, and presentations to the web. For example: Shiny makes interactive apps from R. Pretty R highlights R code for HTML. Slidify makes slides from R Markdown. Knitr and RPubs let you publish R Markdown docs. GitHub and devtools let you quickly release packages a...

3918 sym R (878 sym/4 pcs) 14 img

DescTools: a new R "misc package"

25.09.2014

by Joseph Rickert One of the most difficult things about R, a problem that is particularly vexing to beginners, is finding things. This is an unintended consequence of R's spectacular, but mostly uncoordinated, organic growth. The R core team does a superb job of maintaining the stability and growth of the R language itself, but the innovation en...

2893 sym R (3135 sym/4 pcs) 4 img 1 tbl

Why are we still teaching T-tests?

30.09.2014

The following post by Norm Matloff originally appeared on his blog, Mad(Data)Scientist, on September 15th. We rarely republish posts that have appeared on other blogs, however, the questions that Norm raises both with respect to the teaching of statistics, and his assertion that “R's statistical procedures are centered far too much on significa...

10428 sym

R and Data Science Webinar

02.10.2014

by Joseph Rickert Recently, I had the opportunity to present a webinar on R and Data Science. The challenge with attempting this sort of thing is to say something interesting that does justice to the subject while being suitable for an audience that may include both experienced R users and curious beginners. The approach I settled on had three pa...

2163 sym

The Generalized Lambda Distribution and GLDEX Package: Fitting Financial Return Data

07.10.2014

by Daniel Hanson, with contributions by Steve Su (author of the GLDEX package). Part 1 of a series. Introduction As most readers are well aware, market return data tends to have heavier tails than that which can be captured by a normal distribution; furthermore, skewness will not be captured either. For this reason, a four parameter distribution ...

11695 sym 8 img

A Note on Tweedie

09.10.2014

by Joseph Rickert In a recent post I talked about the information that can be developed by fitting a Tweedie GLM to a 143 million record version of the airlines data set. Since I started working with them about a year or so ago, I now see Tweedie models everywhere. Basically, any time I come across a histogram that looks like it might be a sample...

4105 sym 4 img

The Generalized Lambda Distribution and GLDEX Package for Fitting Financial Return Data – Part 2

14.10.2014

Part 2 of a seriesby Daniel Hanson, with contributions by Steve Su (author of the GLDEX package)   Recap of Part 1 In our previous article, we introduced the four-parameter Generalized Lambda Distribution (GLD) and looked at fitting a 20-year set of returns from the Wilshire 5000 Index, comparing the results of two methods, namely the Method of ...

8982 sym 8 img