Publications by Millsy
sab-R-metrics: Basic Applied Regression (OLS)
Today, I’ll again be using a new data set that can be found here at my website (called ‘leagueoutcomes.csv’). The data set includes the standings results of the 2009 season for MLB along with average game attendance by team. I’ll use this to go over some basic regression techniques and tools in R. Hopefully this tutorial wi...
13782 sym 10 img
sab-R-metrics: Multiple Regression and Interactions
Last time, I covered ordinary least squares with a single variable. This time, I’ll extend this to using multiple predictor variables in a regression, interacting terms in R, and start thinking about using polynomials of certain terms in the regression (like Age and Age Squared). This should be a pretty straight forward tutorial, ...
15619 sym R (1335 sym/1 pcs)
sab-R-metrics: Brief Sidetrack for Scatterplot Matrices
In my last two posts I talked about Ordinary Least Squares, then extended this discussion to the multiple predictor case and briefly talked about some of the problems that may arise. These problems can include omitted variable bias, heteroskedasticity, non-normality, and multicollinearity. Most of these problems are relatively minor...
11277 sym R (1184 sym/1 pcs) 10 img
Having a problem with R-2.12.2 64-bit and "gam’ package!
While working with some pitch location data recently, I ran across something strange when using my new computer (with R-2.12.2 64-bit) versus my work computer (with R-2.11.1 x64). Both are 64-bit computers, but I got the new one for portability (it’s a laptop) and speed.Anyway, I had been doing some work in the office with Pitch F/...
3511 sym R (2264 sym/1 pcs) 6 img
sab-R-metrics Sidetrack: Bubble Plots
While I had mentioned in my last post that I will cover logistic regression in my next post, I decided that a quick interlude in working with bubble plots would be fun. Bubble plots have become pretty popular recently, especially with all of the Visualization Challenges I’ve seen around the internet (by the way, I think people in t...
12328 sym R (3670 sym/1 pcs) 12 img
Trackman Position Needs R Knowledge
Thought some of the R-Blogger readers would be interested in the position linked below. If you’re a baseball fan and like working in R, this is a fun company that seems to be getting more and more press. Recently, it was featured in Sports Illustrated and has been covered on ESPN as well.http://www.workinsports.com/wisquickregappl...
1120 sym
sab-R-metrics: Logistic Regression
It’s been a while since my last sab-R-metrics post, and I have not gotten to the real fun stuff yet. I apologize for the long layoff, and it’s likely that these will be sparse for the next couple weeks. I have had some consulting opportunities come up, I’ve got 6 (possibly 7) presentations or co-authored presentations coming u...
16088 sym 4 img
sab-R-metrics: Basics of LOESS Regression
Last week, I left you off at logistic regression. This week, I’ll be pushing the limits of regression analysis a bit more with a smoothing technique called LOESS regression. There are a number of smoothing methods that can be used, such as Smoothing Splines or simple Local Linear Regression; however, I’m going to cover LOESS (lo...
17278 sym 16 img
sab-R-metrics: Kernel Density Smoothing
Last time I left you, I had gone over some basics of doing loess regression in R. If you remember, loess is a sort of regression that allows wigglyness in your regression of some dependent variable Y on some independent variable X (I will generalize this to more than one dimension later on). However, sometimes we’re not always int...
11007 sym R (2821 sym/1 pcs) 10 img
sab-R-metrics: GIF Movies and Pitch Flights (Guest Post)
A couple weeks ago, I received an email from a fellow Pitch F/Xer and R-User, Josh Weinstock, asking if I was interested in a guest post here at Prince of Slides. I didn’t think I was important enough to have talented guests posting at my blog; however, Josh pointed out that this site tends to be the place for those who are part of...
8966 sym 10 img