Publications by andrew

Infovis vs. statgraphics: A clear example of their different goals

29.07.2011

I recently came across a data visualization that perfectly demonstrates the difference between the “infovis” and “statgraphics” perspectives. Here’s the image (link from Tyler Cowen): That’s the infovis. The statgraphic version would simply be a dotplot, something like this: (I purposely used the default settings in R with only mino...

2019 sym R (428 sym/1 pcs) 4 img

NppToR 2.6.0 beta 2

29.07.2011

http://sourceforge.net/projects/npptor/files/npptor%20installer/NppToR-2.6.0.beta2.exe/download I’ve released beta 2 of NppToR 2.6.0.  Please take a look and report any problems.  This improves the installer and the uninstaller as well as a few bugs that popped up from the transition to UNICODE. Related To leave a comment for the author, pl...

685 sym

Using a “pure infographic” to explore differences between information visualization and statistical graphics

10.08.2011

Our discussion on data visualization continues. One one side are three statisticians–Antony Unwin, Kaiser Fung, and myself. We have been writing about the different goals served by information visualization and statistical graphics. On the other side are graphics experts (sorry for the imprecision, I don’t know exactly what these people do i...

8251 sym 4 img

Data manipulations

23.08.2011

In the last Utah R Users group meeting I gave a presentation on data manipulations on R, and today I found through the plyr mailing list two commands that I was previously unaware of that should definitely be made mention of. arrange I was very pleased to find arrange because it fills the nagging hole for sorting data frames.  Calling arrange(d...

1376 sym Python (143 sym/3 pcs)

Why it doesn’t make sense to chew people out for not reading the help page

12.10.2011

Karl Broman writes: Barry Rowlingson gave an interesting talk at UseR 2011, “Why R-help must die!” He suggested the Q-and-A type sites Stack Overflow (on programming) and Cross Validated (on statistics), both part of Stack Exchange. I haven’t used R-help recently but I do occasionally send people there. Just to see what was going on there,...

3162 sym

Web-friendly visualizations in R

19.10.2011

Aleks points me to this new tool from Wojciech Gryc. Right now I save my graphs as pdfs or pngs and then upload them to put them on the web. I expect I’ll still be doing this for awhile—I like having full control of what my graphs look like—but Gryc’s default plots might be useful for lots of people making their analyses more accessible. ...

1084 sym

Wickham R short course

14.11.2011

Hadley writes: I [Hadley] am going to be teaching an R development master class in New York City on Dec 12-13. The basic idea of the class is to help you write better code, focused on the mantra of “do not repeat yourself”. In day one you will learn powerful new tools of abstraction, allowing you to solve a wider range of problems with fewer ...

2123 sym

World record running times vs. distance

15.11.2011

Julyan Arbel plots world record running times vs. distance (on the log-log scale): The line has a slope of 1.1. I think it would be clearer to plot speed vs. distance—then you’d get a slope of -0.1, and the numbers would be more directly interpretable. Indeed, this paper by Sandra Savaglio and Vincenzo Carbone (referred to in the comments o...

1398 sym 4 img

Permanently Setting the CRAN repository

29.11.2011

Setting the CRAN repository so that it does not ask every time you try to install a package  is something that I think few people bother to do, but it is so simple and can save a fair bit of frustration when working.  This is accomplished through a setting in one of the Rprofile files.  There is the site file found at either /etc/R/Rprofile.s...

1290 sym R (178 sym/4 pcs)

CrossValidated: A place to post your statistics questions

16.12.2011

Seth Rogers writes: I [Rogers] am a member of an online community of statisticians where I burn a great deal of time (and a recovering cog sci researcher). Our community website is a peer-reviewed Q and A spanning stats topics ranging from applications to mathematical theory. Our online community consists of mostly university faculty, grad studen...

2110 sym 2 img