Publications by Ryan Rosario

My First Few Days with RStudio

09.03.2011

As most readers are probably aware, the free IDE for R, called RStudio, was recently released for general use and it immediately made huge waves within the R community. IDE stands for Integrated Development Environment. IDEs typically provides a rich set tools developing in some target language. For standard programming languages like C++ (Visual...

11536 sym 14 img

Location Tracking on Android, too!

23.04.2011

This week it was revealed that the iPhone stores users’ locations, and this immediately caused a huge firestorm of commentary by tech geeks, panic among privacy advocates, and delight to data geeks like myself. Even better/worse, it seems that the iPhone caches location traces long-term, possibly back to the date the phone was activated. I ditc...

5855 sym R (517 sym/1 pcs) 2 img

EC2 Trials and Tribulations, Part 1 (Web Crawling)

11.05.2011

Elastic Compute Cloud (EC2) is a service provided a Amazon Web Services that allows users to leverage computing power without the need to build and maintain servers, or spend money on special hardware. The idea is simple, the user “boots” up one or more machines and then accesses those machines as if they were logged into any other machine re...

10629 sym Python (84 sym/1 pcs) 2 img

Review of 2011 Data Scientist Summit

13.05.2011

Some time over the past 6 weeks I randomly saw a tweet announcing the “Data Scientist Summit” and shortly below it I saw that it would be held in Las Vegas at the Venetian. Being a Data Scientist myself is reason enough to not pass up this opportunity, but Vegas definitely sweetens the deal! On Wednesday I woke up at 6am to partake on the 5.5...

16129 sym 56 img 7 tbl

SIGKDD 2011 Conference — Day 1 (Graph Mining and David Blei/Topic Models)

22.08.2011

I have been waiting for the KDD conference to come to California, and I was ecstatic to see it held in San Diego this year. AdMeld did an awesome job displaying KDD ads on the sites that I visit, sometimes multiple times per page. That’s good targeting! Mining and Learning on Graphs Workshop 2011 I had originally planned to attend the 2-day wor...

14159 sym 16 img 1 tbl

SIGKDD 2011 Conference — Days 2/3/4 Summary

27.08.2011

<< My review of Day 1. I am summarizing all of the days together since each talk was short, and I was too exhausted to write a post after each day. Due to the broken-up schedule of the KDD sessions, I group everything together instead of switching back and forth among a dozen different topics. By far the most enjoyable and interesting aspects of ...

18136 sym 16 img 1 tbl

“Hold Only That Pair of 2s?” Studying a Video Poker Hand with R

08.01.2012

Whenever I tell people in my family that I study Statistics, one of the first questions I get from laypeople is “do you count cards?” A blank look comes over their face when I say “no.” Look, if I am at a casino, I am well aware that the odds are against me, so why even try to think that I can use statistics to make money in this way? Alt...

8792 sym 28 img

Adventures at My First JSM (Joint Statistical Meetings) #JSM2012

06.08.2012

During the past few decades that I have been in graduate school (no, not literally) I have boycotted JSM on the notion that “I am not a statistician.” Ok, I am a renegade statistician, a statistician by training. JSM 2012 was held in San Diego, CA, one of the best places to spend a week during the summer. This time, I had no excuse not to go,...

14285 sym 34 img 3 tbl

Summary of My First Trip to Strata #strataconf

28.02.2013

In this post I am goIing to summarize some of the things that I learned at Strata Santa Clara 2013. For now, I will only discuss the conference sessions as I have a much longer post about the tutorial sessions that I am still working on and will post at a later date. I will add to this post as the conference winds down. The slides for most talks ...

22362 sym R (221 sym/2 pcs) 16 img 1 tbl