Publications by Ram

The Skills of a Data Miner

09.10.2011

Data mining is not only statistics, even if statistics is the most recognized academic component of it. It also includes data cleaning, machine learning and data visualization. The scarce factor is the ability to understand that data and extract value from it. Hal Varian, Google. The full article by Luca Sbardella published in Qua...

What does it mean to be a Data Scientist?

12.10.2011

Check out this talk by John Rauser of AMZN at the 2011 Strata Conf. It is an excellent intro to the field.

A true data-doodler – Christophe Ladroue (R ddply and plyr on Triathlon Results)

12.10.2011

To me, this post by Christophe Ladroue personifies what data doodlers do. They take a dataset that is of interest to them (in his case, his triathlon results) and then they manipulate the numbers to see what insights can be drawn. Most bloggers only show their final results which look great, but for our purposes (for wannabe data doodl...

Geo-doodlers – Paul Butler and FlowingData

16.10.2011

I found this great R-Visualization example via an R-Blogger post that xingmowang made. (One more good reason for why it is important to read lots of field-related blogs!) Here’s the image: If this was merely eye-candy, I would have enjoyed it, but not included it here. But to think that this was done in R — that means the rest of us...

Get the Basics right – Suggestion for R Beginners

17.10.2011

I am always looking for suggestions on how to get better at R, esp. for beginners. So when I see someone who’s gotten adept at it, I ask them how they got there. This weekend, at the Bay Area ACM Data Mining Camp, one person gave me what seemed like a good suggestion. Just wanted to share it here, for anyone else who’s just getting...

Fusion Tables by Google

18.10.2011

Google’s Fusion Tables look impressive, for those who want to try geo-visualizations of their data. You don’t need much programming experience to be able to use it. For those who want to try it out, here’s a nice intro that Kathryn Hurley presented at the recent SVCC (Silicon Valley Code Camp). When combined with ShpEscape (note ...

Tips for getting started on Kaggle (datamining)

30.11.2011

Ever since I heard about Kaggle.com at this year’s Bay Area Data Mining Camp, I’ve wanted to participate. But I was feeling somewhat intimidated. Jeremy Howard’s “Intro to Kaggle” talk at yesterday’s MeetUp (DataMining for a Cause) was exactly what I needed. He had a number of tips for beginners. His was exactly the talk that I wa...

O’Reilly’s Data Science Kit – Books

02.12.2011

It is not as if I don’t have enough books (and material on the web) to read. But this list compiled by the O’Reilly team should make any data analyst salivate. http://shop.oreilly.com/category/deals/data-science-kit.do The Books and Video included in the set are: Data Analysis with Open Source Tools, Designing Data Visualizations, An Introduction t...

A pre-requisite to be a Data Scientist

07.12.2011

So what should be in the toolkit of people who call themselves data scientists? A fundamental skill is the ability to manipulate data. A data scientist should be familiar and comfortable with a number of platforms and scripting tools to get the job done. What is difficult in Excel might be trivial in R. And when R struggles, you shoul...

Simulating Population Growth in Cities Using R

23.02.2013

R is great for anyone who wants to get started on learning simulation (both discrete-event and agent-based, with stochastic elements in the process). This post is inspired by Matt Asher’s “quick-and-dirty” R simulation work on Population Growth. Matt uses it to create aRt. I felt that his core idea provided a very good framework...
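
For readers who want a feel for what a stochastic population simulation looks like, here is a minimal sketch. It is written in Python rather than R, and the growth rule is an assumption made purely for illustration: each new arrival picks a city with probability proportional to its current population. This is not necessarily the rule Matt Asher or this post uses, and the names `simulate_city_growth`, `n_cities` and `n_steps` are hypothetical.

```python
import random


def simulate_city_growth(n_cities=5, n_steps=1000, seed=42):
    """Grow city populations one arrival at a time.

    Assumed rule (hypothetical, for illustration only): each new arrival
    joins a city with probability proportional to its current population.
    """
    rng = random.Random(seed)      # seeded so that runs are repeatable
    populations = [1] * n_cities   # every city starts with one resident
    for _ in range(n_steps):
        # Stochastic step: a weighted random choice of the destination city.
        chosen = rng.choices(range(n_cities), weights=populations, k=1)[0]
        populations[chosen] += 1
    return populations


if __name__ == "__main__":
    # Print final city sizes, largest first.
    print(sorted(simulate_city_growth(), reverse=True))
```

Changing `seed` gives a different history each time, while the total population always ends at `n_cities + n_steps`.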
