Publications by Vivek Patil

Motivating Students

13.04.2013

Figure shown to students in a particular class to show the effect of slacking after the mid-semester grades are received. Background information: A course has multiple components: exams, projects, quizzes, assignments, etc. The objective of this set of plots was twofold: (a) to show students that their mid-term grade is not the same as the ...

1896 sym 2 img

Using ggplot2 to recreate 2012 Best Cities Results

18.04.2013

Data from: http://images.businessweek.com/slideshows/2012-09-26/americas-50-best-cities This time, I used ggplot2 to recreate the graphs created previously using Tableau. After all, R and ggplot2 are open source and free. Yes, I could leave a space between bars for different cities, but I wanted to be as close as possible to the Tableau graphs pr...

1179 sym 6 img

Interacting, on demand, with 2012 best cities data and plotting different graphs: Experiments with ggplot2 on shiny server

25.04.2013

Data from slides on Bloomberg Businessweek’s site: http://images.businessweek.com/slideshows/2012-09-26/americas-50-best-cities In this post, I show how one can interact with a dataset and generate graphs on the fly (technically, I’ve predetermined which graphs to create, but it should not be difficult to build a menu of possible graphs). Thi...

1833 sym 8 img

Medicare Provider Charge Data – How Are Things in Spokane?

09.05.2013

Disclosure: I am not a health care professional, just someone interested in playing with data and ways to represent them. I am not going to interpret findings here, but will make a remark or two about how to interpret them from the few graphs shown below. This project came up in the past couple of days, when I came across headlines about wid...

4569 sym 14 img

Tweetanalytics – Interactively analyzing tweets from accounts of 5 universities

01.06.2013

This is an attempt at learning text mining and interactively displaying a few results from Twitter data. Interactivity is implemented using RStudio’s shiny server. Their documentation of demo scripts came in very handy. As a non-user of Twitter, I had to open an account to get access to tweets. My first major source for information/functio...

4874 sym 4 img

Twitter Twitter on the Web, Who is the Most Popular of All? Interactively Determining Popularity of Two Entities on Twitter

12.06.2013

Code updated based on feedback (see list of changes at the very end). Okay, that was a take on the "mirror, mirror on the wall" quote from Snow White. This continues my saga of learning from the superb work done by the R community and building on their ideas. My first post on Twitter-related analysis relied on data downloaded at a particular time for...

8623 sym 2 img

Revisualizing the best cities in the US in 2012- Shiny + googleVis = Incredibly powerful

23.06.2013

This is the last time I will talk about visualizing the best cities of 2012 based on Bloomberg Businessweek’s rankings. An earlier post on this topic discussed interactive applications to plot bar graphs and histograms for different characteristics defining different cities. They used RStudio’s shiny server and ggplot2. The reas...

2165 sym 2 img

Analyses of the Best Undergraduate (US-based) Business Schools of 2013

27.06.2013

Link to the code for the analysis program is given at the very end. I teach in a business school and am always fascinated by rankings published by different outlets. As much as one would like to think that they are meaningless, they seem to influence how many business schools actively try to manage their reputation. One such ranking of undergradua...

5336 sym 6 img

Visualizing a tiny slice of India’s demographics with information from Wikipedia

10.07.2013

This post presents a tiny slice of a complex and diverse India using charts. (Data retrieval from Wikipedia on 9 July 2013 and the analysis were performed using R; charts were generated using ggplot2, googleVis and wordcloud. More information can be found in the code used for this analysis at GitHub.) Objective: Present information regarding the fo...

3478 sym 20 img 1 tbl

Lilac Bloomsday Run in Graphs

17.07.2013

The Lilac Bloomsday Run is a 7.46-mile (12 km) race held annually in Spokane, Washington. The run was started in 1977 by Don Kardong, and 51,613 people registered for it in 2013. In this post, I use R to collect data from their website, clean and organize them, and present some information about the run and its history using graphs generate...

1088 sym Python (799 sym/10 pcs) 20 img