Publications by Vivek Patil
Motivating Students
Figure shown to students in a particular class to illustrate the effect of slacking after mid-semester grades are received. Background information: A course has multiple components – exams, projects, quizzes, assignments, etc. The objective of this set of plots was twofold – (a) to show students that their mid-term grade is not the same as the ...
1896 sym 2 img
Using ggplot2 to recreate 2012 Best Cities Results
Data from: http://images.businessweek.com/slideshows/2012-09-26/americas-50-best-cities This time, I used ggplot2 to recreate the graphs created previously using Tableau. After all, R and ggplot2 are open source and free. Yes, I could leave a space between bars for different cities, but I wanted to be as close as possible to the Tableau graphs pr...
1179 sym 6 img
Interacting, on demand, with 2012 best cities data and plotting different graphs – Experiments with ggplot2 on shiny server
Data from slides on Bloomberg Businessweek’s site: http://images.businessweek.com/slideshows/2012-09-26/americas-50-best-cities In this post, I show how one can interact with a dataset and generate graphs on the fly (technically, I’ve predetermined which graphs to create, but it should not be difficult to build a menu of possible graphs). Thi...
1833 sym 8 img
Medicare Provider Charge Data – How Are Things in Spokane?
Disclosure: I am not a health care professional, just someone interested in playing with data and ways to represent them. I am not going to interpret findings here, but will make a remark or two about how to interpret them from the few graphs shown below. This project came up in the past couple of days, when I came across headlines about wid...
4569 sym 14 img
Tweetanalytics – Interactively analyzing tweets from accounts of 5 universities
This is an attempt at learning text mining and interactively displaying a few results from Twitter data. Interactivity is implemented using RStudio’s shiny server. Their documentation of demo scripts came in very handy. As a non-user of Twitter, I had to open an account to get access to tweets. My first major source for information/functio...
4874 sym 4 img
Twitter Twitter on the Web, Who is the Most Popular of All? Interactively Determining Popularity of Two Entities on Twitter
Code updated based on feedback (see list of changes at the very end). Okay, that was a take on the mirror mirror on the wall quote from Snow White. This continues my saga of learning from the superb work done by the R community and building on their ideas. My first post on Twitter-related analysis relied on data downloaded at a particular time for...
8623 sym 2 img
Revisualizing the best cities in the US in 2012 – Shiny + googleVis = Incredibly powerful
This is the last time I will talk about visualizing the best cities of 2012 based on Bloomberg Businessweek’s rankings. An earlier post on this topic discussed interactive applications to plot bar graphs and histograms for different characteristics defining different cities. They used RStudio’s shiny server and ggplot2. The reas...
2165 sym 2 img
Analyses of the Best Undergraduate (US-based) Business Schools of 2013
Link to the code for the analysis program is given at the very end. I teach in a business school and am always fascinated by rankings published by different outlets. As much as one would like to think that they are meaningless, they seem to influence how many business schools actively try to manage their reputation. One such ranking of undergradua...
5336 sym 6 img
Visualizing a tiny slice of India’s demographics with information from Wikipedia
This post presents a tiny slice of a complex and diverse India using charts. (Data retrieval from Wikipedia on 9 July, 2013 and the analysis were performed using R; charts were generated using ggplot2, googleVis and wordcloud. More information can be found in the code used for this analysis at GitHub.) Objective: Present information regarding the fo...
3478 sym 20 img 1 tbl
Lilac Bloomsday Run in Graphs
The Lilac Bloomsday Run is a 7.46-mile (12 km) race held annually in Spokane, Washington. The run was started in 1977 by Don Kardong, and 51,613 people registered for it in 2013. In this post, I use R to collect data from their website, clean and organize them, and present some information about the run and its history using graphs generate...
1088 sym Python (799 sym/10 pcs) 20 img