Publications by Michael kao
R package building automation
Title: R package building automation Inspired by the post at http://giventhedata.blogspot.tw/2013/02/my-r-package-development-cheat-sheet.html. I have decided to publish my cheat script for package development as well. Building package used to be a nightmare, filling in all those Rdfiles manually can cause some serious brain damage. Thanks to the...
1127 sym R (1698 sym/1 pcs)
Violin plots and regional income distribution
While preparing my slides for statistical graphics, a plot really caught my eye when I was playing around with the data. I started off by plotting the time seriesof GNI per capita by country, and as expected it got quite messy and incomprehensible. ## Download and manipulate the data library(FAOSTAT) raw.lst = getWDItoSYB(indicator = c("NY.GNP.P...
1830 sym R (1497 sym/3 pcs) 6 img
Tupper’s self-referential formula
Can't remember where I first came across this equation but the Tupper's self referential equation, is a very interesting formula that when graphed in two dimension plane it reproduces the formula. \[ \frac{1}{2} < \left\lfloor \bmod\left(\left\lfloor\frac{y}{17}\right\rfloor2^{-17\lfloor x\rfloor - \bmod(\lfloor y \rfloor, 17)}, 2\right)\right\...
1237 sym R (1029 sym/2 pcs) 2 img
First day of State of Food Insecurity (SOFI) 2013
The FAO flagship publication SOFI 2013 was release yesterday on the 1st of October, the publication is the most important report in monitoring the progress towards the 2015 Millenium Development Goal and ultimately eliminate hunger. I was interest in how the people responded, so I scrapped some data from Twitter and previous work to carry out s...
2324 sym 4 img
Accurate imputation and valid statistical inference with ensemble
Imputation is predictive inference and not causal inference! I have met many people, who consider the two are equivalent. Their reasoning is based on the belief that if you can produce a model which replicate the data generating mechanism, it will give you the best prediction. Which may or may not be true depending on your philosophy of math...
4833 sym R (2922 sym/1 pcs) 2 img
Why multiple imputation?
BackgroundIn the forth coming week, I will be giving a presentation on the fundamentals of imputation to my colleagues. One of the most important idea I would like to present is multiple imputation. In my last post, I have given a small example of multiple imputation, but it does not provide the evidence why we should use it. This is the aim o...
3098 sym R (4280 sym/5 pcs) 4 img
Spline interpolation of temporal resolution for satellite images.
This week, I had a discussion with a few of my colleagues on the possibility of utilizing remote sensing data or satellite images to improve our statistical estimation such as imputation. One source of interest is the Normalized Difference Vegetation Index which quantify the concentrations of green leaf vegetation around the globe. More detail...
3450 sym 8 img
3D Sine Wave
Had a headache last night, so decided to take things easy and just read posts Google+. Then I came across this post which seems interesting so I thought I would play around before I head to bed. First of all, I thought generating a square base would be much easier in R compare to hexagons. Starting with 2 numeric vectors and then expand them ...
1229 sym R (996 sym/1 pcs) 2 img