Publications by Yanchang Zhao
A simple example of parallel computing on a Windows (and also Mac) machine
by Yanchang Zhao, RDataMining.com With a Mac, parallel computing can be achieved with package multicore. Unfortunately, it does not work under Windows. A simple way for parallel computing under Windows (and also Mac) is using package snowfall, which can work with multi-CPU or multi-core on a single machine, as well as a cluster of multiple machin...
1190 sym 16 img
Book “R and Data Mining: Examples and Case Studies” on CRAN
by Yanchang Zhao, RDataMining.com My book in draft titled “R and Data Mining: Examples and Case Studies” is now available on CRAN at http://cran.r-project.org/other-docs.html. It is scheduled to be published by Elsevier in late 2012. Its latest version can be downloaded at http://www.rdatamining.com/docs. The book presents many examples on da...
1183 sym 16 img
An Example of Social Network Analysis with R using Package igraph
by Yanchang Zhao, RDataMining.com This post presents an example of social network analysis with R using package igraph. The data to analyze is Twitter text data of @RDataMining used in the example of Text Mining, and it can be downloaded as file “termDocMatrix.rdata” at the Data webpage. Putting it in a general scenario of social networks, th...
3478 sym R (924 sym/2 pcs) 20 img
CFP: the 10th Australasian Data Mining Conference (AusDM 2012)
The Tenth Australasian Data Mining Conference (AusDM 2012) Sydney, Australia 5-7 December 2012 http://ausdm12.togaware.com/ Data mining, the art and science of intelligent analysis of (usually large) data sets for meaningful (and previously unknown) insights, is now being actively applied in industries including defence, medicine, science, financ...
5553 sym 16 img
PDF slides and R code examples on Data Mining and Exploration
by Yanchang Zhao, RDataMining.com There are some nice slides and R code examples on Data Mining and Exploration at http://www.inf.ed.ac.uk/teaching/courses/dme/, which are listed below. PDF Slides: – Overview of Data Mining http://www.inf.ed.ac.uk/teaching/courses/dme/2012/slides/datamining_intro4up.pdf – Visualizing Data http://www.inf.ed.ac...
1585 sym 16 img
An example on sentiment analysis with R
by Yanchang Zhao, RDataMining.com There is a nice example on sentiment analysis with R at <http://viksalgorithms.blogspot.com.au/2012/06/tracking-us-sentiments-over-time-in.html>. In the example, the Wikileaks cable corpus is analyzed to track US sentiments of other countries and their presidents over time. The example describes how to map senten...
1055 sym 16 img
A tutorial on outlier detection techniques
by Yanchang Zhao, RDataMining.com There is an excellent tutorial on outlier detection techniques, presented by Hans-Peter Kriegel et al. at ACM SIGKDD 2010. It presents many popular outlier detection algorithms, most of which were published between mid 1990s and 2010, including – statistical tests, – depth-based approaches, – deviation-base...
980 sym 16 img
Data Mining In Excel: Lecture Notes and Cases
by Yanchang Zhao, RDataMining.com It is a 270-page book on data mining with Excel. It can be downloaded as a PDF file at http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.83.1393&rep=rep1&type=pdf. Below is its table of contents. – Overview of the Data Mining Process – Data Exploration and Dimension Reduction – Evaluating Classificat...
979 sym 16 img
2nd CFP: the 10th Australasian Data Mining Conference (AusDM 2012)
The Tenth Australasian Data Mining Conference (AusDM 2012) Sydney, Australia, 5-7 December 2012 http://ausdm12.togaware.com/ The Australasian Data Mining Conference has established itself as the premier Australasian meeting for both practitioners and researchers in data mining. This year’s conference, AusDM’12, co-hosted with the Australian J...
2176 sym 16 img
Examples and resources on association rule mining with R
by Yanchang Zhao, RDataMining.com The technique of association rules is widely used for retail basket analysis, as well as in other applications to find assocations between itemsets and between sets of attribute-value pairs. It can also be used for classification by using rules with class labels on the right-hand side. It is even used for outlier...
2463 sym 16 img