Publications by Yanchang Zhao

An excellent introduction to MapReduce and Hadoop

19.07.2013

by Yanchang Zhao, RDataMining.com The lectures in week 3 of a free online course Introduction to Data Science give an excellent introduction to MapReduce and Hadoop, and demonstrate with examples how to use MapReduce to do various tasks, such as, word frequency counting, matrix multiplication, simple social network analysis, and a join operation ...

1192 sym 4 img

Step by step to build my first R Hadoop System

20.08.2013

by Yanchang Zhao, RDataMining.com After reading documents and tutorials on MapReduce and Hadoop and playing with RHadoop for about 2 weeks, finally I have built my first R Hadoop system and successfully run some R examples on it. My experience and steps to achieve that are presented at http://www.rdatamining.com/tutorials/rhadoop. Hopefully it wi...

1392 sym 4 img

Preview of book Data Mining Applications with R

03.12.2013

An edited book titled Data Mining Applications with R will be on market soon, which features 15 real-word applications on data mining with R. A preview of the book is available on Google Books. R code, data and color figures for the book can be downloaded at RDataMining.com. Below is its table of contents. Foreword Graham Williams Chapter 1 Pow...

2649 sym 4 img

New book release: Data Mining Applications with R

23.12.2013

Book title: Data Mining Applications with R Editors: Yanchang Zhao, Yonghua Cen Publisher: Elsevier Publish date: December 2013 ISBN: 978-0-12-411511-8 Length: 514 pages URL: http://www.rdatamining.com/books/dmar An edited book titled Data Mining Applications with R was released in December 2013, which features 15 real-word applications on data m...

3216 sym 4 img

Multidimensional Scaling (MDS) with R

02.05.2014

This page shows Multidimensional Scaling (MDS) with R. It demonstrates with an example of automatic layout of Australian cities based on distances between them. The layout obtained with MDS is very close to their locations on a map. At first, the data of distances between 8 city in Australia are loaded from http://rosetta.reltech.org/TC/v15/Mappi...

1370 sym R (1614 sym/7 pcs) 10 img

CFP: AusDM 2014 – the 12th Australasian Data Mining Conference

13.05.2014

********************************************************* 12th Australasian Data Mining Conference (AusDM 2014) Brisbane, Australia 27-28 November 2014 http://ausdm14.ausdm.org/ ********************************************************* Data Mining is the art and science of intelligent analysis of (usually big) data sets for meaningful insights. D...

7216 sym 4 img

A Coursera course on Machine Learning starts on 16 June

21.05.2014

A 10-week course on Machine Learning by Andrew Ng from Stanford University will start on Coursera on 16 June. Below are descriptions of the course picked up from Coursera. The course provides a broad introduction to machine learning, data mining, and statistical pattern recognition. Topics include: (i) Supervised learning (parametric/non-parametr...

1404 sym 4 img

A Sequence of 9 Courses on Data Science Starts on Coursera on 2 June and 7 July 2014

26.05.2014

A sequence of 9 courses on Data Science will start on Coursera on 2 June and 7 July 2014, to be lectured by(Associate/Assistant) Professors of Johns Hopkins University. The courses are designed for students to learn to become Data Scientists and apply their skills in a capstone project. You can take the courses for free. However, if you want to g...

6042 sym 4 img

Step-by-Step Guide to Setting Up an R-Hadoop System

30.05.2014

by Yanchang Zhao RDataMining.com Following my first R-Hadoop system setup guide written in Sept 2013, I have further tested setting up a Hadoop system for running R code, as well as using HBase. I have tested it both on a single computer and on a cluster of computers. The process is described in a newer version of guide to setting up an R-Hadoop ...

2781 sym 4 img

Currency Exchange Rate Forecasting with ARIMA and STL

03.07.2014

I have made an example of time series forecasting with R, demonstrating currency exchange rate forecasting with the ARIMA and STL models. The example is easy to understand and follow. R source files are provided to run the example. The example was produced with R Markdown. If you want to learn R Markdown, you can try the Rmd source file, which is...

867 sym 6 img