Publications by Yanchang Zhao
A nice short article on memory in R
There is a nice short article on memory issue in R at http://www.matthewckeller.com/html/memory.html. If you use R to process large data, you might find it helpful. It introduces: – checking how much memory an object is taking; – the memory limit of 32-bit R and 64-bit R; – package designed to store objects on hard drives rather than RAM; �...
1032 sym 16 img
Statistics with R – Lots of R Examples
“Statistics with R” is a great R graphics & stats website. It provides lots of R examples, covering many analytics topics. It is also available as a PDF document to download at the website, as well as the R codes. The table of contents of the document are as below. 1. Introduction to R 2. Programming in R 3. From Data to Graphics 4. Customizi...
1340 sym 16 img
A prize of US$3,000,000 for a data mining competition to improve healthcare
There is a data mining competition with a prize of $3,000,000. The target is to improve healthcare in US by identifying patients who will be admitted to a hospital within the next year, using historical claims data. The algorithm to develop is to predict how many days a patient will spend in a hospital in the next year, so that unnecessary hospit...
1038 sym 16 img
Experience on using R to build prediction models in business applications
By Yanchang zhao, RDataMining.com Building prediction/classification models is one of the most widely-seen data mining tasks in business applications. To share experience on building prediction models with R, I have started a discussion at RDataMining group on LinkedIn with the following questions. And my experience can be found at the end of que...
5053 sym 16 img
Call for chapters: Data Mining Applications with R
Data Mining Applications with R A book to be published by Elsevier http://www.RDataMining.com/books/book2 Proposal Submission Deadline: April 30, 2012 Introduction R is one of the most widely used data mining tools in scientific and business applications, among dozens of commercial and open-source data mining software. It is free and expandable w...
3194 sym 16 img
Social Network Analysis with R
By Yanchang Zhao, RDataMining.com If you have tried social network analysis or graph mining with R, you might have already come across package igraph before. The package is designed for graphs and network analysis in R. It can handle large graphs very well and provides functions for interactive graph plotting and many other useful functions. Ther...
2067 sym 18 img
Obama administration unveiled a Big Data Research and Development Initiative with $200 million
Yanchang Zhao, RDataMining.com Obama administration unveiled a Big Data Research and Development Initiative with $200 million on March 29, 2012, to improve the ability to extract knowledge and insights from large and complex collections of digital data. Six Federal departments and agencies are involved to to improve the tools and techniques neede...
2014 sym 16 img
R Tips: lots of tips for R programming
by Yanchang Zhao, RDataMining.com There are more than 100 R tips at http://pj.freefaculty.org/R/Rtips.html, which provide quick examples to small challenges in everyday R programming, especially for users switching from other languages to R. There is also a .PDF version for it at http://pj.freefaculty.org/R/Rtips.pdf. It presents short examples o...
1151 sym 16 img
2nd round of call for chapter proposals for book Data Mining Applications with R: due by 31 May
2nd CALL FOR CHAPTERS: proposals due by 31 May 2012 Data Mining Applications with R A book to be published by Elsevier http://www.RDataMining.com/books/book2 Introduction —————— R is one of the most widely used data mining tools in scientific and business applications, among dozens of commercial and open-source data mining software. I...
3222 sym 16 img
Online resources for handling big data and parallel computing in R
by Yanchang Zhao, RDataMining.com Compared with many other programming languages, such as C/C++ and Java, R is less efficient and consumes much more memory. Fortunately, there are some packages that enables parallel computing in R and also packages for processing big data in R without loading all data into RAM. I have collected some links to onli...
2814 sym 16 img