QA: Comparing RevoScaleR and MapReduce

How does the distributed computing functionality in RevoScaleR differ from MapReduce? 

Both RevoScaleR and MapReduce provide a framework for analyzing large data sets in R. RevoScaleR provides large data computing independent of whether you are working on a cluster; it contains external memory algorithms that process the data in chunks. Beginning with Revolution R Enterprise 5.0, RevoScaleR also provides a framework that allows one to use resources on an artibrary set of nodes on a cluster with any standard R function.   
Note This is a "FAST PUBLISH" article created directly from within the Microsoft support organization. The information contained herein is provided as-is in response to emerging issues. As a result of the speed in making it available, the materials may include typographical errors and may be revised at any time without notice. See Terms of Use for other considerations.

Article ID: 3104267 - Last Review: 10/29/2015 08:39:00 - Revision: 1.0

Revolution Analytics

  • KB3104267