QA: Why do the RevoScaleR analysis functions typically run faster the second time?

The operating system caches recently used data in RAM. On the second run, or on the second iteration of the algorithm used by an analysis function, access to the data is often much faster because some or all of the data is already cached in RAM.

The extent to which this is noticeable depends on the relative speed of the hard drive, the number of CPUs, the amount of RAM available, and the number of calculations performed in the CPU.
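
The caching effect described above can be observed outside RevoScaleR by timing repeated reads of the same file. The following is a minimal sketch in Python, not part of RevoScaleR; the function name time_reads and its parameters are hypothetical and chosen only for illustration. It reads a file from start to finish several times and records how long each pass takes.

```python
import sys
import time


def time_reads(path, repeats=2, chunk_size=1 << 20):
    """Read the whole file `repeats` times and return the seconds taken by each pass.

    Hypothetical helper for illustration; it only measures raw file reads.
    """
    timings = []
    for _ in range(repeats):
        start = time.perf_counter()
        with open(path, "rb") as f:
            # Read in fixed-size chunks until end of file; the data is discarded.
            while f.read(chunk_size):
                pass
        timings.append(time.perf_counter() - start)
    return timings


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage: python time_reads.py <path-to-large-data-file>
    for i, seconds in enumerate(time_reads(sys.argv[1]), start=1):
        print(f"pass {i}: {seconds:.3f} s")
```

To see a difference, point the script at a large data file that has not been read or written recently. A file that was just created is usually already in the cache, so both passes would be fast. If the file is larger than the available RAM, the second pass may show little or no improvement.
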

Article ID: 3104272 - Last Review: 10/29/2015 08:51:00 - Revision: 1.0

Revolution Analytics
