What are the specific configuration requirements for distibuted computing on LSF clusters?

Both Platform LSF and Platofrm MPI needs to be installed on the cluster.

Revolution R needs to be installed on every node of the cluster.

You need to setup a common share directory on each node of your cluster that will be used in running 
R jobs. Every LSF user needs to be able to read and write to this directory and each user should 
have their own subdirectory to hold information from running jobs.

You need to be able to 'ssh' from each node of the cluster to every other node in the cluster.
You will need to setup either password-less ssh or key-based authentication on the cluster. 
Authentication can be either domain, key or host based.

For more information on how to set this up, please consult this IBM KB article:

Your data used in R computations must be available on all of the nodes.
Note This is a "FAST PUBLISH" article created directly from within the Microsoft support organization. The information contained herein is provided as-is in response to emerging issues. As a result of the speed in making it available, the materials may include typographical errors and may be revised at any time without notice. See Terms of Use for other considerations.

Article ID: 3104137 - Last Review: 11/01/2015 02:00:00 - Revision: 1.0

Revolution Analytics

  • KB3104137