Available Data sources for Distributed compute contexts

In the local compute context, all of RevoScaleR’s supported data sources are available to you. In a distributed compute context, however, your choice of data sources may be severely limited.
The most extreme case is the RxInTeradata compute context, which supports only the RxTeradata data source—this makes sense, as the computations are being performed on data inside the Teradata database.

The following table shows the available combinations of compute contexts and data sources (x indicates available):
Compute Context →

Data Source↓
RxLocalSeq/ParallelRxHpcServerRxLsfClusterRxHadoopMRRxInTeradata
Delimited Text (RxTextData)xxxx 
Fixed-Format Text (RxTextData)xxx  
.xdf data files (RxXdfData)xxxx 
SAS data files (RxSasData)xxx  
SPSS data files (RxSpssData)xxx  
ODBC data (RxOdbcData)xxx  
Teradata database (RxTeradata)xxx x
For more information - please review the RevoScaleR Distributed Computing Guide
Note This is a "FAST PUBLISH" article created directly from within the Microsoft support organization. The information contained herein is provided as-is in response to emerging issues. As a result of the speed in making it available, the materials may include typographical errors and may be revised at any time without notice. See Terms of Use for other considerations.
Properties

Article ID: 3104237 - Last Review: 10/29/2015 07:15:00 - Revision: 1.0

Revolution Analytics

  • KB3104237
Feedback