How To: How do I compute group means for a number of variables in an .xdf file?

The easiest way to do this is to use the 'rowSelection' argument. Here is a short example, that uses it with the RevoScaleR sample dataset 'CensusWorkers.xdf': 

inFile <- file.path(rxGetOption("sampleDataDir"), "CensusWorkers") rxCube(incwage ~ sex : state, data = inFile, rowSelection = age >=30 & age <=50 & wkswork1 > 40, means=TRUE)
   
Note This is a "FAST PUBLISH" article created directly from within the Microsoft support organization. The information contained herein is provided as-is in response to emerging issues. As a result of the speed in making it available, the materials may include typographical errors and may be revised at any time without notice. See Terms of Use for other considerations.
Properties

Article ID: 3104265 - Last Review: 10/29/2015 08:24:00 - Revision: 1.0

Revolution Analytics

  • KB3104265
Feedback