How To: How do I compute group means for a number of variables in an .xdf file?

The easiest way to do this is to use the 'rowSelection' argument. Here is a short example, that uses it with the RevoScaleR sample dataset 'CensusWorkers.xdf': 

inFile <- file.path(rxGetOption("sampleDataDir"), "CensusWorkers") 
rxCube(incwage ~ sex : state, data = inFile, rowSelection = age >=30 & age <=50 & wkswork1 > 40, means=TRUE)

Need more help?

Expand your skills
Explore Training
Get new features first
Join Microsoft Insiders

Was this information helpful?

Thank you for your feedback!

Thank you for your feedback! It sounds like it might be helpful to connect you to one of our Office support agents.