QA: How can I use an object in my global environment to create a new variable in an .xdf file?

Use the userObjects argument along with the transforms argument to create new variables from objects in your global environment (or other environments in your current search path). 

For example, suppose you would like to estimate a linear model using wage income as the dependent variable, and want to include state-level of per capita expenditure on education as one of the independent variables. We can define a named vector to contain this state-level data as follows: 

educExp <- c(Connecticut=1795.57, Washington=1170.46, Indiana = 1289.66) 

We can then use rxDataStepXdf to add the per capita education expenditure as a new variable using the transforms argument, passing educExp to the userObjects argument as a named list: 

censusWorkers <- file.path(rxGetOption("sampleDataDir"), "censusWorkers.xdf") rxDataStepXdf(inFile = censusWorkers, outFile = "censusWorkersWithEduc", transforms=list(stateEducExpPC=educExp[match(state, names(educExp))]), transformVars="state", userObjects=list(educExp=educExp)) 

The rxGetInfoXdf function reveals the added variable: 

rxGetInfo("censusWorkersWithEduc.xdf",getVarInfo=TRUE) 

This example and other are included in the RevoScaleR User's Guide, accessed by selecting Help - R Manuals (PDF) from the Revolution R Enterprise menu. 
Note This is a "FAST PUBLISH" article created directly from within the Microsoft support organization. The information contained herein is provided as-is in response to emerging issues. As a result of the speed in making it available, the materials may include typographical errors and may be revised at any time without notice. See Terms of Use for other considerations.
Vlastnosti

ID článku: 3104270 - Poslední kontrola: 10/29/2015 08:46:00 - Revize: 1.0

Revolution Analytics

  • KB3104270
Váš názor