For example, suppose you would like to estimate a linear model using wage income as the dependent variable, and want to include state-level per capita expenditure on education as one of the independent variables. You can define a named vector to contain this state-level data as follows:
educExp <- c(Connecticut=1795.57, Washington=1170.46, Indiana = 1289.66)
We can then use rxDataStepXdf to add the per capita education expenditure as a new variable using the transforms argument, passing educExp to the userObjects argument as a named list:
censusWorkers <- file.path(rxGetOption("sampleDataDir"), "censusWorkers.xdf")
rxDataStepXdf(inFile = censusWorkers, outFile = "censusWorkersWithEduc",
    transforms = list(stateEducExpPC = educExp[match(state, names(educExp))]),
    transformVars = "state",
    userObjects = list(educExp = educExp))
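The lookup inside the transforms expression can be tried in plain R, without RevoScaleR. The following is a minimal sketch, assuming the state variable arrives as a factor; the state values shown, including "Oregon", are hypothetical and chosen only to illustrate a state that has no entry in educExp:

```r
# Named vector of per capita education expenditure, as defined above
educExp <- c(Connecticut = 1795.57, Washington = 1170.46, Indiana = 1289.66)

# Hypothetical chunk of the state variable (illustration only, not the sample data)
state <- factor(c("Indiana", "Connecticut", "Oregon", "Washington"))

# match() compares the factor's labels to the vector's names and returns
# the position of each state in educExp, or NA when there is no match
idx <- match(state, names(educExp))

# Index by position to get one expenditure value per row
stateEducExpPC <- educExp[idx]
```

Here idx is 3, 1, NA, 2, so the unmatched state receives a missing value rather than an error. Matching on names is used instead of indexing with educExp[state] because R indexes by a factor's integer codes, not its labels, which could silently assign the wrong expenditure to a state.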
The rxGetInfoXdf function reveals the added variable:
This example and others are included in the RevoScaleR User's Guide, which you can open by selecting Help - R Manuals (PDF) from the Revolution R Enterprise menu.
Article ID: 3104270 - Last Review: 29 Oct 2015 - Revision: 1