Applies To
Revolution Analytics

Hadoop Sort / Merge / By-Group ProcessingWorkaroundsEven if you pre-sort in Hadoop and then import to RRE there is no guarantee that the splits will contain whole by-groups or be processed in the correct order. Hence the options narrow to CSV input and:1)            Hive or Pig for Sort, merge, and by-group processing. 2)            rmr2 or plyrmr for by-group processing in R.   

Need more help?

Want more options?

Explore subscription benefits, browse training courses, learn how to secure your device, and more.