Package org.duracloud.services.hadoop.base

Class Summary
AltTextOutputFormat<K,V> Overrides the text output format to allow the output directory to exist and include a DuraCloud space properties file.
Constants  
FileCopier This class can be run as a separate thread to copy a file either from the local filesystem to a remote path, or from a remote path to the local filesystem.
InitParamParser Parses the initialization parameters for the Hadoop job.
JobBuilder This class constructs a Hadoop job to process files.
JobRunner This is the main point of entry for the Hadoop file processing application.
ProcessFileMapper Mapper used to process files.
ProcessResult Stores the result of file processing.
ResultsReducer Reducer used to collect processing output.
SimpleFileRecordReader Record reader, used to provide a set of key/value pairs for each file in a file split.
WholeFileInputFormat Input format that never splits files and uses the SimpleFileRecordReader to produce key/value pairs based only on file path.
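
The FileCopier entry describes a copy task that can run on its own thread. The following is a minimal sketch of that pattern, not the actual DuraCloud source: the class name FileCopierSketch, its constructor, and getFailure() are hypothetical, and both paths are local files here, whereas the real class may copy to or from a remote filesystem.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;

// Hypothetical illustration only; this is not the DuraCloud FileCopier class.
final class FileCopierSketch implements Runnable {
    private final Path source;
    private final Path destination;
    // Holds any copy error so the launching thread can inspect it after join().
    private volatile IOException failure;

    FileCopierSketch(Path source, Path destination) {
        this.source = source;
        this.destination = destination;
    }

    @Override
    public void run() {
        try {
            // Overwrite the destination if it already exists.
            Files.copy(source, destination, StandardCopyOption.REPLACE_EXISTING);
        } catch (IOException e) {
            failure = e;
        }
    }

    IOException getFailure() {
        return failure;
    }

    public static void main(String[] args) throws Exception {
        Path src = Files.createTempFile("copier-src", ".txt");
        Path dst = Files.createTempFile("copier-dst", ".txt");
        Files.write(src, "hello".getBytes(StandardCharsets.UTF_8));

        // Run the copy on a separate thread and wait for it to finish.
        FileCopierSketch copier = new FileCopierSketch(src, dst);
        Thread worker = new Thread(copier);
        worker.start();
        worker.join();

        if (copier.getFailure() != null) {
            throw copier.getFailure();
        }
        System.out.println(new String(Files.readAllBytes(dst), StandardCharsets.UTF_8));
    }
}
```

Recording the exception in a field, rather than letting it escape run(), lets the caller decide how to handle a failed copy once the worker thread has been joined.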
 



Copyright © 2009-2011 DuraSpace. All Rights Reserved.