Class Summary |
AltTextOutputFormat<K,V> |
Overrides the text output format to allow the output directory to already
exist and to include a DuraCloud space properties file. |
Constants |
|
FileCopier |
This class can be run as a separate thread to copy a file either from the
local filesystem to a remote path, or from a remote path to the local
filesystem. |
InitParamParser |
Parses the initialization parameters for the Hadoop job. |
JobBuilder |
This class constructs a Hadoop job to process files. |
JobRunner |
This is the main entry point for the Hadoop file processing application. |
ProcessFileMapper |
Mapper used to process files. |
ProcessResult |
Stores the result of file processing. |
ResultsReducer |
Reducer used to collect processing output. |
SimpleFileRecordReader |
Record reader, used to provide a set of key/value pairs for each file
in a file split. |
WholeFileInputFormat |
Input format that does not split files and uses the
SimpleFileRecordReader to produce key/value pairs based only on file path. |
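
The FileCopier entry above describes a copy task that can run on its own thread. The following is a minimal sketch of that idea only, not the project's implementation: the class name `FileCopierSketch`, its constructor, and the `getFailure` accessor are hypothetical, and both paths are assumed to be on the local filesystem (the real class also handles remote paths, which this sketch does not attempt).

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;

// Hypothetical illustration; names and behavior are assumptions,
// not taken from the actual FileCopier class.
class FileCopierSketch implements Runnable {
    private final Path source;
    private final Path destination;
    // Written by the copy thread, read by the caller after join().
    private volatile IOException failure;

    FileCopierSketch(Path source, Path destination) {
        this.source = source;
        this.destination = destination;
    }

    @Override
    public void run() {
        try {
            // Create the destination directory if it is missing.
            Path parent = destination.toAbsolutePath().getParent();
            if (parent != null) {
                Files.createDirectories(parent);
            }
            // Overwrite any existing file at the destination.
            Files.copy(source, destination, StandardCopyOption.REPLACE_EXISTING);
        } catch (IOException e) {
            // run() cannot throw checked exceptions, so record the error.
            failure = e;
        }
    }

    // Returns the error from the copy, or null if it succeeded
    // (or has not finished yet).
    IOException getFailure() {
        return failure;
    }
}
```

A caller would wrap the task in a `Thread`, call `start()`, and later `join()` before checking `getFailure()`. Recording the exception in a field is one way to surface errors from a `Runnable`; the actual class may report failures differently.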