org.apache.hadoop.mapred

Sub Packages:

org.apache.hadoop.mapred.jobcontrol   Utilities for managing dependent jobs.  
org.apache.hadoop.mapred.lib   Library of generally useful mappers, reducers, and partitioners.  
org.apache.hadoop.mapred.lib.aggregate   Classes for performing various counting and aggregations.  
org.apache.hadoop.mapred.pipes   Hadoop Pipes allows C++ code to use Hadoop DFS and map/reduce.  

Interfaces:

BufferSorter   This class provides a generic sort interface that should be implemented by specific sort algorithms.  code | html
InputFormat   An input data format.  code | html
InputSplit   The description of the data for a single map task.  code | html
InterTrackerProtocol   Protocol that a TaskTracker and the central JobTracker use to communicate.  code | html
JobConfigurable   Something that may be configured.  code | html
JobHistory.Listener   Callback interface for reading back log events from JobHistory.  code | html
JobSubmissionProtocol   Protocol that a JobClient and the central JobTracker use to communicate.  code | html
MRConstants   Some handy constants.  code | html
MapRunnable   Expert: Permits greater control of map processing.  code | html
MapTask.MapOutputCollector     code | html
Mapper   Maps input key/value pairs to a set of intermediate key/value pairs.  code | html
OutputCollector   Passed to Mapper and Reducer implementations to collect output data.  code | html
OutputFormat   An output data format.  code | html
Partitioner   Partitions the key space. A partition is created for each reduce task.  code | html
RecordReader   Reads key/value pairs from an input file FileSplit.  code | html
RecordWriter   Writes key/value pairs to an output file.  code | html
Reducer   Reduces a set of intermediate values which share a key to a smaller set of values.  code | html
Reporter   Passed to application code to permit alteration of status.  code | html
RunningJob   Includes details on a running MapReduce job.  code | html
SequenceFileInputFilter.Filter   Filter interface.  code | html
TaskUmbilicalProtocol   Protocol that task child process uses to contact its parent process.  code | html
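
The Partitioner entry above says that a partition is created for each reduce task. The sketch below illustrates one common way to assign keys to partitions: hash the key and take the result modulo the number of reduce tasks. It is a self-contained illustration, not the Hadoop interface itself; the class name PartitionSketch and the methods partitionFor and partition are hypothetical names chosen for this example.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration only; this does not implement or import
// org.apache.hadoop.mapred.Partitioner.
final class PartitionSketch {

    // Maps a key to a partition index in the range [0, numReduceTasks).
    // Masking with Integer.MAX_VALUE clears the sign bit, so a negative
    // hash code can never produce a negative index.
    static int partitionFor(Object key, int numReduceTasks) {
        if (numReduceTasks <= 0) {
            throw new IllegalArgumentException("numReduceTasks must be positive");
        }
        return (key.hashCode() & Integer.MAX_VALUE) % numReduceTasks;
    }

    // Distributes keys into one bucket per reduce task.
    static List<List<String>> partition(List<String> keys, int numReduceTasks) {
        List<List<String>> buckets = new ArrayList<>();
        for (int i = 0; i < numReduceTasks; i++) {
            buckets.add(new ArrayList<>());
        }
        for (String key : keys) {
            buckets.get(partitionFor(key, numReduceTasks)).add(key);
        }
        return buckets;
    }

    public static void main(String[] args) {
        List<String> keys = Arrays.asList("apple", "banana", "cherry", "apple");
        List<List<String>> buckets = partition(keys, 3);
        for (int i = 0; i < buckets.size(); i++) {
            System.out.println("reduce task " + i + ": " + buckets.get(i));
        }
    }
}
```

Because the index depends only on the key's hash code, every occurrence of the same key lands in the same bucket, which is what allows a single reduce task to see all values that share a key.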

Abstract Classes:

BasicTypeSorterBase   This class implements the sort interface using primitive int arrays as the data structures (that is why this class is called 'BasicType'SorterBase).  code | html
FileInputFormat   A base class for InputFormat.  code | html
InputFormatBase   A base class for InputFormat.  code | html
MultiFileInputFormat   An abstract InputFormat that returns MultiFileSplits from its getSplits(JobConf, int) method.  code | html
OutputFormatBase   A base class for OutputFormat.  code | html
SequenceFileInputFilter.FilterBase   Base class for Filters.  code | html
TaskRunner   Base class that runs a task in a separate process.  code | html
TaskTrackerAction   A generic directive from the org.apache.hadoop.mapred.JobTracker to the org.apache.hadoop.mapred.TaskTracker to take some 'action'.  code | html

Classes:

JobClient.TaskStatusFilter     code | html
JobHistory.Keys   Job history files contain key="value" pairs, where keys belong to this enum.  code | html
JobHistory.RecordTypes   Record types are identifiers for each line of log in history files.  code | html
JobHistory.Values   This enum contains some of the values commonly used by history log events.  code | html
JobInProgress.Counter     code | html
JobPriority   Used to describe the priority of the running job.  code | html
TaskCompletionEvent.Status     code | html
TaskLog.LogName   The filter for userlogs.  code | html
TaskStatus.Phase     code | html
TaskStatus.State     code | html
TaskTracker.State     code | html
TaskTrackerAction.ActionType   Enumeration of various 'actions' that the JobTracker directs the TaskTracker to perform periodically.  code | html
ChukwaJobTrackerInstrumentation     code | html
ClusterStatus   Summarizes the size and current state of the cluster.  code | html
Counters   A set of named counters.  code | html
Counters.CounterRec   A counter record, comprising its name and value.  code | html
Counters.Group   Represents a group of counters, comprising the counters from a particular counter enum class.  code | html
DefaultJobHistoryParser   Default parser for job history files.  code | html
DefaultJobHistoryParser.FailedOnNodesFilter     code | html
DefaultJobHistoryParser.JobTasksParseListener   Listener for Job's history log file, it populates JobHistory.JobInfo object with data from log file.  code | html
DefaultJobHistoryParser.KilledOnNodesFilter     code | html
DefaultJobHistoryParser.MasterIndex   Contents of a job history file.  code | html
DefaultJobHistoryParser.MasterIndexParseListener   Parses and returns a map of values in master index.  code | html
DisallowedTaskTrackerException   This exception is thrown when a tasktracker tries to register or communicate with the jobtracker when it does not appear on the list of included nodes, or has been specifically excluded.  code | html
FileAlreadyExistsException   Used when target file already exists for any operation and is not configured to be overwritten.  code | html
FileSplit   A section of an input file.  code | html
HeartbeatResponse   The response sent by the JobTracker to the heartbeat sent periodically by the TaskTracker.  code | html
InvalidFileTypeException   Used when file type differs from the desired file type.  code | html
InvalidInputException   This class wraps a list of problems with the input, so that the user can get a list of problems together instead of finding and fixing them one by one.  code | html
InvalidJobConfException   This exception is thrown when the jobconf is missing some mandatory attributes or the value of some attribute is invalid.  code | html
IsolationRunner     code | html
IsolationRunner.FakeUmbilical     code | html
JobClient   JobClient interacts with the JobTracker network interface.  code | html
JobClient.NetworkedJob   A NetworkedJob is an implementation of RunningJob.  code | html
JobClient.RawSplit     code | html
JobConf   A map/reduce job configuration.  code | html
JobEndNotifier     code | html
JobEndNotifier.JobEndStatusInfo     code | html
JobHistory   Provides methods for writing to and reading from job history.  code | html
JobHistory.HistoryCleaner   Delete history files older than one month.  code | html
JobHistory.JobInfo   Helper class for logging or reading back events related to job start, finish or failure.  code | html
JobHistory.KeyValuePair   Base class containing utilities to manage typed key/value pairs with enums.  code | html
JobHistory.MapAttempt   Helper class for logging or reading back events related to start, finish or failure of a Map Attempt on a node.  code | html
JobHistory.ReduceAttempt   Helper class for logging or reading back events related to start, finish or failure of a Reduce Attempt on a node.  code | html
JobHistory.Task   Helper class for logging or reading back events related to Task's start, finish or failure.  code | html
JobHistory.TaskAttempt   Base class for Map and Reduce TaskAttempts.  code | html
JobInProgress     code | html
JobProfile   A JobProfile is a MapReduce primitive.  code | html
JobStatus   Describes the current status of a job.  code | html
JobTracker   JobTracker is the central location for submitting and tracking MR jobs in a network environment.  code | html
JobTracker.ExpireLaunchingTasks   A thread to timeout tasks that have been assigned to task trackers, but that haven't reported back yet.  code | html
JobTracker.ExpireTrackers     code | html
JobTracker.JobInitThread     code | html
JobTracker.JobTrackerMetrics     code | html
JobTracker.RetireJobs     code | html
KeyValueLineRecordReader   This class treats a line in the input as a key/value pair separated by a separator character.  code | html
KeyValueTextInputFormat   An InputFormat for plain text files.  code | html
KillJobAction   Represents a directive from the org.apache.hadoop.mapred.JobTracker to the org.apache.hadoop.mapred.TaskTracker to kill the task of a job and cleanup resources.  code | html
KillTaskAction   Represents a directive from the org.apache.hadoop.mapred.JobTracker to the org.apache.hadoop.mapred.TaskTracker to kill a task.  code | html
LaunchTaskAction   Represents a directive from the org.apache.hadoop.mapred.JobTracker to the org.apache.hadoop.mapred.TaskTracker to launch a new task.  code | html
LineRecordReader   Treats keys as offset in file and value as line.  code | html
LineRecordReader.TextStuffer   Provide a bridge to get the bytes from the ByteArrayOutputStream without creating a new byte array.  code | html
LocalJobRunner   Implements MapReduce locally, in-process, for debugging.  code | html
LocalJobRunner.Job     code | html
MRSortResultIterator     code | html
MRSortResultIterator.InMemUncompressedBytes     code | html
MapFileOutputFormat   An OutputFormat that writes MapFiles.  code | html
MapOutputFile   Manipulate the working area for the transient store for maps and reduces.  code | html
MapOutputLocation   The location of a map output file, as passed to a reduce task via the InterTrackerProtocol.  code | html
MapReduceBase   Base class for Mapper and Reducer implementations.  code | html
MapRunner   Default MapRunnable implementation.  code | html
MapTask   A Map task.  code | html
MapTask.DirectMapOutputCollector     code | html
MapTask.MapOutputBuffer     code | html
MapTask.MapOutputBuffer.CombineValuesIterator     code | html
MapTaskRunner   Runs a map task.  code | html
MergeSorter   This class implements the sort method from BasicTypeSorterBase class as MergeSort.  code | html
MultiFileSplit   A sub-collection of input files.  code | html
PhasedFileSystem   This class acts as a proxy to the actual file system being used.  code | html
PhasedFileSystem.FileInfo     code | html
ReduceTask   A Reduce task.  code | html
ReduceTask.ReduceCopier     code | html
ReduceTask.ReduceCopier.CopyResult   Represents the result of an attempt to copy a map output.  code | html
ReduceTask.ReduceCopier.InMemFSMergeThread     code | html
ReduceTask.ReduceCopier.MapOutputCopier   Copies map outputs as they become available.  code | html
ReduceTask.ReduceCopier.ShuffleClientMetrics   This class contains the methods that should be used for metrics-reporting the specific metrics for shuffle.  code | html
ReduceTask.ReduceValuesIterator     code | html
ReduceTask.ValuesIterator   Iterates values while keys match in sorted input.  code | html
ReduceTaskRunner   Runs a reduce task.  code | html
ReinitTrackerAction   Represents a directive from the org.apache.hadoop.mapred.JobTracker to the org.apache.hadoop.mapred.TaskTracker to reinitialize itself.  code | html
SequenceFileAsTextInputFormat   This class is similar to SequenceFileInputFormat, except it generates SequenceFileAsTextRecordReader which converts the input keys and values to their String forms by calling toString() method.  code | html
SequenceFileAsTextRecordReader   This class converts the input keys and values to their String forms by calling toString() method.  code | html
SequenceFileInputFilter   A class that allows a map/reduce job to work on a sample of sequence files.  code | html
SequenceFileInputFilter.FilterRecordReader     code | html
SequenceFileInputFilter.MD5Filter   This class returns a set of records by examining the MD5 digest of its key against a filtering frequency f.  code | html
SequenceFileInputFilter.PercentFilter   This class returns a percentage of records. The percentage is determined by a filtering frequency f using the criterion record# % f == 0.  code | html
SequenceFileInputFilter.RegexFilter   Filters records by matching the key against a regex.  code | html
SequenceFileInputFormat   An InputFormat for SequenceFiles.  code | html
SequenceFileOutputFormat   An OutputFormat that writes SequenceFiles.  code | html
SequenceFileRecordReader   A RecordReader for SequenceFiles.  code | html
StatusHttpServer   Create a Jetty embedded server to answer http requests.  code | html
StatusHttpServer.StackServlet   A very simple servlet to serve up a text representation of the current stack traces.  code | html
TaskCompletionEvent   This is used to track task completion events on job tracker.  code | html
TaskInProgress     code | html
TaskLog   A simple logger to handle the task-specific user logs.  code | html
TaskLog.Reader     code | html
TaskLog.TaskLogsPurgeFilter     code | html
TaskLogAppender   A simple log4j-appender for the task child's map-reduce system logs.  code | html
TaskLogServlet   A servlet that is run by the TaskTrackers to provide the task logs via http.  code | html
TaskReport   A report on the state of a task.  code | html
TaskStatus   Describes the current status of a task.  code | html
TaskTracker   TaskTracker is a process that starts and tracks MR Tasks in a networked environment.  code | html
TaskTracker.Child   The main() for child processes.  code | html
TaskTracker.FetchStatus     code | html
TaskTracker.MapEventsFetcherThread     code | html
TaskTracker.MapOutputServlet   This class is used in TaskTracker's Jetty to serve the map outputs to other nodes.  code | html
TaskTracker.RunningJob   The data structure for initializing a job.  code | html
TaskTracker.ShuffleServerMetrics   This class contains the methods that should be used for metrics-reporting the specific metrics for shuffle.  code | html
TaskTracker.TaskInProgress     code | html
TaskTracker.TaskTrackerMetrics     code | html
TaskTrackerStatus   A TaskTrackerStatus is a MapReduce primitive.  code | html
TextInputFormat   An InputFormat for plain text files.  code | html
TextOutputFormat   An OutputFormat that writes plain text files.  code | html
TextOutputFormat.LineRecordWriter     code | html
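
The SequenceFileInputFilter.PercentFilter entry in the list above describes sampling by the test record# % f == 0. The sketch below shows that test in isolation. It is a minimal illustration under stated assumptions, not the Hadoop class: the name PercentFilterSketch, the accept and sample methods, and the choice to number records from 0 are all hypothetical.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration only; this does not use or reproduce
// org.apache.hadoop.mapred.SequenceFileInputFilter.PercentFilter.
final class PercentFilterSketch {

    private final int frequency;   // the filtering frequency f
    private long recordCount = 0;  // assumption: records are numbered from 0

    PercentFilterSketch(int frequency) {
        if (frequency <= 0) {
            throw new IllegalArgumentException("frequency must be positive");
        }
        this.frequency = frequency;
    }

    // Returns true when the current record number satisfies record# % f == 0,
    // then advances to the next record number.
    boolean accept() {
        boolean keep = (recordCount % frequency) == 0;
        recordCount++;
        return keep;
    }

    // Applies the filter to a list of records and returns the retained ones.
    static List<String> sample(List<String> records, int frequency) {
        PercentFilterSketch filter = new PercentFilterSketch(frequency);
        List<String> kept = new ArrayList<>();
        for (String record : records) {
            if (filter.accept()) {
                kept.add(record);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        List<String> records = Arrays.asList(
                "r0", "r1", "r2", "r3", "r4", "r5", "r6", "r7", "r8", "r9");
        System.out.println(sample(records, 5));
    }
}
```

With a frequency of f, roughly one record in every f is retained, so a larger f yields a smaller sample.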