gov.llnl.ontology.mapreduce.ingest
Class ExtractNounPairsMR

java.lang.Object
  extended by org.apache.hadoop.conf.Configured
      extended by gov.llnl.ontology.mapreduce.CorpusTableMR
          extended by gov.llnl.ontology.mapreduce.ingest.ExtractNounPairsMR
All Implemented Interfaces:
org.apache.hadoop.conf.Configurable, org.apache.hadoop.util.Tool

public class ExtractNounPairsMR
extends CorpusTableMR

Author:
Keith Stevens

Nested Class Summary
static class ExtractNounPairsMR.ExtractNounPairReducer
           
static class ExtractNounPairsMR.ExtractNounPairsMapper
           
 
Nested classes/interfaces inherited from class gov.llnl.ontology.mapreduce.CorpusTableMR
CorpusTableMR.CorpusTableMapper<K,V>
 
Field Summary
static String ACCEPTOR
          The configuration key for the DependencyPathAcceptor.
static String EVIDENCE
          The configuration key for the EvidenceTable.
static String PATH_LENGTH
          The configuration key for the maximum valid dependency path length.
 
Fields inherited from class gov.llnl.ontology.mapreduce.CorpusTableMR
CONF_PREFIX, TABLE
 
Constructor Summary
ExtractNounPairsMR()
           
 
Method Summary
protected  void addOptions(MRArgOptions options)
          Adds further command-line arguments.
protected  String jobName()
          Returns a descriptive job name for this map reduce task.
static void main(String[] args)
          Runs the ExtractNounPairsMR.
protected  Class mapperClass()
          Returns the Class object for the Mapper task.
protected  Class mapperKeyClass()
          Returns the Class object for the Mapper Key of this task.
protected  Class mapperValueClass()
          Returns the Class object for the Mapper Value of this task.
protected  void setupConfiguration(MRArgOptions options, org.apache.hadoop.conf.Configuration conf)
          Copies command-line arguments to a Configuration so that Map/Reduce jobs can use them.
protected  void setupReducer(String tableName, org.apache.hadoop.mapreduce.Job job, MRArgOptions options)
          Sets up the Reducer for this job.
 
Methods inherited from class gov.llnl.ontology.mapreduce.CorpusTableMR
run, validateOptions
 
Methods inherited from class org.apache.hadoop.conf.Configured
getConf, setConf
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 
Methods inherited from interface org.apache.hadoop.conf.Configurable
getConf, setConf
 

Field Detail

ACCEPTOR

public static final String ACCEPTOR
The configuration key for the DependencyPathAcceptor.


PATH_LENGTH

public static final String PATH_LENGTH
The configuration key for the maximum valid dependency path length.


EVIDENCE

public static final String EVIDENCE
The configuration key for the EvidenceTable.
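
The keys above name settings that the job stores in its configuration and that tasks read back later. As a rough sketch of how a maximum path length might be read and applied, the following uses a plain Map<String, String> as a stand-in for Hadoop's Configuration. The key string, the default of 5, and the class and method names are all hypothetical; this page does not document the real values or the actual filtering logic.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class PathLengthSketch {
    // Hypothetical key string; the real value of PATH_LENGTH is not documented here.
    public static final String PATH_LENGTH = "example.extractNounPairs.pathLength";

    // Assumed fallback used when the key is absent from the configuration.
    public static final int DEFAULT_MAX_LENGTH = 5;

    /** Reads the maximum path length from the stand-in configuration. */
    public static int maxPathLength(Map<String, String> conf) {
        String raw = conf.get(PATH_LENGTH);
        return raw == null ? DEFAULT_MAX_LENGTH : Integer.parseInt(raw);
    }

    /** Keeps only the paths that contain at most maxLength links. */
    public static List<List<String>> filterByLength(List<List<String>> paths,
                                                    int maxLength) {
        List<List<String>> kept = new ArrayList<>();
        for (List<String> path : paths) {
            if (path.size() <= maxLength) {
                kept.add(path);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        Map<String, String> conf = new HashMap<>();
        conf.put(PATH_LENGTH, "2");

        // Each path is a list of dependency relation labels (illustrative data).
        List<List<String>> paths = new ArrayList<>();
        paths.add(Arrays.asList("nsubj", "dobj"));
        paths.add(Arrays.asList("nsubj", "prep", "pobj"));

        System.out.println(filterByLength(paths, maxPathLength(conf)));
    }
}
```

In a real Hadoop task the same lookup would go through the Configuration attached to the task context rather than a Map.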

Constructor Detail

ExtractNounPairsMR

public ExtractNounPairsMR()
Method Detail

main

public static void main(String[] args)
                 throws Exception
Runs the ExtractNounPairsMR.

Throws:
Exception

jobName

protected String jobName()
Returns a descriptive job name for this map reduce task.

Overrides:
jobName in class CorpusTableMR

mapperClass

protected Class mapperClass()
Returns the Class object for the Mapper task.

Specified by:
mapperClass in class CorpusTableMR

mapperKeyClass

protected Class mapperKeyClass()
Returns the Class object for the Mapper Key of this task. By default this returns ImmutableBytesWritable.

Overrides:
mapperKeyClass in class CorpusTableMR

mapperValueClass

protected Class mapperValueClass()
Returns the Class object for the Mapper Value of this task. By default, this returns Put.

Overrides:
mapperValueClass in class CorpusTableMR
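
The three mapper methods above follow a template-method arrangement: mapperClass is specified by the parent and must be supplied, while mapperKeyClass and mapperValueClass override defaults. The sketch below illustrates that arrangement with plain Java classes only. BaseDriver, NounPairDriver, NounPairMapper and the returned types are hypothetical stand-ins, not the real CorpusTableMR or the types this class actually returns.

```java
public class MapperHooksSketch {
    /** Hypothetical stand-in for the base driver; not the real CorpusTableMR. */
    public abstract static class BaseDriver {
        // No default exists, so every subclass has to name its mapper.
        protected abstract Class<?> mapperClass();

        // Defaults that a subclass may override (placeholder types).
        protected Class<?> mapperKeyClass() {
            return byte[].class;
        }

        protected Class<?> mapperValueClass() {
            return Object.class;
        }

        /** Summarises the types the driver would register with the job. */
        public String describe() {
            return mapperClass().getSimpleName()
                + " emits <" + mapperKeyClass().getSimpleName()
                + ", " + mapperValueClass().getSimpleName() + ">";
        }
    }

    /** Placeholder mapper type used only for illustration. */
    public static class NounPairMapper { }

    /** Subclass that supplies the mapper and replaces both defaults. */
    public static class NounPairDriver extends BaseDriver {
        @Override
        protected Class<?> mapperClass() {
            return NounPairMapper.class;
        }

        @Override
        protected Class<?> mapperKeyClass() {
            return String.class;
        }

        @Override
        protected Class<?> mapperValueClass() {
            return String.class;
        }
    }

    public static void main(String[] args) {
        System.out.println(new NounPairDriver().describe());
    }
}
```

Because the base class only calls the hooks, a subclass can change the mapper output types without touching the job setup code.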

setupReducer

protected void setupReducer(String tableName,
                            org.apache.hadoop.mapreduce.Job job,
                            MRArgOptions options)
                     throws IOException
Description copied from class: CorpusTableMR
Sets up the Reducer for this job. By default, it is an IdentityTableReducer.

Overrides:
setupReducer in class CorpusTableMR
Throws:
IOException

addOptions

protected void addOptions(MRArgOptions options)
Adds further command-line arguments. By default, this adds no options.

Overrides:
addOptions in class CorpusTableMR

setupConfiguration

protected void setupConfiguration(MRArgOptions options,
                                  org.apache.hadoop.conf.Configuration conf)
Copies command-line arguments to a Configuration so that Map/Reduce jobs can use them. By default, this does no configuration.

Overrides:
setupConfiguration in class CorpusTableMR
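
Map and reduce tasks run in separate JVMs, so values parsed from the command line in the driver are only visible to them if they are copied into the job configuration first. A minimal sketch of that hand-off follows, assuming a Map<String, String> in place of both MRArgOptions and Configuration. The flags -a and -l, the key strings and the acceptor name are hypothetical and are not taken from this class.

```java
import java.util.HashMap;
import java.util.Map;

public class OptionsToConfSketch {
    // Hypothetical key strings; the real constant values are not documented here.
    public static final String ACCEPTOR = "example.extractNounPairs.acceptor";
    public static final String PATH_LENGTH = "example.extractNounPairs.pathLength";

    /** Parses "-flag value" pairs; a simple stand-in for an option parser. */
    public static Map<String, String> parseOptions(String[] args) {
        Map<String, String> options = new HashMap<>();
        for (int i = 0; i + 1 < args.length; i += 2) {
            options.put(args[i], args[i + 1]);
        }
        return options;
    }

    /** Copies the options that were supplied into the stand-in configuration. */
    public static void setupConfiguration(Map<String, String> options,
                                          Map<String, String> conf) {
        if (options.containsKey("-a")) {
            conf.put(ACCEPTOR, options.get("-a"));
        }
        if (options.containsKey("-l")) {
            conf.put(PATH_LENGTH, options.get("-l"));
        }
    }

    public static void main(String[] args) {
        String[] demoArgs = {"-a", "ExampleAcceptor", "-l", "4"};
        Map<String, String> conf = new HashMap<>();
        setupConfiguration(parseOptions(demoArgs), conf);
        System.out.println(conf.get(ACCEPTOR) + " " + conf.get(PATH_LENGTH));
    }
}
```

Options that were not supplied are left unset, so the reading side can fall back to its own defaults.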


Copyright © 2010-2011. All Rights Reserved.