gov.llnl.ontology.mapreduce
Class CorpusTableMR.CorpusTableMapper<K,V>

java.lang.Object
  extended by org.apache.hadoop.mapreduce.Mapper<org.apache.hadoop.hbase.io.ImmutableBytesWritable,org.apache.hadoop.hbase.client.Result,KEYOUT,VALUEOUT>
      extended by org.apache.hadoop.hbase.mapreduce.TableMapper<K,V>
          extended by gov.llnl.ontology.mapreduce.CorpusTableMR.CorpusTableMapper<K,V>
Direct Known Subclasses:
DependencyOccurrenceCountMR.DependencyOccurrenceCountMapper, DisambiguateMR.DisambiguateMapper, ExtractNounPairsMR.ExtractNounPairsMapper, IngestCorpusMR.IngestCorpusMapper, OneLinePerDocExtractorMR.OneLinePerDocExtractorMRMapper, ParsedDocExtractorMR.ParsedDocExtractorMapper, ParseMR.ParseMapper, POSCountMR.POSCountMapper, SemEvalPrinter.SemEvalPrinterMapper, TagDocumentMR.TagDocumentMapper, TagNetworkMR.TagNetworkMapper, TagOccurrenceMR.TagOccurrenceMapper, TagWordStatsMR.TagWordStatsMapper, TermDocOccurrenceCountMR.TermDocOccurrenceCountMapper, TermDocumentCountMR.TermDocumentCountMapper, TokenCountMR.TokenCountMapper, WordOccurrenceCountMR.WordOccurrenceCountMapper, WordsiMR.WordsiDependencyMapper, WordsiMR.WordsiOccurrenceMapper
Enclosing class:
CorpusTableMR

public abstract static class CorpusTableMR.CorpusTableMapper<K,V>
extends org.apache.hadoop.hbase.mapreduce.TableMapper<K,V>

A simple base class for any CorpusTableMR job. Most subclasses need only implement map(ImmutableBytesWritable, Result, Context). Those that need more than just a CorpusTable should override setup(Context, Configuration) to create any additional data structures or data sources.
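The subclassing pattern described above can be sketched without a Hadoop or HBase dependency. Everything in this sketch is a hypothetical stand-in: Conf plays the role of Configuration, Context the role of Mapper.Context, Table the role of CorpusTable, and BaseMapper the role of CorpusTableMapper. The key "sketch.delimiter" and the TokenCounter class are invented for illustration, and the way the real class creates its CorpusTable is not shown on this page, so the table construction here is only a placeholder.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class CorpusMapperSketch {
    // Hypothetical stand-in for org.apache.hadoop.conf.Configuration.
    static class Conf {
        private final Map<String, String> values = new HashMap<>();
        void set(String key, String value) { values.put(key, value); }
        String get(String key, String fallback) { return values.getOrDefault(key, fallback); }
    }

    // Hypothetical stand-in for Mapper.Context: holds the configuration and collects output.
    static class Context {
        final Conf conf = new Conf();
        final List<String> output = new ArrayList<>();
        Conf getConfiguration() { return conf; }
        void write(String key, int value) { output.add(key + "=" + value); }
    }

    // Hypothetical stand-in for CorpusTable: here a "row" is simply its own text.
    static class Table {
        String text(String row) { return row; }
    }

    // Mirrors the documented shape of CorpusTableMapper, not its actual source.
    abstract static class BaseMapper {
        protected Table table;

        // Initializes the table, then hands control to the overridable hook.
        public void setup(Context context) {
            table = new Table(); // placeholder; the real initialization is not documented here
            setup(context, context.getConfiguration());
        }

        // Hook for additional data structures; does nothing by default.
        protected void setup(Context context, Conf conf) { }

        protected abstract void map(String row, Context context);
    }

    // A subclass that needs one extra setting besides the table.
    static class TokenCounter extends BaseMapper {
        private String delimiter;

        @Override
        protected void setup(Context context, Conf conf) {
            delimiter = conf.get("sketch.delimiter", " ");
        }

        @Override
        protected void map(String row, Context context) {
            context.write("tokens", table.text(row).split(delimiter).length);
        }
    }

    public static void main(String[] args) {
        Context context = new Context();
        TokenCounter mapper = new TokenCounter();
        mapper.setup(context);
        mapper.map("the quick brown fox", context);
        System.out.println(context.output); // prints [tokens=4]
    }
}
```

A subclass with no extra state would omit the two-argument setup override and implement only map.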


Nested Class Summary
 
Nested classes/interfaces inherited from class org.apache.hadoop.mapreduce.Mapper
org.apache.hadoop.mapreduce.Mapper.Context
 
Field Summary
protected  CorpusTable table
          The CorpusTable responsible for reading row data.
 
Constructor Summary
CorpusTableMR.CorpusTableMapper()
           
 
Method Summary
 void setup(org.apache.hadoop.mapreduce.Mapper.Context context)
          Initializes the CorpusTable for this CorpusTableMR.CorpusTableMapper and calls setup(Context, Configuration).
protected  void setup(org.apache.hadoop.mapreduce.Mapper.Context context, org.apache.hadoop.conf.Configuration conf)
          Sets up any additional data classes or information needed by the CorpusTableMR.CorpusTableMapper.
 
Methods inherited from class org.apache.hadoop.mapreduce.Mapper
cleanup, map, run
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

table

protected CorpusTable table
The CorpusTable responsible for reading row data.

Constructor Detail

CorpusTableMR.CorpusTableMapper

public CorpusTableMR.CorpusTableMapper()
Method Detail

setup

public void setup(org.apache.hadoop.mapreduce.Mapper.Context context)
           throws IOException,
                  InterruptedException
Initializes the CorpusTable for this CorpusTableMR.CorpusTableMapper and calls setup(Context, Configuration).

Overrides:
setup in class org.apache.hadoop.mapreduce.Mapper<org.apache.hadoop.hbase.io.ImmutableBytesWritable,org.apache.hadoop.hbase.client.Result,K,V>
Throws:
IOException
InterruptedException

setup

protected void setup(org.apache.hadoop.mapreduce.Mapper.Context context,
                     org.apache.hadoop.conf.Configuration conf)
              throws IOException,
                     InterruptedException
Sets up any additional data classes or information needed by the CorpusTableMR.CorpusTableMapper. By default, this does nothing.

Throws:
IOException
InterruptedException
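
The relationship between the two setup methods can be illustrated with a small stand-alone sketch. It assumes that the table is initialized before the hook is called, as the wording of the one-argument setup description suggests; this page does not show the actual source. BaseMapper, setupExtras, PlainMapper, and ExtrasMapper are hypothetical names, and the method parameters are dropped to keep the sketch free of Hadoop types.

```java
import java.util.ArrayList;
import java.util.List;

public class SetupHookSketch {
    // Hypothetical stand-in for CorpusTableMapper that records the order of setup steps.
    static class BaseMapper {
        protected Object table; // stand-in for the CorpusTable field
        final List<String> events = new ArrayList<>();

        // Stand-in for setup(Context): table first (assumed order), then the hook.
        public void setup() {
            table = new Object();
            events.add("table initialized");
            setupExtras();
        }

        // Stand-in for setup(Context, Configuration): does nothing by default.
        protected void setupExtras() { }
    }

    // Relies on the default, do-nothing hook.
    static class PlainMapper extends BaseMapper { }

    // Overrides the hook and checks whether the table is already available.
    static class ExtrasMapper extends BaseMapper {
        @Override
        protected void setupExtras() {
            events.add(table != null ? "hook saw table" : "hook saw null");
        }
    }

    public static void main(String[] args) {
        PlainMapper plain = new PlainMapper();
        plain.setup();
        System.out.println(plain.events);  // prints [table initialized]

        ExtrasMapper extras = new ExtrasMapper();
        extras.setup();
        System.out.println(extras.events); // prints [table initialized, hook saw table]
    }
}
```

Under that assumption, an override of the two-argument setup can use the table field, and a subclass that skips the override loses nothing.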


Copyright © 2010-2011. All Rights Reserved.