gov.llnl.ontology.mapreduce.ingest
Class IngestCorpusMR.IngestCorpusMapper

java.lang.Object
  extended by org.apache.hadoop.mapreduce.Mapper<org.apache.hadoop.hbase.io.ImmutableBytesWritable,org.apache.hadoop.hbase.client.Result,KEYOUT,VALUEOUT>
      extended by org.apache.hadoop.hbase.mapreduce.TableMapper<K,V>
          extended by gov.llnl.ontology.mapreduce.CorpusTableMR.CorpusTableMapper<org.apache.hadoop.hbase.io.ImmutableBytesWritable,org.apache.hadoop.hbase.client.Put>
              extended by gov.llnl.ontology.mapreduce.ingest.IngestCorpusMR.IngestCorpusMapper
Enclosing class:
IngestCorpusMR

public static class IngestCorpusMR.IngestCorpusMapper
extends CorpusTableMR.CorpusTableMapper<org.apache.hadoop.hbase.io.ImmutableBytesWritable,org.apache.hadoop.hbase.client.Put>

This TableMapper iterates over rows in a CorpusTable and applies sentence spans, token spans, and part of speech tags to every element in the raw text document.


Nested Class Summary
 
Nested classes/interfaces inherited from class org.apache.hadoop.mapreduce.Mapper
org.apache.hadoop.mapreduce.Mapper.Context
 
Field Summary
 
Fields inherited from class gov.llnl.ontology.mapreduce.CorpusTableMR.CorpusTableMapper
table
 
Constructor Summary
IngestCorpusMR.IngestCorpusMapper()
           
 
Method Summary
protected  void cleanup(org.apache.hadoop.mapreduce.Mapper.Context context)
          
 void map(org.apache.hadoop.hbase.io.ImmutableBytesWritable key, org.apache.hadoop.hbase.client.Result row, org.apache.hadoop.mapreduce.Mapper.Context context)
          
 void setup(org.apache.hadoop.mapreduce.Mapper.Context context, org.apache.hadoop.conf.Configuration conf)
          Sets up any addition data classes or information needed by the CorpusTableMR.CorpusTableMapper.
 
Methods inherited from class gov.llnl.ontology.mapreduce.CorpusTableMR.CorpusTableMapper
setup
 
Methods inherited from class org.apache.hadoop.mapreduce.Mapper
run
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

IngestCorpusMR.IngestCorpusMapper

public IngestCorpusMR.IngestCorpusMapper()
Method Detail

setup

public void setup(org.apache.hadoop.mapreduce.Mapper.Context context,
                  org.apache.hadoop.conf.Configuration conf)
Sets up any addition data classes or information needed by the CorpusTableMR.CorpusTableMapper. By default, this does nothing.

Overrides:
setup in class CorpusTableMR.CorpusTableMapper<org.apache.hadoop.hbase.io.ImmutableBytesWritable,org.apache.hadoop.hbase.client.Put>

map

public void map(org.apache.hadoop.hbase.io.ImmutableBytesWritable key,
                org.apache.hadoop.hbase.client.Result row,
                org.apache.hadoop.mapreduce.Mapper.Context context)

Overrides:
map in class org.apache.hadoop.mapreduce.Mapper<org.apache.hadoop.hbase.io.ImmutableBytesWritable,org.apache.hadoop.hbase.client.Result,org.apache.hadoop.hbase.io.ImmutableBytesWritable,org.apache.hadoop.hbase.client.Put>

cleanup

protected void cleanup(org.apache.hadoop.mapreduce.Mapper.Context context)

Overrides:
cleanup in class org.apache.hadoop.mapreduce.Mapper<org.apache.hadoop.hbase.io.ImmutableBytesWritable,org.apache.hadoop.hbase.client.Result,org.apache.hadoop.hbase.io.ImmutableBytesWritable,org.apache.hadoop.hbase.client.Put>


Copyright © 2010-2011. All Rights Reserved.