gov.llnl.ontology.mapreduce.ingest
Class ImportCorpusMR.ImportCorpusMapper

java.lang.Object
  extended by org.apache.hadoop.mapreduce.Mapper<org.apache.hadoop.hbase.io.ImmutableBytesWritable,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text>
      extended by gov.llnl.ontology.mapreduce.ingest.ImportCorpusMR.ImportCorpusMapper
Enclosing class:
ImportCorpusMR

public static class ImportCorpusMR.ImportCorpusMapper
extends org.apache.hadoop.mapreduce.Mapper<org.apache.hadoop.hbase.io.ImmutableBytesWritable,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text>

This Mapper iterates over text documents on disk and extracts various document details and the raw document text. All of the extracted information is stored in a CorpusTable.


Nested Class Summary
 
Nested classes/interfaces inherited from class org.apache.hadoop.mapreduce.Mapper
org.apache.hadoop.mapreduce.Mapper.Context
 
Constructor Summary
ImportCorpusMR.ImportCorpusMapper()
           
 
Method Summary
protected  void cleanup(org.apache.hadoop.mapreduce.Mapper.Context context)
          
 void map(org.apache.hadoop.hbase.io.ImmutableBytesWritable key, org.apache.hadoop.io.Text value, org.apache.hadoop.mapreduce.Mapper.Context context)
          
 void setup(org.apache.hadoop.mapreduce.Mapper.Context context)
          
 
Methods inherited from class org.apache.hadoop.mapreduce.Mapper
run
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

ImportCorpusMR.ImportCorpusMapper

public ImportCorpusMR.ImportCorpusMapper()
Method Detail

setup

public void setup(org.apache.hadoop.mapreduce.Mapper.Context context)

Overrides:
setup in class org.apache.hadoop.mapreduce.Mapper<org.apache.hadoop.hbase.io.ImmutableBytesWritable,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text>

map

public void map(org.apache.hadoop.hbase.io.ImmutableBytesWritable key,
                org.apache.hadoop.io.Text value,
                org.apache.hadoop.mapreduce.Mapper.Context context)

Overrides:
map in class org.apache.hadoop.mapreduce.Mapper<org.apache.hadoop.hbase.io.ImmutableBytesWritable,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text>

cleanup

protected void cleanup(org.apache.hadoop.mapreduce.Mapper.Context context)

Overrides:
cleanup in class org.apache.hadoop.mapreduce.Mapper<org.apache.hadoop.hbase.io.ImmutableBytesWritable,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text>


Copyright © 2010-2011. All Rights Reserved.