gov.llnl.ontology.mapreduce.ingest
Class ImportCorpusMR.ImportCorpusMapper
java.lang.Object
org.apache.hadoop.mapreduce.Mapper<org.apache.hadoop.hbase.io.ImmutableBytesWritable,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text>
gov.llnl.ontology.mapreduce.ingest.ImportCorpusMR.ImportCorpusMapper
- Enclosing class:
- ImportCorpusMR
public static class ImportCorpusMR.ImportCorpusMapper
- extends org.apache.hadoop.mapreduce.Mapper<org.apache.hadoop.hbase.io.ImmutableBytesWritable,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text>
This Mapper
iterates over text documents on disk and extracts
various document details and the raw document text. All of the extracted
information is stored in a CorpusTable
.
Nested classes/interfaces inherited from class org.apache.hadoop.mapreduce.Mapper |
org.apache.hadoop.mapreduce.Mapper.Context |
Method Summary |
protected void |
cleanup(org.apache.hadoop.mapreduce.Mapper.Context context)
|
void |
map(org.apache.hadoop.hbase.io.ImmutableBytesWritable key,
org.apache.hadoop.io.Text value,
org.apache.hadoop.mapreduce.Mapper.Context context)
|
void |
setup(org.apache.hadoop.mapreduce.Mapper.Context context)
|
Methods inherited from class org.apache.hadoop.mapreduce.Mapper |
run |
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
ImportCorpusMR.ImportCorpusMapper
public ImportCorpusMR.ImportCorpusMapper()
setup
public void setup(org.apache.hadoop.mapreduce.Mapper.Context context)
-
- Overrides:
setup
in class org.apache.hadoop.mapreduce.Mapper<org.apache.hadoop.hbase.io.ImmutableBytesWritable,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text>
map
public void map(org.apache.hadoop.hbase.io.ImmutableBytesWritable key,
org.apache.hadoop.io.Text value,
org.apache.hadoop.mapreduce.Mapper.Context context)
-
- Overrides:
map
in class org.apache.hadoop.mapreduce.Mapper<org.apache.hadoop.hbase.io.ImmutableBytesWritable,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text>
cleanup
protected void cleanup(org.apache.hadoop.mapreduce.Mapper.Context context)
-
- Overrides:
cleanup
in class org.apache.hadoop.mapreduce.Mapper<org.apache.hadoop.hbase.io.ImmutableBytesWritable,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text>
Copyright © 2010-2011. All Rights Reserved.