gov.llnl.ontology.mapreduce.ingest
Class IngestCorpusMR.IngestCorpusMapper
java.lang.Object
org.apache.hadoop.mapreduce.Mapper<org.apache.hadoop.hbase.io.ImmutableBytesWritable,org.apache.hadoop.hbase.client.Result,KEYOUT,VALUEOUT>
org.apache.hadoop.hbase.mapreduce.TableMapper<K,V>
gov.llnl.ontology.mapreduce.CorpusTableMR.CorpusTableMapper<org.apache.hadoop.hbase.io.ImmutableBytesWritable,org.apache.hadoop.hbase.client.Put>
gov.llnl.ontology.mapreduce.ingest.IngestCorpusMR.IngestCorpusMapper
- Enclosing class:
- IngestCorpusMR
public static class IngestCorpusMR.IngestCorpusMapper
- extends CorpusTableMR.CorpusTableMapper<org.apache.hadoop.hbase.io.ImmutableBytesWritable,org.apache.hadoop.hbase.client.Put>
This TableMapper
iterates over rows in a CorpusTable
and
applies sentence spans, token spans, and part of speech tags to every
element in the raw text document.
Nested classes/interfaces inherited from class org.apache.hadoop.mapreduce.Mapper |
org.apache.hadoop.mapreduce.Mapper.Context |
Method Summary |
protected void |
cleanup(org.apache.hadoop.mapreduce.Mapper.Context context)
|
void |
map(org.apache.hadoop.hbase.io.ImmutableBytesWritable key,
org.apache.hadoop.hbase.client.Result row,
org.apache.hadoop.mapreduce.Mapper.Context context)
|
void |
setup(org.apache.hadoop.mapreduce.Mapper.Context context,
org.apache.hadoop.conf.Configuration conf)
Sets up any addition data classes or information needed by the CorpusTableMR.CorpusTableMapper . |
Methods inherited from class org.apache.hadoop.mapreduce.Mapper |
run |
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
IngestCorpusMR.IngestCorpusMapper
public IngestCorpusMR.IngestCorpusMapper()
setup
public void setup(org.apache.hadoop.mapreduce.Mapper.Context context,
org.apache.hadoop.conf.Configuration conf)
- Sets up any addition data classes or information needed by the
CorpusTableMR.CorpusTableMapper
. By default, this does nothing.
- Overrides:
setup
in class CorpusTableMR.CorpusTableMapper<org.apache.hadoop.hbase.io.ImmutableBytesWritable,org.apache.hadoop.hbase.client.Put>
map
public void map(org.apache.hadoop.hbase.io.ImmutableBytesWritable key,
org.apache.hadoop.hbase.client.Result row,
org.apache.hadoop.mapreduce.Mapper.Context context)
-
- Overrides:
map
in class org.apache.hadoop.mapreduce.Mapper<org.apache.hadoop.hbase.io.ImmutableBytesWritable,org.apache.hadoop.hbase.client.Result,org.apache.hadoop.hbase.io.ImmutableBytesWritable,org.apache.hadoop.hbase.client.Put>
cleanup
protected void cleanup(org.apache.hadoop.mapreduce.Mapper.Context context)
-
- Overrides:
cleanup
in class org.apache.hadoop.mapreduce.Mapper<org.apache.hadoop.hbase.io.ImmutableBytesWritable,org.apache.hadoop.hbase.client.Result,org.apache.hadoop.hbase.io.ImmutableBytesWritable,org.apache.hadoop.hbase.client.Put>
Copyright © 2010-2011. All Rights Reserved.