gov.llnl.ontology.mapreduce.stats
Class NYTOnelineSectionMR.NYTOnelineSectionMapper
java.lang.Object
org.apache.hadoop.mapreduce.Mapper<org.apache.hadoop.hbase.io.ImmutableBytesWritable,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text>
gov.llnl.ontology.mapreduce.stats.NYTOnelineSectionMR.NYTOnelineSectionMapper
- Enclosing class:
- NYTOnelineSectionMR
public static class NYTOnelineSectionMR.NYTOnelineSectionMapper
- extends org.apache.hadoop.mapreduce.Mapper<org.apache.hadoop.hbase.io.ImmutableBytesWritable,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text>
This Mapper
iterates over text documents on disk and extracts
various document details and the raw document text. All of the extracted
information is stored in a CorpusTable
.
Nested classes/interfaces inherited from class org.apache.hadoop.mapreduce.Mapper |
org.apache.hadoop.mapreduce.Mapper.Context |
Method Summary |
void |
map(org.apache.hadoop.hbase.io.ImmutableBytesWritable key,
org.apache.hadoop.io.Text value,
org.apache.hadoop.mapreduce.Mapper.Context context)
|
Methods inherited from class org.apache.hadoop.mapreduce.Mapper |
cleanup, run, setup |
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
NYTOnelineSectionMR.NYTOnelineSectionMapper
public NYTOnelineSectionMR.NYTOnelineSectionMapper()
map
public void map(org.apache.hadoop.hbase.io.ImmutableBytesWritable key,
org.apache.hadoop.io.Text value,
org.apache.hadoop.mapreduce.Mapper.Context context)
throws IOException,
InterruptedException
-
- Overrides:
map
in class org.apache.hadoop.mapreduce.Mapper<org.apache.hadoop.hbase.io.ImmutableBytesWritable,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text>
- Throws:
IOException
InterruptedException
Copyright © 2010-2011. All Rights Reserved.