|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Objectorg.apache.hadoop.conf.Configured
gov.llnl.ontology.mapreduce.CorpusTableMR
gov.llnl.ontology.mapreduce.stats.WordOccurrenceCountMR
public class WordOccurrenceCountMR
A Map/Reduce job that counts word co-occurrence frequencies and writes them to hdfs.
Nested Class Summary | |
---|---|
static class |
WordOccurrenceCountMR.WordOccurrenceCountMapper
The TableMapper responsible for most of the work. |
Nested classes/interfaces inherited from class gov.llnl.ontology.mapreduce.CorpusTableMR |
---|
CorpusTableMR.CorpusTableMapper<K,V> |
Field Summary | |
---|---|
static String |
ABOUT
The job description used in the help text. |
static String |
CONF_PREFIX
The prefix for every configuration. |
static String |
MR_NAME
|
static String |
USE_ORDER
The configuration set when word ordering features should be used. |
static String |
USE_POS
The configuration set when part of speech features should be used. |
static String |
WINDOW_SIZE
The configuration used to set the maximum sliding window size. |
Fields inherited from class gov.llnl.ontology.mapreduce.CorpusTableMR |
---|
TABLE |
Constructor Summary | |
---|---|
WordOccurrenceCountMR()
|
Method Summary | |
---|---|
protected void |
addOptions(MRArgOptions options)
Add more command line arguments. |
protected String |
jobName()
Returns a descriptive job name for this map reduce task. |
static void |
main(String[] args)
Runs the WordOccurrenceCountMR . |
protected Class |
mapperClass()
Returns the Class object for the Mapper task. |
protected Class |
mapperKeyClass()
Returns the Class object for the Mapper Value of this task. |
protected Class |
mapperValueClass()
Returns the Class object for the Mapper Value of this task. |
protected void |
setupConfiguration(MRArgOptions options,
org.apache.hadoop.conf.Configuration conf)
Copies command line arguments to a Configuration so that
Map/Reduce jobs can utilize the values set. |
protected void |
setupReducer(String tableName,
org.apache.hadoop.mapreduce.Job job,
MRArgOptions options)
Sets up the Reducer for this job. |
protected void |
validateOptions(MRArgOptions options)
Returns true if the MRArgOptions contains a valid value for each
requried option. |
Methods inherited from class gov.llnl.ontology.mapreduce.CorpusTableMR |
---|
run |
Methods inherited from class org.apache.hadoop.conf.Configured |
---|
getConf, setConf |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Methods inherited from interface org.apache.hadoop.conf.Configurable |
---|
getConf, setConf |
Field Detail |
---|
public static final String ABOUT
public static final String CONF_PREFIX
public static final String MR_NAME
public static final String USE_POS
public static final String USE_ORDER
public static final String WINDOW_SIZE
Constructor Detail |
---|
public WordOccurrenceCountMR()
Method Detail |
---|
public static void main(String[] args) throws Exception
WordOccurrenceCountMR
.
Exception
protected String jobName()
jobName
in class CorpusTableMR
protected void addOptions(MRArgOptions options)
addOptions
in class CorpusTableMR
protected void validateOptions(MRArgOptions options)
MRArgOptions
contains a valid value for each
requried option. By default, this does no validation.
validateOptions
in class CorpusTableMR
protected void setupConfiguration(MRArgOptions options, org.apache.hadoop.conf.Configuration conf)
Configuration
so that
Map/Reduce jobs can utilize the values set. By default, this does no
configuration.
setupConfiguration
in class CorpusTableMR
protected Class mapperClass()
Class
object for the Mapper task.
mapperClass
in class CorpusTableMR
protected Class mapperKeyClass()
Class
object for the Mapper Value of this task.
mapperKeyClass
in class CorpusTableMR
protected Class mapperValueClass()
Class
object for the Mapper Value of this task.
mapperValueClass
in class CorpusTableMR
protected void setupReducer(String tableName, org.apache.hadoop.mapreduce.Job job, MRArgOptions options)
setupReducer
in class CorpusTableMR
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |