gov.llnl.ontology.text.corpora
Class SemEval2010TestDocumentReader

java.lang.Object
  extended by gov.llnl.ontology.text.corpora.SemEval2010TrainDocumentReader
      extended by gov.llnl.ontology.text.corpora.SemEval2010TestDocumentReader
All Implemented Interfaces:
DocumentReader

public class SemEval2010TestDocumentReader
extends SemEval2010TrainDocumentReader

A DocumentReader for the SemEval2010 test corpus. This reader automatically removes the TargetSentence tags from each document. The instance name serves as the document key, and the title is the key term. The id is the token index of the word that matches the title when both are stemmed. The reader does not generate any labels for a document.
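
The id rule described above (the token index of the word whose stem matches the stemmed title) can be illustrated with a minimal, self-contained sketch. This is not the reader's actual implementation: the class TargetIndexSketch, the methods crudeStem and findTargetIndex, and the suffix-stripping stemmer are all hypothetical stand-ins chosen for illustration, and the stemmer and tokenizer the reader really uses are not documented on this page.

```java
import java.util.Locale;

public class TargetIndexSketch {

    // Hypothetical, deliberately crude stemmer: lower-cases the word and strips
    // one common English suffix. It only stands in for whatever stemmer the
    // real reader uses, which this page does not specify.
    static String crudeStem(String word) {
        String w = word.toLowerCase(Locale.ROOT);
        for (String suffix : new String[] {"ing", "ed", "es", "s"}) {
            if (w.length() > suffix.length() + 2 && w.endsWith(suffix)) {
                return w.substring(0, w.length() - suffix.length());
            }
        }
        return w;
    }

    // Returns the index of the first whitespace-separated token whose stem
    // equals the stem of keyTerm, or -1 if no token matches. Taking the first
    // match is an assumption of this sketch.
    static int findTargetIndex(String text, String keyTerm) {
        String target = crudeStem(keyTerm);
        String[] tokens = text.trim().split("\\s+");
        for (int i = 0; i < tokens.length; i++) {
            if (crudeStem(tokens[i]).equals(target)) {
                return i;
            }
        }
        return -1;
    }

    public static void main(String[] args) {
        // "presented" stems to "present", which matches the key term at index 2.
        System.out.println(
            findTargetIndex("the committee presented its findings", "present")); // prints 2
    }
}
```

Comparing stems rather than surface forms is what lets an inflected occurrence such as "presented" be located from the uninflected key term "present".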

This class is not thread safe.
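
Because instances must not be shared across threads, one common pattern is to give each thread its own reader. The sketch below shows that pattern with java.lang.ThreadLocal. StatefulReader is a hypothetical stand-in with mutable internal state, not the real SemEval2010TestDocumentReader, and the names PerThreadReaderSketch, READER, and readOnCurrentThread are assumptions made for illustration.

```java
public class PerThreadReaderSketch {

    // Hypothetical stand-in for a reader that keeps mutable parsing state,
    // a typical reason for a class not being thread safe.
    static class StatefulReader {
        private final StringBuilder buffer = new StringBuilder();

        String read(String text) {
            buffer.setLength(0);          // reset shared internal state
            buffer.append(text.trim());
            return buffer.toString();
        }
    }

    // One reader per thread, created lazily the first time a thread asks for it.
    static final ThreadLocal<StatefulReader> READER =
        ThreadLocal.withInitial(StatefulReader::new);

    // Always uses the calling thread's own reader instance.
    static String readOnCurrentThread(String text) {
        return READER.get().read(text);
    }

    public static void main(String[] args) throws InterruptedException {
        StatefulReader mainReader = READER.get();
        StatefulReader[] workerReader = new StatefulReader[1];
        Thread worker = new Thread(() -> workerReader[0] = READER.get());
        worker.start();
        worker.join();
        // Each thread received a distinct instance.
        System.out.println(mainReader != workerReader[0]); // prints true
    }
}
```

With the real class, the same idea would apply by constructing a separate SemEval2010TestDocumentReader in each thread instead of sharing one.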

Author:
Keith Stevens

Field Summary
static String CORPUS_NAME
           
 
Constructor Summary
SemEval2010TestDocumentReader()
           
 
Method Summary
 String corpusName()
          Returns the value of CORPUS_NAME.
 
Methods inherited from class gov.llnl.ontology.text.corpora.SemEval2010TrainDocumentReader
readDocument, readDocument
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

CORPUS_NAME

public static final String CORPUS_NAME
See Also:
Constant Field Values
Constructor Detail

SemEval2010TestDocumentReader

public SemEval2010TestDocumentReader()
Method Detail

corpusName

public String corpusName()
Returns the value of CORPUS_NAME.

Overrides:
corpusName in class SemEval2010TrainDocumentReader


Copyright © 2010-2011. All Rights Reserved.