gov.llnl.ontology.text.corpora
Class SemEval2010TestDocumentReader
java.lang.Object
gov.llnl.ontology.text.corpora.SemEval2010TrainDocumentReader
gov.llnl.ontology.text.corpora.SemEval2010TestDocumentReader
- All Implemented Interfaces:
- DocumentReader
public class SemEval2010TestDocumentReader
- extends SemEval2010TrainDocumentReader
A DocumentReader
for the SemEval2010 test corpus. This automatically
removes the TargetSentence
tags from the document. It uses the
instance name as the key, the title is just the keyterm. The id is the
token index of the word that matches the title when both are stemmed. It
does not generate any labels for a document.
This is not thread safe.
- Author:
- Keith Stevens
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
CORPUS_NAME
public static final String CORPUS_NAME
- See Also:
- Constant Field Values
SemEval2010TestDocumentReader
public SemEval2010TestDocumentReader()
corpusName
public String corpusName()
- Returns
CORPUS_NAME
- Overrides:
corpusName
in class SemEval2010TrainDocumentReader
Copyright © 2010-2011. All Rights Reserved.