java.lang.Object
  gov.llnl.ontology.text.corpora.SemEval2010TrainDocumentReader
public class SemEval2010TrainDocumentReader
A DocumentReader for the SemEval2010 test corpus. It uses the
instance name as the key, and the title is just the keyterm. The id is the
token index of the word that matches the title when both are stemmed. It
does not generate any labels for a document.
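The id rule described above (the index of the token whose stem matches the stemmed title) can be sketched as follows. This is a minimal illustration, not the reader's actual code: the class `StemMatchSketch`, the methods `toyStem` and `matchIndex`, and the suffix-stripping stemmer are all hypothetical, and the stemmer the real class uses is not documented on this page.

```java
import java.util.Arrays;
import java.util.List;

class StemMatchSketch {

    // Toy stemmer (an assumption for illustration only): lowercases the word
    // and strips at most one common English suffix.
    static String toyStem(String word) {
        String w = word.toLowerCase();
        for (String suffix : new String[] {"ing", "ed", "es", "s"}) {
            if (w.length() > suffix.length() + 2 && w.endsWith(suffix)) {
                return w.substring(0, w.length() - suffix.length());
            }
        }
        return w;
    }

    // Returns the index of the first token whose stem equals the stem of the
    // title, or -1 if no token matches.
    static int matchIndex(List<String> tokens, String title) {
        String target = toyStem(title);
        for (int i = 0; i < tokens.size(); i++) {
            if (toyStem(tokens.get(i)).equals(target)) {
                return i;
            }
        }
        return -1;
    }

    public static void main(String[] args) {
        List<String> tokens = Arrays.asList("She", "was", "watching", "the", "birds");
        // "watching" and "watch" share the stem "watch", so this prints 2.
        System.out.println(matchIndex(tokens, "watch"));
    }
}
```

Comparing stems rather than surface forms lets an inflected occurrence in the text still match the keyterm.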
Field Summary | |
---|---|
static String | CORPUS_NAME |
Constructor Summary | |
---|---|
SemEval2010TrainDocumentReader() | Constructs a new SemEval2010TrainDocumentReader. |
Method Summary | |
---|---|
String | corpusName(): Returns CORPUS_NAME. |
Document | readDocument(String doc): Returns a Document represented by the given string. |
Document | readDocument(String doc, String corpusName): Returns a Document represented by the given string and uses corpusName as the corpus name for the returned Document. |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
---|
public static final String CORPUS_NAME
Constructor Detail |
---|
public SemEval2010TrainDocumentReader()
Constructs a new SemEval2010TrainDocumentReader.
Method Detail |
---|
public String corpusName()
Returns CORPUS_NAME.
public Document readDocument(String doc)
Returns a Document represented by the given string.
Specified by: readDocument in interface DocumentReader
public Document readDocument(String doc, String corpusName)
Returns a Document represented by the given string and uses corpusName as the corpus name for the returned Document.
Specified by: readDocument in interface DocumentReader
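The relationship between the two readDocument overloads can be illustrated with a small self-contained sketch. Everything here is a stand-in: the nested `Document` and `DocumentReader` interfaces only mirror the signatures listed above, `SketchReader` and the `"placeholder-corpus"` value are hypothetical, and the one-argument overload delegating to the two-argument one is an assumption about a typical design, not something this page confirms.

```java
class ReaderOverloadSketch {

    // Hypothetical stand-in for the library's Document type.
    interface Document {
        String corpusName();
        String rawText();
    }

    // Hypothetical stand-in mirroring the two readDocument signatures.
    interface DocumentReader {
        Document readDocument(String doc);
        Document readDocument(String doc, String corpusName);
    }

    static class SketchReader implements DocumentReader {
        // Placeholder value; the real value of CORPUS_NAME is not given here.
        static final String CORPUS_NAME = "placeholder-corpus";

        public String corpusName() {
            return CORPUS_NAME;
        }

        // Assumed behavior: fall back to the reader's default corpus name.
        public Document readDocument(String doc) {
            return readDocument(doc, corpusName());
        }

        // Wraps the raw text and records the caller-supplied corpus name.
        public Document readDocument(final String doc, final String corpusName) {
            return new Document() {
                public String corpusName() { return corpusName; }
                public String rawText() { return doc; }
            };
        }
    }

    public static void main(String[] args) {
        SketchReader reader = new SketchReader();
        System.out.println(reader.readDocument("some text").corpusName());
        System.out.println(reader.readDocument("some text", "custom").corpusName());
    }
}
```

Under these assumptions, a caller that does not care about the corpus label uses the one-argument form, while the two-argument form lets the same reader tag documents for a differently named corpus.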