public class SenseEvalDependencyCorpusReader extends Object implements CorpusReader<Document>
Document
.
This CorpusReader
returns documents in the following format:
word_instance_id focus_word text ...
This is particularly neccesary for the evaluating against the SemEval testing
framework which requires the focus word information and the instance idModifier and Type | Class and Description |
---|---|
class |
SenseEvalDependencyCorpusReader.SenseEvalIterator |
Constructor and Description |
---|
SenseEvalDependencyCorpusReader() |
public Iterator<Document> read(File file)
Iterator
that traverses the documents containted in
the given file
.read
in interface CorpusReader<Document>
file
- A text file holding documents in a format
that is readable by a particular CorpusReader
. This text
file may have it's own unique text structure or an xml format.
Each CorpusReader
should specify the expected text format.public Iterator<Document> read(Reader docReader)
Iterator
that traverses the documents contained in
baseReader
.read
in interface CorpusReader<Document>
docReader
- A Reader
that will extract text from a data
source, such as a URL, a File, a data stream, or any other source
accesible via the Reader
interface. Each CorpusReader
should specify the expected text format, be it an
XML schema or some other unique format.Copyright © 2012. All Rights Reserved.