public class TemporalUsenetCorpusReader extends DirectoryCorpusReader<TemporalDocument>
A DirectoryCorpusReader for the Usenet corpus provided by the Westbury Lab. The corpus filenames are expected to remain unchanged from how they were

Modifier and Type | Class and Description |
---|---|
class | TemporalUsenetCorpusReader.UseNetIterator |
Nested classes inherited from class DirectoryCorpusReader: DirectoryCorpusReader.BaseFileIterator
Constructor and Description |
---|
TemporalUsenetCorpusReader() |
TemporalUsenetCorpusReader(DocumentPreprocessor preprocessor) |
Modifier and Type | Method and Description |
---|---|
protected Iterator<TemporalDocument> | corpusIterator(Iterator<File> files) |
Methods inherited from class DirectoryCorpusReader: initialize, read, read
public TemporalUsenetCorpusReader()
public TemporalUsenetCorpusReader(DocumentPreprocessor preprocessor)
protected Iterator<TemporalDocument> corpusIterator(Iterator<File> files)
Description copied from class: DirectoryCorpusReader
Returns an Iterator over documents contained in the Files traversed by fileIter. Sub-classes are encouraged to sub-class BaseFileIterator for the return value of this method.
Specified by: corpusIterator in class DirectoryCorpusReader<TemporalDocument>
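
The pattern described for corpusIterator (an iterator over documents that itself walks an iterator over files) can be illustrated with a small self-contained sketch. Everything in it is a hypothetical stand-in rather than the library's code: SketchFileIterator, DelimitedTextIterator, openFile, nextInFile and demo are invented names, documents are plain Strings instead of TemporalDocument, and the delimiter line is an assumption made only so the example has something to split on.

```java
import java.io.BufferedReader;
import java.io.File;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;
import java.util.NoSuchElementException;

public class FileDocumentIteratorSketch {

    /** Hypothetical base iterator: walks files and yields the documents inside them. */
    abstract static class SketchFileIterator<D> implements Iterator<D> {
        private final Iterator<File> files;
        private boolean opened = false;  // true once at least one file has been opened
        private boolean done = false;    // true once every file has been exhausted
        private D next;                  // prefetched document, or null

        SketchFileIterator(Iterator<File> files) { this.files = files; }

        /** Prepares a new file for reading. */
        protected abstract void openFile(File f) throws IOException;

        /** Returns the next document in the current file, or null if none remain. */
        protected abstract D nextInFile() throws IOException;

        private D advance() {
            if (done) return null;
            try {
                while (true) {
                    if (opened) {
                        D doc = nextInFile();
                        if (doc != null) return doc;
                    }
                    if (!files.hasNext()) { done = true; return null; }
                    openFile(files.next());
                    opened = true;
                }
            } catch (IOException e) {
                throw new UncheckedIOException(e);
            }
        }

        @Override public boolean hasNext() {
            if (next == null) next = advance();
            return next != null;
        }

        @Override public D next() {
            if (!hasNext()) throw new NoSuchElementException();
            D doc = next;
            next = null;
            return doc;
        }
    }

    /** Hypothetical subclass: splits each file into documents on a delimiter line. */
    static class DelimitedTextIterator extends SketchFileIterator<String> {
        // Assumed delimiter, chosen for illustration only.
        static final String DELIMITER = "---END.OF.DOCUMENT---";
        private BufferedReader reader;

        DelimitedTextIterator(Iterator<File> files) { super(files); }

        @Override protected void openFile(File f) throws IOException {
            if (reader != null) reader.close();
            reader = Files.newBufferedReader(f.toPath(), StandardCharsets.UTF_8);
        }

        @Override protected String nextInFile() throws IOException {
            if (reader == null) return null;  // current file already fully read
            StringBuilder sb = new StringBuilder();
            String line;
            while ((line = reader.readLine()) != null) {
                if (line.trim().equals(DELIMITER)) return sb.toString().trim();
                sb.append(line).append('\n');
            }
            reader.close();                   // end of file reached
            reader = null;
            String rest = sb.toString().trim();
            return rest.isEmpty() ? null : rest;
        }
    }

    /** Writes two small temporary files and collects every document found in them. */
    public static List<String> demo() {
        try {
            File dir = Files.createTempDirectory("usenet-sketch").toFile();
            dir.deleteOnExit();
            File a = new File(dir, "a.txt");
            File b = new File(dir, "b.txt");
            a.deleteOnExit();
            b.deleteOnExit();
            String d = DelimitedTextIterator.DELIMITER;
            Files.write(a.toPath(), Arrays.asList("first post", d, "second post", d),
                        StandardCharsets.UTF_8);
            Files.write(b.toPath(), Arrays.asList("third post", d), StandardCharsets.UTF_8);
            Iterator<String> it = new DelimitedTextIterator(Arrays.asList(a, b).iterator());
            List<String> docs = new ArrayList<>();
            while (it.hasNext()) docs.add(it.next());
            return docs;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        for (String doc : demo()) System.out.println(doc);
    }
}
```

Keeping the file traversal in the base class and the per-file parsing in the subclass means a reader for a different corpus layout would only need to supply its own parsing step. Whether BaseFileIterator divides the work in exactly this way is not stated here, so treat the split as an assumption.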
Copyright © 2012. All Rights Reserved.