public class TemporalUsenetCorpusReader extends DirectoryCorpusReader<TemporalDocument>
DirectoryCorpusReader for the Usenet
corpus provided by the Westbury Lab.
The corpus filenames are expected to remain unchanged from how they were| Modifier and Type | Class and Description |
|---|---|
class |
TemporalUsenetCorpusReader.UseNetIterator |
DirectoryCorpusReader.BaseFileIterator| Constructor and Description |
|---|
TemporalUsenetCorpusReader() |
TemporalUsenetCorpusReader(DocumentPreprocessor preprocessor) |
| Modifier and Type | Method and Description |
|---|---|
protected Iterator<TemporalDocument> |
corpusIterator(Iterator<File> files)
|
initialize, read, readpublic TemporalUsenetCorpusReader()
public TemporalUsenetCorpusReader(DocumentPreprocessor preprocessor)
protected Iterator<TemporalDocument> corpusIterator(Iterator<File> files)
DirectoryCorpusReaderIterator over documents contained in the Files
traversed by fileIter. Sub-classes are encouraged to sub-class
BaseFileIterator for the return value of this method.corpusIterator in class DirectoryCorpusReader<TemporalDocument>Copyright © 2012. All Rights Reserved.