public class TemporalBloglinesCorpusReader extends DirectoryCorpusReader<TemporalDocument>
| Modifier and Type | Class and Description |
|---|---|
class |
TemporalBloglinesCorpusReader.BloglinesIterator |
DirectoryCorpusReader.BaseFileIterator| Constructor and Description |
|---|
TemporalBloglinesCorpusReader()
Constructs a new
TemporalBloglinesCorpusReader that uses no
preprocessing before documents are returned. |
TemporalBloglinesCorpusReader(DocumentPreprocessor preprocessor)
Constructs a new
TemporalBloglinesCorpusReader that uses preprocessor to clean documents before they are returned. |
| Modifier and Type | Method and Description |
|---|---|
protected Iterator<TemporalDocument> |
corpusIterator(Iterator<File> files)
|
initialize, read, readpublic TemporalBloglinesCorpusReader()
TemporalBloglinesCorpusReader that uses no
preprocessing before documents are returned.public TemporalBloglinesCorpusReader(DocumentPreprocessor preprocessor)
TemporalBloglinesCorpusReader that uses preprocessor to clean documents before they are returned.protected Iterator<TemporalDocument> corpusIterator(Iterator<File> files)
Iterator over documents contained in the Files
traversed by fileIter. Sub-classes are encouraged to sub-class
BaseFileIterator for the return value of this method.corpusIterator in class DirectoryCorpusReader<TemporalDocument>Copyright © 2012. All Rights Reserved.