public class TemporalBloglinesCorpusReader extends DirectoryCorpusReader<TemporalDocument>
Modifier and Type | Class and Description |
---|---|
class |
TemporalBloglinesCorpusReader.BloglinesIterator |
DirectoryCorpusReader.BaseFileIterator
Constructor and Description |
---|
TemporalBloglinesCorpusReader()
Constructs a new
TemporalBloglinesCorpusReader that uses no
preprocessing before documents are returned. |
TemporalBloglinesCorpusReader(DocumentPreprocessor preprocessor)
Constructs a new
TemporalBloglinesCorpusReader that uses preprocessor to clean documents before they are returned. |
Modifier and Type | Method and Description |
---|---|
protected Iterator<TemporalDocument> |
corpusIterator(Iterator<File> files)
|
initialize, read, read
public TemporalBloglinesCorpusReader()
TemporalBloglinesCorpusReader
that uses no
preprocessing before documents are returned.public TemporalBloglinesCorpusReader(DocumentPreprocessor preprocessor)
TemporalBloglinesCorpusReader
that uses preprocessor
to clean documents before they are returned.protected Iterator<TemporalDocument> corpusIterator(Iterator<File> files)
Iterator
over documents contained in the File
s
traversed by fileIter
. Sub-classes are encouraged to sub-class
BaseFileIterator for the return value of this method.corpusIterator
in class DirectoryCorpusReader<TemporalDocument>
Copyright © 2012. All Rights Reserved.