Package | Description |
---|---|
edu.ucla.sspace.text | |
edu.ucla.sspace.text.corpora |
Modifier and Type | Class and Description |
---|---|
class |
BloglinesCorpusReader
A
DirectoryCorpusReader for the bloglines corpus. |
class |
ChildesCorpusReader
A corpus reader for the Childes corpus.
|
class |
DirectoryCorpusReader<D extends Document>
An abstract base class for corpus reading iterators that need to traverse
through a large nested directory structure to find files containing text.
|
class |
SenseEvalDependencyCorpusReader
A corpus reader for the SenseEvalDependency corpus.
|
class |
TemporalBloglinesCorpusReader
A subclass of
BloglinesCorpusReader that always includes timestamps. |
class |
TemporalUsenetCorpusReader
A subclass of
UsenetCorpusReader that always includes timestamps. |
class |
UsenetCorpusReader
|
Modifier and Type | Class and Description |
---|---|
class |
PukWacCorpusReader
Reads full documents from a parsed UkWac or Wackypedia corpus.
|
class |
PukWacDependencyCorpusReader
Reads dependency parsed sentence lines, which are expected to be a the CoNNL
format, from a parsed UkWac or Wackypedia corpus.
|
class |
SemEvalCorpusReader
Reads the xml corpus files for the SemEval 2010 Word Sense Induction task,
available here.
|
class |
SemEvalLexSubReader
Reads the xml corpus files for the SemEval 2010 Lexical Substition task,
available here.
|
Copyright © 2012. All Rights Reserved.