public class TopicModelContextExtractor extends Object implements ContextExtractor

A ContextExtractor for processing documents whose contexts are topic signatures computed by the Mallet framework. The very first token of each document should be the focus word that the context represents.

Constructor and Description |
---|
TopicModelContextExtractor() |

Modifier and Type | Method and Description |
---|---|
int | getVectorLength(): Returns the maximum number of dimensions used to represent any given context. |
void | processDocument(BufferedReader document, Wordsi wordsi): Processes the content of document and calls Wordsi.handleContextVector(java.lang.String, java.lang.String, edu.ucla.sspace.vector.SparseDoubleVector) for each context vector that can be extracted from document. |

public void processDocument(BufferedReader document, Wordsi wordsi)

Processes the content of document and calls Wordsi.handleContextVector(java.lang.String, java.lang.String, edu.ucla.sspace.vector.SparseDoubleVector) for each context vector that can be extracted from document.

Specified by: processDocument in interface ContextExtractor
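
The class description says that each document starts with the focus word, followed by the topic signature for its context. The exact layout of that signature is not stated on this page, so the sketch below assumes a hypothetical one: the focus word, then whitespace-separated `topicIndex weight` pairs. The class `TopicSignatureSketch` and its nested `Context` type are illustrative names, not part of the library, and a plain `Map` stands in for `SparseDoubleVector`. It only illustrates the kind of parsing that could come before a call to `Wordsi.handleContextVector`, and is not the library's implementation.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.StringReader;
import java.util.Map;
import java.util.TreeMap;

public class TopicSignatureSketch {

    /** Holds a focus word and its sparse topic weights (topic index -> weight). */
    public static final class Context {
        public final String focusWord;
        public final Map<Integer, Double> topicWeights;

        Context(String focusWord, Map<Integer, Double> topicWeights) {
            this.focusWord = focusWord;
            this.topicWeights = topicWeights;
        }
    }

    /**
     * Reads the first line of the document.
     * ASSUMED layout (hypothetical, not taken from the library):
     *   focusWord topicIndex weight topicIndex weight ...
     */
    public static Context parse(BufferedReader document) throws IOException {
        String line = document.readLine();
        if (line == null || line.trim().isEmpty()) {
            throw new IllegalArgumentException("empty document");
        }
        String[] tokens = line.trim().split("\\s+");
        // One focus word plus complete index/weight pairs gives an odd token count.
        if (tokens.length % 2 == 0) {
            throw new IllegalArgumentException(
                "expected index/weight pairs after the focus word");
        }
        Map<Integer, Double> weights = new TreeMap<>();
        for (int i = 1; i < tokens.length; i += 2) {
            weights.put(Integer.parseInt(tokens[i]),
                        Double.parseDouble(tokens[i + 1]));
        }
        return new Context(tokens[0], weights);
    }

    public static void main(String[] args) throws IOException {
        Context c = parse(new BufferedReader(
            new StringReader("bank 3 0.62 17 0.38")));
        System.out.println(c.focusWord + " -> " + c.topicWeights);
    }
}
```

In a real extractor the parsed weights would be copied into a sparse vector and passed to the `Wordsi` instance together with the focus word. How the two string keys of `handleContextVector` are chosen is not described on this page.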

public int getVectorLength()

Specified by: getVectorLength in interface ContextExtractor

Copyright © 2012. All Rights Reserved.