public interface ContextExtractor
Wordsi
implementation. Implementations are
recomended to use either a ContextGenerator
or a BasisMapping
that is serializable. Use of a ContextGenerator
or a BasisMapping
separates the feature space from the text traveral, allowing
the feature space to be reused, even if a different text traversal method
needs to be used.Modifier and Type | Method and Description |
---|---|
int |
getVectorLength()
Returns the maximum number of dimensions used to represent any given
context.
|
void |
processDocument(BufferedReader document,
Wordsi wordsi)
Processes the content of
document and calls Wordsi.handleContextVector(java.lang.String, java.lang.String, edu.ucla.sspace.vector.SparseDoubleVector) for each context vector that can be extracted
from document . |
void processDocument(BufferedReader document, Wordsi wordsi)
document
and calls Wordsi.handleContextVector(java.lang.String, java.lang.String, edu.ucla.sspace.vector.SparseDoubleVector)
for each context vector that can be extracted
from document
.int getVectorLength()
Copyright © 2012. All Rights Reserved.