public class DVWordsiMain extends GenericWordsiMain
A command-line main class for running Wordsi with dependency-path features. GenericWordsiMain provides the core command-line arguments and functionality. This class provides the following additional arguments:
-p, --pathAcceptor=CLASSNAME
Specifies the DependencyPathAcceptor to use when validating paths as features. (Default: UniversalPathAcceptor)
-W, --weightingFunction=CLASSNAME
Specifies the class that will weight dependency paths.
-b, --basisMapping=CLASSNAME
Specifies the class that determines which aspect of a DependencyPath will serve as a feature in the word space. (Default: WordBasedBasisMapping)
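Each of these options takes a class name and, in two cases, falls back to a default. The sketch below shows one common way such a CLASSNAME value can be turned into an instance through reflection. It is an illustration only: `ClassNameOption` and `loadOrDefault` are hypothetical names, not part of the S-Space API, and the JDK list classes stand in for acceptor, weighting, and basis-mapping classes.

```java
import java.util.List;

// Hypothetical helper, not part of the S-Space API: resolves a CLASSNAME
// option value to an instance, falling back to a default class name.
final class ClassNameOption {

    private ClassNameOption() { }

    /**
     * Instantiates the class named by optionValue through its no-argument
     * constructor, or the class named by defaultClassName when the option
     * was not supplied.
     */
    static <T> T loadOrDefault(String optionValue,
                               String defaultClassName,
                               Class<T> expectedType) {
        String className = (optionValue == null || optionValue.isEmpty())
            ? defaultClassName
            : optionValue;
        try {
            Class<?> clazz = Class.forName(className);
            Object instance = clazz.getDeclaredConstructor().newInstance();
            // Fails with ClassCastException if the class has the wrong type.
            return expectedType.cast(instance);
        } catch (ReflectiveOperationException | ClassCastException e) {
            throw new IllegalArgumentException(
                "Cannot instantiate " + className + " as "
                + expectedType.getName(), e);
        }
    }

    public static void main(String[] args) {
        // No option given: the default class name is used.
        List<?> fromDefault =
            loadOrDefault(null, "java.util.ArrayList", List.class);
        // Option given: it takes precedence over the default.
        List<?> fromOption =
            loadOrDefault("java.util.LinkedList", "java.util.ArrayList", List.class);
        System.out.println(fromDefault.getClass().getName());
        System.out.println(fromOption.getClass().getName());
    }
}
```

A class supplied this way needs an accessible no-argument constructor; whether the real option handling imposes the same requirement is an assumption of this sketch.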
Fields inherited from class GenericMain: argOptions, EXT, isMultiThreaded, verbose
Constructor and Description |
---|
DVWordsiMain() |
Modifier and Type | Method and Description |
---|---|
protected void | addDocIterators(Collection<Iterator<Document>> docIters, String[] fileNames) Adds DependencyFileDocumentIterators for each file name provided. |
protected void | addExtraOptions(ArgOptions options) Adds options to the provided ArgOptions instance, which will be used to parse the command line. |
protected void | addFileIterators(Collection<Iterator<Document>> docIters, String[] fileNames) Throws UnsupportedOperationException. |
protected DependencyPathAcceptor | getAcceptor() |
protected DependencyContextGenerator | getContextGenerator() |
protected ContextExtractor | getExtractor() Returns a ContextExtractor, which will be responsible for creating context vectors for documents. |
protected SemanticSpaceIO.SSpaceFormat | getSpaceFormat() Returns the format in which the finished SemanticSpace should be saved. |
protected DependencyPathWeight | getWeighter() |
protected void | handleExtraOptions() Once the command line has been parsed, allows the subclasses to perform additional steps based on class-specific options. |
static void | main(String[] args) |
protected void | postProcessing() Allows subclasses to interact with the SemanticSpace after the space has finished processing all of the text. |
Methods inherited from class GenericWordsiMain: contextExtractorFromGenerator, getAcceptedWords, getDocumentIterator, getPseudoWordMap, getSpace, loadObject, openLoadFile, openSaveFile, saveObject, windowSize
Methods inherited from class GenericMain: addCorpusReaderIterators, getAlgorithmSpecifics, loadValidTermSet, parseDocumentsMultiThreaded, parseDocumentsSingleThreaded, processDocumentsAndSpace, run, saveSSpace, setupOptions, setupProperties, usage, verbose, verbose
protected void addExtraOptions(ArgOptions options)
Adds options to the provided ArgOptions instance, which will be used to parse the command line. This method allows subclasses to add extra command-line options.
Overrides: addExtraOptions in class GenericWordsiMain
Parameters: options - the ArgOptions object to which more main-specific options can be added
See Also: GenericMain.handleExtraOptions()
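protected void handleExtraOptions()
Description copied from class: GenericMain
Once the command line has been parsed, allows the subclasses to perform additional steps based on class-specific options. This method will be called before getSpace.
Overrides: handleExtraOptions in class GenericMain
See Also: GenericMain.addExtraOptions(ArgOptions)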
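The two hooks above form a pair: one registers class-specific options before parsing, the other reads them after parsing. The following is a minimal sketch of that pattern with stand-in types. `SketchMain`, `SketchDependencyMain`, the `Map`-based option store, and the `--name=value` parsing are all hypothetical and much simpler than ArgOptions; the call order inside `run` is an assumption made for illustration.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Stand-in for a generic main class; every name in this block is hypothetical.
abstract class SketchMain {
    // Registered option name -> current value (default until overridden).
    protected final Map<String, String> options = new LinkedHashMap<>();

    /** Hook: subclasses register their own options (with defaults) here. */
    protected void addExtraOptions(Map<String, String> opts) { }

    /** Hook: runs after parsing so subclasses can read their options. */
    protected void handleExtraOptions() { }

    /** Builds the result that depends on the parsed options. */
    protected abstract String getSpace();

    /** Fixed driver: register options, parse, handle options, build the space. */
    final String run(String[] args) {
        addExtraOptions(options);
        for (String arg : args) {
            int eq = arg.indexOf('=');
            if (arg.startsWith("--") && eq > 2) {
                String name = arg.substring(2, eq);
                // Only options that were registered are accepted.
                if (options.containsKey(name)) {
                    options.put(name, arg.substring(eq + 1));
                }
            }
        }
        handleExtraOptions();
        return getSpace();
    }
}

class SketchDependencyMain extends SketchMain {
    private String acceptor;

    @Override
    protected void addExtraOptions(Map<String, String> opts) {
        // Register the option together with its default value.
        opts.put("pathAcceptor", "UniversalPathAcceptor");
    }

    @Override
    protected void handleExtraOptions() {
        // Read the parsed (or default) value before the space is built.
        acceptor = options.get("pathAcceptor");
    }

    @Override
    protected String getSpace() {
        return "space using " + acceptor;
    }
}
```

Keeping the driver `final` and exposing only the hooks means a subclass cannot change the order of the steps, only what happens at each one.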
protected void postProcessing()
Allows subclasses to interact with the SemanticSpace after the space has finished processing all of the text.
Overrides: postProcessing in class GenericMain
protected DependencyPathWeight getWeighter()
protected DependencyPathAcceptor getAcceptor()
protected DependencyContextGenerator getContextGenerator()
protected ContextExtractor getExtractor()
Returns a ContextExtractor, which will be responsible for creating context vectors for documents.
Overrides: getExtractor in class GenericWordsiMain
protected SemanticSpaceIO.SSpaceFormat getSpaceFormat()
Returns the format in which the finished SemanticSpace should be saved. Subclasses should override this method if they want to specify the format best suited to their space when the user has not specified one manually.
Overrides: getSpaceFormat in class GenericMain
protected void addFileIterators(Collection<Iterator<Document>> docIters, String[] fileNames) throws IOException
Throws UnsupportedOperationException.
Overrides: addFileIterators in class GenericMain
Throws: IOException
protected void addDocIterators(Collection<Iterator<Document>> docIters, String[] fileNames) throws IOException
Adds DependencyFileDocumentIterators for each file name provided.
Overrides: addDocIterators in class GenericMain
Throws: IOException
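The last two methods show a subclass accepting one kind of input and rejecting another. The sketch below illustrates that shape with stand-in types: one override adds an iterator per file name, the other refuses by throwing UnsupportedOperationException. `SketchBaseMain`, `SketchDependencyInputMain`, and the use of `String` in place of Document are hypothetical, and the placeholder strings do not reflect how the real iterators read files.

```java
import java.util.Arrays;
import java.util.Collection;
import java.util.Iterator;

// Stand-in base class; the real GenericMain and Document types are not reproduced.
class SketchBaseMain {
    /** Placeholder: treats each name as a file that lists other documents. */
    protected void addFileIterators(Collection<Iterator<String>> docIters,
                                    String[] fileNames) {
        for (String name : fileNames) {
            docIters.add(Arrays.asList("listed in " + name).iterator());
        }
    }

    /** Placeholder: treats each name as a file that holds documents directly. */
    protected void addDocIterators(Collection<Iterator<String>> docIters,
                                   String[] fileNames) {
        for (String name : fileNames) {
            docIters.add(Arrays.asList("document in " + name).iterator());
        }
    }
}

class SketchDependencyInputMain extends SketchBaseMain {
    @Override
    protected void addFileIterators(Collection<Iterator<String>> docIters,
                                    String[] fileNames) {
        // This input mode is rejected outright; nothing is added to docIters.
        throw new UnsupportedOperationException(
            "file-list input is not supported by this sketch");
    }

    @Override
    protected void addDocIterators(Collection<Iterator<String>> docIters,
                                   String[] fileNames) {
        // One iterator per file name, in the order the names were given.
        for (String name : fileNames) {
            docIters.add(Arrays.asList("parsed sentence from " + name).iterator());
        }
    }
}
```

Throwing from the override makes an unsupported input mode fail loudly at the point of use instead of silently producing an empty corpus.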
Copyright © 2012. All Rights Reserved.