java.lang.Object
  gov.llnl.ontology.wordnet.WordNetCorpusReader
public class WordNetCorpusReader
This class acts as the central interface for the WordNet dictionary. It
begins its initialization by reading all of the dictionary information into
RAM and then generates a complete graph, connecting all Synsets via
the specified relations. The dictionary graph can be modified at runtime
and later saved to disk in the same format as the original WordNet database,
allowing other interfaces to access the modified form of WordNet.
Field Summary | |
---|---|
static String[] |
FILE_EXTENSIONS
The file extensions for each of the data and index files in the WordNet dictionary. |
static Map<String,Synset.PartsOfSpeech> |
POS_MAP
A simple mapping from part of speech characters to their respective PartsOfSpeech enumerations. |
static String[] |
POS_TAGS
The set of part of speech tags. |
Method Summary | |
---|---|
void |
addSynset(Synset synset)
Adds synset to the OntologyReader. |
void |
addSynset(Synset synset, int index)
Adds synset to the OntologyReader. |
Set<Synset> |
allSynsets()
Returns a Set of all Synsets maintained by this OntologyReader. |
Set<Synset> |
allSynsets(Synset.PartsOfSpeech pos)
Returns a Set of all Synsets for the given Synset.PartsOfSpeech maintained by this OntologyReader. |
int |
getMaxDepth(Synset.PartsOfSpeech pos)
Returns the maximum depth of any Synset chain in this OntologyReader. |
Synset |
getSynset(String fullSynsetName)
Returns the Synset specified by the full synset name. |
Synset |
getSynset(String lemma, Synset.PartsOfSpeech pos, int senseNum)
Returns the single Synset specified by the given lemma name, part
of speech tag, and sense number. |
Synset[] |
getSynsets(String lemma)
Returns all Synsets that match the given lemma name. |
Synset[] |
getSynsets(String lemma, Synset.PartsOfSpeech pos)
Returns all Synsets that match the given lemma name and part of
speech. |
Synset[] |
getSynsets(String lemma, Synset.PartsOfSpeech pos, boolean useMorphy)
Returns all Synsets that match the given lemma name and part of
speech. |
static WordNetCorpusReader |
getWordNet()
Returns the initialized instance of the WordNetCorpusReader. |
static WordNetCorpusReader |
initialize(String dictPath)
Returns a singleton instance of the WordNetCorpusReader. |
static WordNetCorpusReader |
initialize(String dictPath, boolean readFromJar)
Returns a singleton instance of the WordNetCorpusReader. |
Iterator<String> |
morphy(String form)
Returns an Iterator over the possible morphological variations of
the given word form for all Synset.PartsOfSpeech. |
Iterator<String> |
morphy(String form, Synset.PartsOfSpeech pos)
Returns an Iterator over the possible morphological variations of
the given word form for a given Synset.PartsOfSpeech. |
void |
removeSynset(Synset synset)
Removes synset from the OntologyReader. |
void |
replaceSynset(Synset synset, Synset replacement)
Removes the Synset from the known hierarchy. |
Set<String> |
wordnetTerms()
Returns a Set of lemmas that serve as keys in this OntologyReader. |
Set<String> |
wordnetTerms(Synset.PartsOfSpeech pos)
Returns a Set of lemmas that the current word net instance is
aware of for a particular Synset.PartsOfSpeech. |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
---|
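public static final String[] POS_TAGS
The set of part of speech tags.
public static final Map<String,Synset.PartsOfSpeech> POS_MAP
A simple mapping from part of speech characters to their respective
PartsOfSpeech enumerations.
public static final String[] FILE_EXTENSIONS
The file extensions for each of the data and index files in the WordNet
dictionary.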
Method Detail |
---|
public Iterator<String> morphy(String form)
Returns an Iterator over the possible morphological variations of
the given word form for all Synset.PartsOfSpeech. For each part
of speech, if there are any known exceptions for the form, they will be
returned before the part of speech specific replacement rules. For
example, if "geese" is given, "goose" will be returned first.
Afterward, no other variations would be returned. If "explodes" is
given, the variants would be "explode", "explode", and "explod", based on
the rules specified in MORPHOLOGICAL_SUBSTITUTIONS.
Specified by: morphy in interface OntologyReader
public Iterator<String> morphy(String form, Synset.PartsOfSpeech pos)
Returns an Iterator over the possible morphological variations of
the given word form for a given Synset.PartsOfSpeech. If there
are any known exceptions for the form, they will be returned before the
part of speech specific replacement rules. For example, if "geese" is
given, "goose" will be returned first. Afterward, no other variations
would be returned. If "explodes" is given, the variants would be
"explode", "explode", and "explod", based on the rules specified in
MORPHOLOGICAL_SUBSTITUTIONS.
Specified by: morphy in interface OntologyReader
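The ordering described above (known exceptions first, then suffix replacement rules) can be sketched as follows. This is a minimal illustration, not the WordNetCorpusReader implementation: the class name MorphySketch, the single-entry exception table, and the rule list are assumptions, with the rules modeled on WordNet's usual verb detachment rules.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Illustrative only: a hypothetical stand-in for the morphy lookup order. */
final class MorphySketch {
    // Hypothetical exception table (irregular forms); the real reader loads
    // its exceptions from the dictionary files.
    private static final Map<String, List<String>> EXCEPTIONS = new HashMap<>();

    // Assumed {suffix, replacement} pairs, tried in order.
    private static final String[][] RULES = {
        {"s", ""}, {"ies", "y"}, {"es", "e"}, {"es", ""},
        {"ed", "e"}, {"ed", ""}, {"ing", "e"}, {"ing", ""}
    };

    static {
        EXCEPTIONS.put("geese", Arrays.asList("goose"));
    }

    /** Returns exception forms first, followed by every rule-based variant. */
    static List<String> variants(String form) {
        List<String> result = new ArrayList<>();
        List<String> exceptions = EXCEPTIONS.get(form);
        if (exceptions != null) {
            result.addAll(exceptions);
        }
        for (String[] rule : RULES) {
            if (form.endsWith(rule[0])) {
                String stem = form.substring(0, form.length() - rule[0].length());
                result.add(stem + rule[1]);
            }
        }
        return result;
    }
}
```

With these assumed rules, "explodes" matches "s", then "es" twice, which is why "explode" appears twice before "explod" in the documented output. "geese" matches no suffix rule, so only the exception form is produced.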
public Set<Synset> allSynsets()
Returns a Set of all Synsets maintained by this OntologyReader.
Specified by: allSynsets in interface OntologyReader
public Set<Synset> allSynsets(Synset.PartsOfSpeech pos)
Returns a Set of all Synsets for the given Synset.PartsOfSpeech
maintained by this OntologyReader.
Specified by: allSynsets in interface OntologyReader
public void addSynset(Synset synset)
Adds synset to the OntologyReader. A mapping from each Lemma linked to
by synset will be made to synset. synset will be set as the last Synset
for each Lemma mapping.
Specified by: addSynset in interface OntologyReader
public void addSynset(Synset synset, int index)
Adds synset to the OntologyReader. A mapping from each Lemma linked to
by synset will be made to synset. synset will be set at index index
for each Lemma mapping, or as the last entry if index is too large for
any particular Lemma mapping.
Specified by: addSynset in interface OntologyReader
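The index handling described for addSynset can be sketched as below. This is a minimal sketch under stated assumptions: LemmaIndexSketch is a hypothetical class, synsets are represented by plain name strings rather than Synset objects, and index is assumed to be non-negative.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Illustrative only: a hypothetical lemma-to-synset index. */
final class LemmaIndexSketch {
    // Each lemma maps to an ordered list of synset names (sense order).
    private final Map<String, List<String>> lemmaToSynsets = new HashMap<>();

    /** Appends the synset as the last entry for every lemma. */
    void add(String synsetName, List<String> lemmas) {
        add(synsetName, lemmas, Integer.MAX_VALUE);
    }

    /**
     * Inserts the synset at the given index for every lemma, or appends it
     * when the index is larger than that lemma's current list.
     */
    void add(String synsetName, List<String> lemmas, int index) {
        for (String lemma : lemmas) {
            List<String> senses =
                lemmaToSynsets.computeIfAbsent(lemma, k -> new ArrayList<>());
            int position = Math.min(index, senses.size());
            senses.add(position, synsetName);
        }
    }

    /** Returns the ordered synset names for a lemma (empty if unknown). */
    List<String> senses(String lemma) {
        List<String> senses = lemmaToSynsets.get(lemma);
        return senses == null
            ? Collections.<String>emptyList()
            : Collections.unmodifiableList(senses);
    }
}
```

Clamping per lemma matters because one synset can be linked to lemmas whose lists have different lengths, so a single index can be valid for one lemma and too large for another.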
public void removeSynset(Synset synset)
Removes synset from the OntologyReader. The mapping to synset from
each Lemma linked to by synset will be removed.
Specified by: removeSynset in interface OntologyReader
public void replaceSynset(Synset synset, Synset replacement)
Removes the Synset from the known hierarchy. All mappings from
lemmas to this Synset will be removed, along with any stored
details about this particular Synset.
Specified by: replaceSynset in interface OntologyReader
public Set<String> wordnetTerms()
Returns a Set of lemmas that serve as keys in this OntologyReader.
Specified by: wordnetTerms in interface OntologyReader
public Set<String> wordnetTerms(Synset.PartsOfSpeech pos)
Returns a Set of lemmas that the current word net instance is
aware of for a particular Synset.PartsOfSpeech.
Specified by: wordnetTerms in interface OntologyReader
public static WordNetCorpusReader initialize(String dictPath)
Returns a singleton instance of the WordNetCorpusReader. If the
reader has not already been created, it will be initialized. This method
assumes that dictPath does not correspond to a jar internal path.
public static WordNetCorpusReader initialize(String dictPath, boolean readFromJar)
Returns a singleton instance of the WordNetCorpusReader. If the
reader has not already been created, it will be initialized. If
readFromJar is true, the reader will treat dictPath as a path within
the current jar running this code and read the dictionary files from the
jar. In these cases, dictPath should start with "/". A common
argument for dictPath is "/dict", which assumes that the
directory dict contains all the WordNet dictionary files and is at the
base directory of the jar.
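The difference between the two loading modes can be sketched with standard Java I/O. This is an assumption about how such a reader might open its files, not the class's actual code: DictOpenSketch and its open method are hypothetical names. The leading "/" requirement is consistent with Class.getResourceAsStream, which treats a name starting with "/" as an absolute classpath location.

```java
import java.io.File;
import java.io.FileInputStream;
import java.io.FileNotFoundException;
import java.io.IOException;
import java.io.InputStream;

/** Illustrative only: hypothetical helper for opening a dictionary file. */
final class DictOpenSketch {
    /**
     * Opens fileName under dictPath, either as a classpath resource
     * (for example inside the running jar) or from the file system.
     */
    static InputStream open(String dictPath, String fileName, boolean readFromJar)
            throws IOException {
        if (readFromJar) {
            // A leading "/" makes the lookup absolute, e.g. "/dict/data.noun".
            String resource = dictPath + "/" + fileName;
            InputStream in = DictOpenSketch.class.getResourceAsStream(resource);
            if (in == null) {
                throw new FileNotFoundException("No classpath resource " + resource);
            }
            return in;
        }
        return new FileInputStream(new File(dictPath, fileName));
    }
}
```

Under these assumptions, a call such as open("/dict", "data.noun", true) would look for data.noun in a dict directory at the root of the jar.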
public static WordNetCorpusReader getWordNet()
Returns the initialized instance of the WordNetCorpusReader.
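The initialize and getWordNet methods follow a lazily created singleton pattern, which can be sketched as below. ReaderSingletonSketch and getReader are hypothetical names. Returning null before initialization and ignoring the arguments of later initialize calls are assumptions, since the documentation does not specify either case.

```java
/** Illustrative only: a hypothetical lazily initialized singleton. */
final class ReaderSingletonSketch {
    private static ReaderSingletonSketch instance;

    final String dictPath;
    final boolean readFromJar;

    private ReaderSingletonSketch(String dictPath, boolean readFromJar) {
        this.dictPath = dictPath;
        this.readFromJar = readFromJar;
        // A real reader would load the dictionary files here.
    }

    /** Creates the reader on first use, assuming a file system path. */
    static synchronized ReaderSingletonSketch initialize(String dictPath) {
        return initialize(dictPath, false);
    }

    /** Creates the reader on first use; later calls return the same instance. */
    static synchronized ReaderSingletonSketch initialize(String dictPath,
                                                         boolean readFromJar) {
        if (instance == null) {
            instance = new ReaderSingletonSketch(dictPath, readFromJar);
        }
        return instance;
    }

    /** Returns the existing instance, or null if none was created (assumed). */
    static synchronized ReaderSingletonSketch getReader() {
        return instance;
    }
}
```

In this sketch, only the first initialize call decides which dictionary is loaded, so callers that need a specific dictPath should initialize before any other code does.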
public Synset[] getSynsets(String lemma)
Returns all Synsets that match the given lemma name.
Specified by: getSynsets in interface OntologyReader
public Synset[] getSynsets(String lemma, Synset.PartsOfSpeech pos)
Returns all Synsets that match the given lemma name and part of
speech. If there is no known mapping for the given word, the Synsets
for all its part of speech specific morphological variations
will be returned.
Specified by: getSynsets in interface OntologyReader
public Synset[] getSynsets(String lemma, Synset.PartsOfSpeech pos, boolean useMorphy)
Returns all Synsets that match the given lemma name and part of
speech. If there is no known mapping for the given word and useMorphy
is true, the Synsets for all its part of speech specific morphological
variations will be returned.
Specified by: getSynsets in interface OntologyReader
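The fallback behavior of getSynsets can be sketched as a direct lookup followed by an optional pass over morphological variants. This is a hypothetical illustration: LookupSketch, the string-based index, and the morphy function parameter are assumptions, and removing duplicate results is a choice made for this sketch rather than documented behavior.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.function.Function;

/** Illustrative only: hypothetical lookup with a morphy fallback. */
final class LookupSketch {
    /**
     * Returns the synset names mapped to lemma. When nothing is mapped and
     * useMorphy is true, returns the synsets of its morphological variants.
     */
    static List<String> getSynsets(Map<String, List<String>> index,
                                   Function<String, List<String>> morphy,
                                   String lemma,
                                   boolean useMorphy) {
        List<String> direct = index.get(lemma);
        if (direct != null && !direct.isEmpty()) {
            return new ArrayList<>(direct);
        }
        if (!useMorphy) {
            return Collections.emptyList();
        }
        // Variants can repeat (for example "explode" twice), so keep order
        // while dropping duplicates.
        Set<String> found = new LinkedHashSet<>();
        for (String variant : morphy.apply(lemma)) {
            List<String> senses = index.get(variant);
            if (senses != null) {
                found.addAll(senses);
            }
        }
        return new ArrayList<>(found);
    }
}
```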
public Synset getSynset(String fullSynsetName)
Returns the Synset specified by the full synset name. The name
should be of the following format: lemma.pos.senseNum
Specified by: getSynset in interface OntologyReader
public Synset getSynset(String lemma, Synset.PartsOfSpeech pos, int senseNum)
Returns the single Synset specified by the given lemma name, part
of speech tag, and sense number. Sense numbers start at 1.
Specified by: getSynset in interface OntologyReader
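A full synset name in the lemma.pos.senseNum format can be split into the three arguments of the longer getSynset overload. The sketch below is one hypothetical way to do that: SynsetNameSketch is not part of the library, the example names are made up, and splitting from the right (so that a lemma may itself contain dots) is an assumption.

```java
/** Illustrative only: hypothetical parser for lemma.pos.senseNum names. */
final class SynsetNameSketch {
    final String lemma;
    final String pos;
    final int senseNum;

    private SynsetNameSketch(String lemma, String pos, int senseNum) {
        this.lemma = lemma;
        this.pos = pos;
        this.senseNum = senseNum;
    }

    /** Splits on the last two dots so the lemma part may contain dots. */
    static SynsetNameSketch parse(String fullSynsetName) {
        int senseDot = fullSynsetName.lastIndexOf('.');
        int posDot = senseDot > 0
            ? fullSynsetName.lastIndexOf('.', senseDot - 1)
            : -1;
        if (posDot <= 0) {
            throw new IllegalArgumentException(
                "Expected lemma.pos.senseNum: " + fullSynsetName);
        }
        String lemma = fullSynsetName.substring(0, posDot);
        String pos = fullSynsetName.substring(posDot + 1, senseDot);
        // parseInt rejects a non-numeric sense with NumberFormatException.
        int senseNum = Integer.parseInt(fullSynsetName.substring(senseDot + 1));
        if (senseNum < 1) {
            throw new IllegalArgumentException(
                "Sense numbers start at 1: " + fullSynsetName);
        }
        return new SynsetNameSketch(lemma, pos, senseNum);
    }
}
```

For instance, parsing the made-up name "cat.n.1" would yield the lemma "cat", the part of speech tag "n", and sense number 1.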
public int getMaxDepth(Synset.PartsOfSpeech pos)
Returns the maximum depth of any Synset chain in this OntologyReader.
Specified by: getMaxDepth in interface OntologyReader
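One way to read "maximum depth of any Synset chain" is the length of the longest path from a synset up to a root of the hierarchy. The sketch below computes that on a small, hypothetical parent map. DepthSketch, the string node names, and the choice to count edges (so a root has depth 0) are all assumptions, and the graph is assumed to be acyclic.

```java
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Illustrative only: longest chain length in a hypothetical parent graph. */
final class DepthSketch {
    /** Returns the largest number of edges from any node up to a root. */
    static int maxDepth(Map<String, List<String>> parents) {
        Map<String, Integer> memo = new HashMap<>();
        int best = 0;
        for (String node : parents.keySet()) {
            best = Math.max(best, depth(node, parents, memo));
        }
        return best;
    }

    // Memoized depth-first search; nodes without parents have depth 0.
    private static int depth(String node,
                             Map<String, List<String>> parents,
                             Map<String, Integer> memo) {
        Integer cached = memo.get(node);
        if (cached != null) {
            return cached;
        }
        int best = 0;
        List<String> ups = parents.getOrDefault(node, Collections.<String>emptyList());
        for (String parent : ups) {
            best = Math.max(best, 1 + depth(parent, parents, memo));
        }
        memo.put(node, best);
        return best;
    }
}
```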