gov.llnl.ontology.text
Interface Document

All Known Implementing Classes:
DynamicDocument, NYTCorpusDocument, SimpleDocument

public interface Document

An interface for representing a document.

Author:
Keith Stevens

Method Summary
 Set<String> categories()
          Returns the set of categories that this document has, if any.
 long id()
          Returns a unique identifier for this document.
 String key()
          Returns a string name of this document.
 String originalText()
          Returns the original, uncleaned text.
 String rawText()
          Returns the raw text of the corpus.
 String sourceCorpus()
          Returns the name of the source corpus.
 String title()
          Returns the title of this document, if any exists.
 

Method Detail

sourceCorpus

String sourceCorpus()
Returns the name of the source corpus.


rawText

String rawText()
Returns the raw text of the corpus.


originalText

String originalText()
Returns the original, uncleaned text.


key

String key()
Returns a string name of this document.


id

long id()
Returns a unique identifier for this document.


title

String title()
Returns the title of this document, if any exists.


categories

Set<String> categories()
Returns the set of categories that this document has, if any.



Copyright © 2010-2011. All Rights Reserved.