gov.llnl.ontology.text
Interface DocumentReader

All Known Implementing Classes:
NYTDocumentReader, PubMedDocumentReader, SemEval2010TestDocumentReader, SemEval2010TrainDocumentReader, SenseEval2007DocumentReader, UkWacDocumentReader, WackypediaDocumentReader

public interface DocumentReader

A DocumentReader transforms a string of text into a raw Document. The original text may contain XML or other formatting information; the cleaned version contains only the text of interest.
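
As an illustration of the contract described above, the following is a minimal sketch of a reader that strips XML-style tags and keeps only the text. It is not taken from this library. SimpleDocument and SimpleDocumentReader are hypothetical stand-ins for Document and DocumentReader (the real Document accessors are not shown on this page), and TagStrippingReader, its "unknown" default corpus name, and the regex-based cleaning are assumptions made for the example.

```java
// Hypothetical stand-in for the library's Document type. The real interface
// is not shown on this page and may expose different accessors.
interface SimpleDocument {
    String corpusName();
    String rawText();
}

// Hypothetical stand-in mirroring the two readDocument signatures
// listed in the Method Summary.
interface SimpleDocumentReader {
    SimpleDocument readDocument(String doc);
    SimpleDocument readDocument(String doc, String corpusName);
}

// Illustrative reader: removes XML-style tags and collapses whitespace.
// A real corpus reader would likely use a proper parser instead of regexes.
class TagStrippingReader implements SimpleDocumentReader {
    // Assumed default; the library's actual default corpus name is not documented here.
    static final String DEFAULT_CORPUS = "unknown";

    @Override
    public SimpleDocument readDocument(String doc) {
        return readDocument(doc, DEFAULT_CORPUS);
    }

    @Override
    public SimpleDocument readDocument(String doc, String corpusName) {
        // Replace each tag with a space so adjacent words do not fuse together.
        String withoutTags = doc.replaceAll("<[^>]*>", " ");
        // Collapse runs of whitespace and trim the ends.
        final String cleaned = withoutTags.replaceAll("\\s+", " ").trim();
        final String corpus = corpusName;
        return new SimpleDocument() {
            @Override public String corpusName() { return corpus; }
            @Override public String rawText() { return cleaned; }
        };
    }
}

class TagStrippingReaderDemo {
    public static void main(String[] args) {
        SimpleDocumentReader reader = new TagStrippingReader();
        SimpleDocument d = reader.readDocument(
            "<doc><p>Hello  <b>world</b></p></doc>", "demo");
        System.out.println(d.corpusName() + ": " + d.rawText());
    }
}
```

In this sketch the one-argument overload delegates to the two-argument one, so both methods share a single cleaning path. Whether the implementing classes listed above follow the same pattern is not stated on this page.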

Author:
Keith Stevens

Method Summary
 Document readDocument(String doc)
          Returns a Document represented by the given string.
 Document readDocument(String doc, String corpusName)
          Returns a Document represented by the given string, using corpusName as the corpus name of the returned Document.
 

Method Detail

readDocument

Document readDocument(String doc)
Returns a Document represented by the given string.


readDocument

Document readDocument(String doc,
                      String corpusName)
Returns a Document represented by the given string, using corpusName as the corpus name of the returned Document.



Copyright © 2010-2011. All Rights Reserved.