org.apache.tika.parser.microsoft.ooxml
Class POIXMLTextExtractorDecorator
java.lang.Object
org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor
org.apache.tika.parser.microsoft.ooxml.POIXMLTextExtractorDecorator
- All Implemented Interfaces:
- OOXMLExtractor
public class POIXMLTextExtractorDecorator
- extends AbstractOOXMLExtractor
|
Method Summary |
protected void |
buildXHTML(org.apache.tika.sax.XHTMLContentHandler xhtml)
Populates the XHTMLContentHandler object received as parameter. |
protected List<org.apache.poi.openxml4j.opc.PackagePart> |
getMainDocumentParts()
Return a list of the main parts of the document, used
when searching for embedded resources. |
| Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
POIXMLTextExtractorDecorator
public POIXMLTextExtractorDecorator(org.apache.tika.parser.ParseContext context,
org.apache.poi.POIXMLTextExtractor extractor)
buildXHTML
protected void buildXHTML(org.apache.tika.sax.XHTMLContentHandler xhtml)
throws SAXException
- Description copied from class:
AbstractOOXMLExtractor
- Populates the
XHTMLContentHandler object received as parameter.
- Specified by:
buildXHTML in class AbstractOOXMLExtractor
- Throws:
SAXException
getMainDocumentParts
protected List<org.apache.poi.openxml4j.opc.PackagePart> getMainDocumentParts()
- Description copied from class:
AbstractOOXMLExtractor
- Return a list of the main parts of the document, used
when searching for embedded resources.
This should be all the parts of the document that end
up with things embedded into them.
- Specified by:
getMainDocumentParts in class AbstractOOXMLExtractor
Copyright © 2007-2012 The Apache Software Foundation. All Rights Reserved.