org.apache.tika.parser.microsoft
Class OfficeParser
java.lang.Object
org.apache.tika.parser.AbstractParser
org.apache.tika.parser.microsoft.OfficeParser
- All Implemented Interfaces:
- Serializable, org.apache.tika.parser.Parser
public class OfficeParser
- extends org.apache.tika.parser.AbstractParser
Defines a Microsoft document content extractor.
- See Also:
- Serialized Form
|
Method Summary |
Set<org.apache.tika.mime.MediaType> |
getSupportedTypes(org.apache.tika.parser.ParseContext context)
|
protected void |
parse(org.apache.poi.poifs.filesystem.DirectoryNode root,
org.apache.tika.parser.ParseContext context,
org.apache.tika.metadata.Metadata metadata,
org.apache.tika.sax.XHTMLContentHandler xhtml)
|
void |
parse(InputStream stream,
ContentHandler handler,
org.apache.tika.metadata.Metadata metadata,
org.apache.tika.parser.ParseContext context)
Extracts properties and text from an MS Document input stream |
| Methods inherited from class org.apache.tika.parser.AbstractParser |
parse |
| Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
OfficeParser
public OfficeParser()
getSupportedTypes
public Set<org.apache.tika.mime.MediaType> getSupportedTypes(org.apache.tika.parser.ParseContext context)
parse
public void parse(InputStream stream,
ContentHandler handler,
org.apache.tika.metadata.Metadata metadata,
org.apache.tika.parser.ParseContext context)
throws IOException,
SAXException,
org.apache.tika.exception.TikaException
- Extracts properties and text from an MS Document input stream
- Throws:
IOException
SAXException
org.apache.tika.exception.TikaException
parse
protected void parse(org.apache.poi.poifs.filesystem.DirectoryNode root,
org.apache.tika.parser.ParseContext context,
org.apache.tika.metadata.Metadata metadata,
org.apache.tika.sax.XHTMLContentHandler xhtml)
throws IOException,
SAXException,
org.apache.tika.exception.TikaException
- Throws:
IOException
SAXException
org.apache.tika.exception.TikaException
Copyright © 2007-2012 The Apache Software Foundation. All Rights Reserved.