org.apache.tika.parser.microsoft.ooxml
Class OOXMLParser
java.lang.Object
org.apache.tika.parser.AbstractParser
org.apache.tika.parser.microsoft.ooxml.OOXMLParser
- All Implemented Interfaces:
- Serializable, org.apache.tika.parser.Parser
public class OOXMLParser
- extends org.apache.tika.parser.AbstractParser
Office Open XML (OOXML) parser.
- See Also:
- Serialized Form
|
Field Summary |
protected static Set<org.apache.tika.mime.MediaType> |
SUPPORTED_TYPES
|
protected static Set<org.apache.tika.mime.MediaType> |
UNSUPPORTED_OOXML_TYPES
We claim to support all OOXML files, but we actually don't support a small
number of them. |
|
Method Summary |
Set<org.apache.tika.mime.MediaType> |
getSupportedTypes(org.apache.tika.parser.ParseContext context)
|
void |
parse(InputStream stream,
ContentHandler handler,
org.apache.tika.metadata.Metadata metadata,
org.apache.tika.parser.ParseContext context)
|
| Methods inherited from class org.apache.tika.parser.AbstractParser |
parse |
| Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
SUPPORTED_TYPES
protected static final Set<org.apache.tika.mime.MediaType> SUPPORTED_TYPES
UNSUPPORTED_OOXML_TYPES
protected static final Set<org.apache.tika.mime.MediaType> UNSUPPORTED_OOXML_TYPES
- We claim to support all OOXML files, but we actually don't support a small
number of them.
This list is used to decline certain formats that are not yet supported
by Tika and/or POI.
OOXMLParser
public OOXMLParser()
getSupportedTypes
public Set<org.apache.tika.mime.MediaType> getSupportedTypes(org.apache.tika.parser.ParseContext context)
parse
public void parse(InputStream stream,
ContentHandler handler,
org.apache.tika.metadata.Metadata metadata,
org.apache.tika.parser.ParseContext context)
throws IOException,
SAXException,
org.apache.tika.exception.TikaException
- Throws:
IOException
SAXException
org.apache.tika.exception.TikaException
Copyright © 2007-2013 The Apache Software Foundation. All Rights Reserved.