org.apache.tika.parser.microsoft
Class OfficeParser
java.lang.Object
org.apache.tika.parser.microsoft.OfficeParser
- All Implemented Interfaces:
- java.io.Serializable, Parser
public class OfficeParser
- extends java.lang.Object
- implements Parser
Defines a Microsoft document content extractor.
- See Also:
- Serialized Form
|
Method Summary |
java.util.Set<MediaType> |
getSupportedTypes(ParseContext context)
|
void |
parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata)
Deprecated. This method will be removed in Apache Tika 1.0. |
void |
parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata,
ParseContext context)
Extracts properties and text from an MS Document input stream |
| Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
OfficeParser
public OfficeParser()
getSupportedTypes
public java.util.Set<MediaType> getSupportedTypes(ParseContext context)
- Specified by:
getSupportedTypes in interface Parser
parse
public void parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata,
ParseContext context)
throws java.io.IOException,
org.xml.sax.SAXException,
TikaException
- Extracts properties and text from an MS Document input stream
- Specified by:
parse in interface Parser
- Throws:
java.io.IOException
org.xml.sax.SAXException
TikaException
parse
public void parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata)
throws java.io.IOException,
org.xml.sax.SAXException,
TikaException
- Deprecated. This method will be removed in Apache Tika 1.0.
- Specified by:
parse in interface Parser
- Throws:
java.io.IOException
org.xml.sax.SAXException
TikaException
Copyright © 2007-2011 The Apache Software Foundation. All Rights Reserved.