org.apache.tika.parser.epub
Class EpubContentParser
java.lang.Object
org.apache.tika.parser.epub.EpubContentParser
- All Implemented Interfaces:
- java.io.Serializable, Parser
public class EpubContentParser
- extends java.lang.Object
- implements Parser
Parser for EPUB OPS *.html files.
For the time being, assume XHTML (TODO: DTBook)
- See Also:
- Serialized Form
| Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
EpubContentParser
public EpubContentParser()
getSupportedTypes
public java.util.Set<MediaType> getSupportedTypes(ParseContext context)
- Specified by:
getSupportedTypes in interface Parser
parse
public void parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata,
ParseContext context)
throws java.io.IOException,
org.xml.sax.SAXException,
TikaException
- Specified by:
parse in interface Parser
- Throws:
java.io.IOException
org.xml.sax.SAXException
TikaException
parse
public void parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata)
throws java.io.IOException,
org.xml.sax.SAXException,
TikaException
- Deprecated. This method will be removed in Apache Tika 1.0.
- Specified by:
parse in interface Parser
- Throws:
java.io.IOException
org.xml.sax.SAXException
TikaException
Copyright © 2007-2011 The Apache Software Foundation. All Rights Reserved.