org.apache.tika.parser.epub
Class EpubContentParser

java.lang.Object
  extended by org.apache.tika.parser.AbstractParser
      extended by org.apache.tika.parser.epub.EpubContentParser
All Implemented Interfaces:
Serializable, org.apache.tika.parser.Parser

public class EpubContentParser
extends org.apache.tika.parser.AbstractParser

Parser for EPUB OPS *.html content files. For the time being, the content is assumed to be XHTML; DTBook is not yet handled (TODO).

See Also:
Serialized Form

Constructor Summary
EpubContentParser()
           
 
Method Summary
 Set<org.apache.tika.mime.MediaType> getSupportedTypes(org.apache.tika.parser.ParseContext context)
           
 void parse(InputStream stream, ContentHandler handler, org.apache.tika.metadata.Metadata metadata, org.apache.tika.parser.ParseContext context)
           
 
Methods inherited from class org.apache.tika.parser.AbstractParser
parse
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

EpubContentParser

public EpubContentParser()
Method Detail

getSupportedTypes

public Set<org.apache.tika.mime.MediaType> getSupportedTypes(org.apache.tika.parser.ParseContext context)

parse

public void parse(InputStream stream,
                  ContentHandler handler,
                  org.apache.tika.metadata.Metadata metadata,
                  org.apache.tika.parser.ParseContext context)
           throws IOException,
                  SAXException,
                  org.apache.tika.exception.TikaException
Throws:
IOException
SAXException
org.apache.tika.exception.TikaException
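
Example (illustrative sketch, not part of the Tika API): parse takes an InputStream and a SAX ContentHandler, so the extracted content presumably reaches the caller as SAX events on that handler. The sketch below shows only the handler side, collecting character data from a small XHTML document. It is driven by the JDK's own SAX parser rather than by EpubContentParser, so that it runs without Tika on the classpath. The names XhtmlTextSketch, TextCollector, and extractText are hypothetical and exist only for this example.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import javax.xml.parsers.ParserConfigurationException;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical helper class; not part of Apache Tika.
final class XhtmlTextSketch {

    // A ContentHandler (via DefaultHandler) that accumulates character data
    // from the SAX events it receives.
    static final class TextCollector extends DefaultHandler {
        private final StringBuilder text = new StringBuilder();

        @Override
        public void characters(char[] ch, int start, int length) {
            text.append(ch, start, length);
        }

        String text() {
            return text.toString();
        }
    }

    // Parses well-formed XHTML from the stream with the JDK SAX parser
    // (a stand-in for EpubContentParser) and returns the collected text.
    static String extractText(InputStream xhtml) {
        try {
            SAXParserFactory factory = SAXParserFactory.newInstance();
            factory.setNamespaceAware(true);
            TextCollector collector = new TextCollector();
            factory.newSAXParser().parse(xhtml, collector);
            return collector.text();
        } catch (ParserConfigurationException | SAXException | IOException e) {
            throw new IllegalStateException("Could not parse XHTML input", e);
        }
    }

    // Convenience overload for in-memory documents.
    static String extractText(String xhtml) {
        return extractText(
                new ByteArrayInputStream(xhtml.getBytes(StandardCharsets.UTF_8)));
    }
}
```

With Tika available, a handler such as TextCollector could instead be passed as the handler argument of parse(stream, handler, metadata, context), together with a Metadata and a ParseContext instance. Exactly which events this class forwards to the handler is not stated on this page, so the collected text may differ from what the JDK parser produces here.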


Copyright © 2007-2011 The Apache Software Foundation. All Rights Reserved.