Class EpubContentParser

  • All Implemented Interfaces:
    Serializable, org.apache.tika.parser.Parser

    public class EpubContentParser
    extends org.apache.tika.parser.AbstractParser
    Parser for EPUB OPS *.html files.

    For the time being, the content is assumed to be XHTML; DTBook content is not yet handled (TODO).

    See Also:
    Serialized Form
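
The XHTML assumption in the class description can be illustrated with a minimal sketch that uses only the JDK's SAX API. This is not the implementation of EpubContentParser and does not use Tika. The class name XhtmlTextSketch and the method extractText are hypothetical names introduced for illustration. The sketch shows why well-formed XHTML is convenient: an ordinary XML parser can read an OPS content document and collect its character data.

```java
import java.io.StringReader;
import javax.xml.parsers.SAXParser;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.InputSource;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical illustration only; not part of EpubContentParser or Tika.
public class XhtmlTextSketch {

    // Parses a well-formed XHTML string and returns its text content,
    // with runs of whitespace collapsed to single spaces.
    static String extractText(String xhtml) throws Exception {
        SAXParserFactory factory = SAXParserFactory.newInstance();
        factory.setNamespaceAware(true);
        SAXParser parser = factory.newSAXParser();

        StringBuilder text = new StringBuilder();
        DefaultHandler handler = new DefaultHandler() {
            @Override
            public void characters(char[] ch, int start, int length) {
                text.append(ch, start, length);
            }

            @Override
            public void endElement(String uri, String localName, String qName) {
                // Separate the text of adjacent elements with a space.
                text.append(' ');
            }

            @Override
            public InputSource resolveEntity(String publicId, String systemId) {
                // Return an empty source so no external DTD is fetched.
                return new InputSource(new StringReader(""));
            }
        };

        parser.parse(new InputSource(new StringReader(xhtml)), handler);
        return text.toString().trim().replaceAll("\\s+", " ");
    }

    public static void main(String[] args) throws Exception {
        String doc = "<html xmlns=\"http://www.w3.org/1999/xhtml\">"
                + "<head><title>Chapter 1</title></head>"
                + "<body><p>Hello</p><p>EPUB world</p></body></html>";
        System.out.println(extractText(doc));
    }
}
```

Input that is not well-formed XML (for example, tag-soup HTML) makes the SAX parser throw an exception, which is the practical consequence of assuming XHTML.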
    • Constructor Detail

      • EpubContentParser

        public EpubContentParser()
    • Method Detail

      • getSupportedTypes

        public Set<org.apache.tika.mime.MediaType> getSupportedTypes(org.apache.tika.parser.ParseContext context)