|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||
java.lang.Objectorg.apache.tika.parser.CompositeParser
public class CompositeParser
Composite parser that delegates parsing tasks to a component parser based on the declared content type of the incoming document. A fallback parser is defined for cases where a parser for the given content type is not available.
| Constructor Summary | |
|---|---|
CompositeParser()
|
|
| Method Summary | |
|---|---|
Parser |
getFallback()
Returns the fallback parser. |
protected Parser |
getParser(Metadata metadata)
Returns the parser that best matches the given metadata. |
java.util.Map<java.lang.String,Parser> |
getParsers()
Returns the component parsers. |
void |
parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata)
Deprecated. This method will be removed in Apache Tika 1.0. |
void |
parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata,
ParseContext context)
Delegates the call to the matching component parser. |
void |
setFallback(Parser fallback)
Sets the fallback parser. |
void |
setParsers(java.util.Map<java.lang.String,Parser> parsers)
Sets the component parsers. |
| Methods inherited from class java.lang.Object |
|---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Constructor Detail |
|---|
public CompositeParser()
| Method Detail |
|---|
public java.util.Map<java.lang.String,Parser> getParsers()
public void setParsers(java.util.Map<java.lang.String,Parser> parsers)
parsers - component parsers, keyed by media typepublic Parser getFallback()
public void setFallback(Parser fallback)
fallback - fallback parserprotected Parser getParser(Metadata metadata)
Subclasses can override this method to provide more accurate parser resolution.
metadata - document metadata
public void parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata,
ParseContext context)
throws java.io.IOException,
org.xml.sax.SAXException,
TikaException
Potential RuntimeExceptions, IOExceptions and
SAXExceptions unrelated to the given input stream and content
handler are automatically wrapped into TikaExceptions to better
honor the Parser contract.
parse in interface Parserstream - the document stream (input)handler - handler for the XHTML SAX events (output)metadata - document metadata (input and output)context - parse context
java.io.IOException - if the document stream could not be read
org.xml.sax.SAXException - if the SAX events could not be processed
TikaException - if the document could not be parsed
public void parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata)
throws java.io.IOException,
org.xml.sax.SAXException,
TikaException
Parser#parse(InputStream, ContentHandler, Metadata, Map) method
instead in new code. Calls to this backwards compatibility method
are forwarded to the new parse() method with an empty parse context.
parse in interface Parserjava.io.IOException
org.xml.sax.SAXException
TikaException
|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||