|
Class Summary |
| AutoDetectParser |
|
| CompositeParser |
Composite parser that delegates parsing tasks to a component parser
based on the declared content type of the incoming document. |
| DefaultParser |
A composite parser based on all the Parser implementations
available through the service provider mechanism. |
| DelegatingParser |
Base class for parser implementations that want to delegate parts of the
task of parsing an input document to another parser. |
| EmptyParser |
Dummy parser that always produces an empty XHTML document without even
attempting to parse the given document stream. |
| ErrorParser |
Dummy parser that always throws a TikaException without even
attempting to parse the given document stream. |
| ExternalParser |
Parser that uses an external program (like catdoc or pdf2txt) to extract
text content from a given document. |
| ParseContext |
Parse context. |
| ParserDecorator |
Decorator base class for the Parser interface. |
| ParserPostProcessor |
Parser decorator that post-processes the results from a decorated parser. |
| ParsingReader |
Reader for the text content from a given binary stream. |