Uses of Interface
org.apache.tika.parser.Parser
Packages that use Parser
Package
Description
Apache Tika.
Tika configuration tools.
Extraction of component documents.
Forked parser.
Tika parsers.
External parser process.
Utilities.
-
Uses of Parser in org.apache.tika
Methods in org.apache.tika that return ParserModifier and TypeMethodDescriptionTika.getParser()Returns the parser instance used by this facade.Constructors in org.apache.tika with parameters of type ParserModifierConstructorDescriptionCreates a Tika facade using the given detector and parser instances, but the default Translator.Tika(Detector detector, Parser parser, Translator translator) Creates a Tika facade using the given detector, parser, and translator instances. -
Uses of Parser in org.apache.tika.config
Methods in org.apache.tika.config that return Parser -
Uses of Parser in org.apache.tika.extractor
Methods in org.apache.tika.extractor that return ParserModifier and TypeMethodDescriptionParsingEmbeddedDocumentExtractor.getDelegatingParser()static ParserEmbeddedDocumentUtil.getStatelessParser(ParseContext context) Utility function to get the Parser that was sent in to the ParseContext to handle embedded documents.static ParserEmbeddedDocumentUtil.tryToFindExistingLeafParser(Class clazz, ParseContext context) Tries to find an existing parser within the ParseContext.Constructors in org.apache.tika.extractor with parameters of type Parser -
Uses of Parser in org.apache.tika.fork
Classes in org.apache.tika.fork that implement ParserConstructors in org.apache.tika.fork with parameters of type Parser -
Uses of Parser in org.apache.tika.parser
Classes in org.apache.tika.parser that implement ParserModifier and TypeClassDescriptionclassAbstract base class for parsers that use the AutoDetectReader and need to use theEncodingDetectorconfigured byTikaConfigclassAbstract base class for parsers that call external processes.classAbstract base class for new parsers.classclassComposite parser that delegates parsing tasks to a component parser based on the declared content type of the incoming document.classDecrypts the incoming document stream and delegates further parsing to another parser instance.classA composite parser based on all theParserimplementations available through theservice provider mechanism.classBase class for parser implementations that want to delegate parts of the task of parsing an input document to another parser.classclassDummy parser that always produces an empty XHTML document without even attempting to parse the given document stream.classDummy parser that always throws aTikaExceptionwithout even attempting to parse the given document stream.classclassDecorator base class for theParserinterface.classParser decorator that post-processes the results from a decorated parser.classThis is a helper class that wraps a parser in a recursive handler.classclassThe RecursiveParserWrapper wraps the parser sent into the parsecontext and then uses that parser to store state (among many other things).Methods in org.apache.tika.parser that return ParserModifier and TypeMethodDescriptionAutoDetectParserFactory.build()abstract ParserParserFactory.build()protected ParserDelegatingParser.getDelegateParser(ParseContext context) Returns the parser instance to which parsing tasks should be delegated.CompositeParser.getFallback()Returns the fallback parser.protected ParserReturns the parser that best matches the given metadata.protected ParserCompositeParser.getParser(Metadata metadata, ParseContext context) ParserDecorator.getWrappedParser()Gets the parser wrapped by this ParserDecoratorstatic final ParserParserDecorator.withFallbacks(Collection<? extends Parser> parsers, Set<MediaType> types) Deprecated.This has been replaced byFallbackParserstatic final ParserParserDecorator.withoutTypes(Parser parser, Set<MediaType> excludeTypes) Decorates the given parser so that it never claims to support parsing of the given media types, but will work for all others.static final ParserDecorates the given parser so that it always claims to support parsing of the given media types.Methods in org.apache.tika.parser that return types with arguments of type ParserModifier and TypeMethodDescriptionCompositeParser.findDuplicateParsers(ParseContext context) Utility method that goes through all the component parsers and finds all media types for which more than one parser declares support.CompositeParser.getAllComponentParsers()Returns all parsers registered with the Composite Parser, including ones which may not currently be active.DefaultParser.getAllComponentParsers()CompositeParser.getParsers()Returns the component parsers.CompositeParser.getParsers(ParseContext context) DefaultParser.getParsers(ParseContext context) Methods in org.apache.tika.parser with parameters of type ParserModifier and TypeMethodDescriptionvoidCompositeParser.setFallback(Parser fallback) Sets the fallback parser.static final ParserParserDecorator.withoutTypes(Parser parser, Set<MediaType> excludeTypes) Decorates the given parser so that it never claims to support parsing of the given media types, but will work for all others.static final ParserDecorates the given parser so that it always claims to support parsing of the given media types.Method parameters in org.apache.tika.parser with type arguments of type ParserModifier and TypeMethodDescriptionvoidCompositeParser.setParsers(Map<MediaType, Parser> parsers) Sets the component parsers.static final ParserParserDecorator.withFallbacks(Collection<? extends Parser> parsers, Set<MediaType> types) Deprecated.This has been replaced byFallbackParserConstructors in org.apache.tika.parser with parameters of type ParserModifierConstructorDescriptionAutoDetectParser(Detector detector, Parser... parsers) AutoDetectParser(Parser... parsers) Creates an auto-detecting parser instance using the specified set of parser.CompositeParser(MediaTypeRegistry registry, Parser... parsers) DigestingParser(Parser parser, DigestingParser.Digester digester, boolean skipContainerDocument) Creates a decorator for the given parser.ParserDecorator(Parser parser) Creates a decorator for the given parser.ParserPostProcessor(Parser parser) Creates a post-processing decorator for the given parser.ParsingReader(Parser parser, InputStream stream, Metadata metadata, ParseContext context) Creates a reader for the text content of the given binary stream with the given document metadata.ParsingReader(Parser parser, InputStream stream, Metadata metadata, ParseContext context, Executor executor) Creates a reader for the text content of the given binary stream with the given document metadata.RecursiveParserWrapper(Parser wrappedParser) Initialize the wrapper withRecursiveParserWrapper.catchEmbeddedExceptionsset totrueas default.RecursiveParserWrapper(Parser wrappedParser, boolean catchEmbeddedExceptions) StatefulParser(Parser parser) Creates a decorator for the given parser.Constructor parameters in org.apache.tika.parser with type arguments of type ParserModifierConstructorDescriptionCompositeParser(MediaTypeRegistry registry, List<Parser> parsers) CompositeParser(MediaTypeRegistry registry, List<Parser> parsers, Collection<Class<? extends Parser>> excludeParsers) CompositeParser(MediaTypeRegistry registry, List<Parser> parsers, Collection<Class<? extends Parser>> excludeParsers) DefaultParser(MediaTypeRegistry registry, ServiceLoader loader, Collection<Class<? extends Parser>> excludeParsers) DefaultParser(MediaTypeRegistry registry, ServiceLoader loader, Collection<Class<? extends Parser>> excludeParsers, EncodingDetector encodingDetector, Renderer renderer) -
Uses of Parser in org.apache.tika.parser.external
Classes in org.apache.tika.parser.external that implement ParserModifier and TypeClassDescriptionclassA Composite Parser that wraps up all the available External Parsers, and provides an easy way to access them.classParser that uses an external program (like catdoc or pdf2txt) to extract text content and metadata from a given document. -
Uses of Parser in org.apache.tika.parser.external2
Classes in org.apache.tika.parser.external2 that implement ParserModifier and TypeClassDescriptionclassThis is a next generation external parser that uses some of the more recent additions to Tika.Methods in org.apache.tika.parser.external2 that return ParserMethods in org.apache.tika.parser.external2 with parameters of type ParserModifier and TypeMethodDescriptionvoidExternalParser.setOutputParser(Parser parser) This parser is called on the output of the process. -
Uses of Parser in org.apache.tika.parser.multiple
Classes in org.apache.tika.parser.multiple that implement ParserModifier and TypeClassDescriptionclassAbstract base class for parser wrappers which may / will process a given stream multiple times, merging the results of the various parsers used.classTries multiple parsers in turn, until one succeeds.classRuns the input stream through all available parsers, merging the metadata from them based on theAbstractMultipleParser.MetadataPolicychosen.Methods in org.apache.tika.parser.multiple that return types with arguments of type ParserMethods in org.apache.tika.parser.multiple with parameters of type ParserModifier and TypeMethodDescriptionprotected abstract booleanAbstractMultipleParser.parserCompleted(Parser parser, Metadata metadata, ContentHandler handler, ParseContext context, Exception exception) Used to notify implementations that a Parser has Finished or Failed, and to allow them to decide to continue or abort further parsingprotected booleanFallbackParser.parserCompleted(Parser parser, Metadata metadata, ContentHandler handler, ParseContext context, Exception exception) protected booleanSupplementingParser.parserCompleted(Parser parser, Metadata metadata, ContentHandler handler, ParseContext context, Exception exception) protected voidAbstractMultipleParser.parserPrepare(Parser parser, Metadata metadata, ParseContext context) Used to allow implementations to prepare or change things before parsing occursConstructors in org.apache.tika.parser.multiple with parameters of type ParserModifierConstructorDescriptionAbstractMultipleParser(MediaTypeRegistry registry, AbstractMultipleParser.MetadataPolicy policy, Parser... parsers) FallbackParser(MediaTypeRegistry registry, AbstractMultipleParser.MetadataPolicy policy, Parser... parsers) SupplementingParser(MediaTypeRegistry registry, AbstractMultipleParser.MetadataPolicy policy, Parser... parsers) Constructor parameters in org.apache.tika.parser.multiple with type arguments of type ParserModifierConstructorDescriptionAbstractMultipleParser(MediaTypeRegistry registry, Collection<? extends Parser> parsers, Map<String, Param> params) AbstractMultipleParser(MediaTypeRegistry registry, AbstractMultipleParser.MetadataPolicy policy, Collection<? extends Parser> parsers) FallbackParser(MediaTypeRegistry registry, Collection<? extends Parser> parsers, Map<String, Param> params) FallbackParser(MediaTypeRegistry registry, AbstractMultipleParser.MetadataPolicy policy, Collection<? extends Parser> parsers) SupplementingParser(MediaTypeRegistry registry, Collection<? extends Parser> parsers, Map<String, Param> params) SupplementingParser(MediaTypeRegistry registry, AbstractMultipleParser.MetadataPolicy policy, Collection<? extends Parser> parsers) -
Uses of Parser in org.apache.tika.utils
Methods in org.apache.tika.utils with parameters of type ParserModifier and TypeMethodDescriptionstatic StringParserUtils.getParserClassname(Parser parser) Identifies the real class name of theParser, unwrapping anyParserDecoratordecorations on top of it.static voidParserUtils.recordParserDetails(Parser parser, Metadata metadata) static voidParserUtils.recordParserFailure(Parser parser, Throwable failure, Metadata metadata)
TikaConfig.getParser()method instead