org.apache.tika.parser.txt
Class TXTParser
java.lang.Object
org.apache.tika.parser.AbstractParser
org.apache.tika.parser.txt.TXTParser
All Implemented Interfaces:
Serializable, org.apache.tika.parser.Parser
public class TXTParser
extends org.apache.tika.parser.AbstractParser
Plain text parser. The text encoding of the document stream is
detected automatically from the byte patterns found at the
beginning of the stream and from the given document metadata, most
notably the charset parameter of an
HttpHeaders.CONTENT_TYPE value.
This parser sets the following output metadata entry:
HttpHeaders.CONTENT_TYPE: text/plain; charset=...
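The two inputs described above (leading byte patterns and the charset parameter of a Content-Type value) can be pictured with a small stand-alone sketch. It uses only the JDK, not Tika, and it is not TXTParser's actual detection algorithm: the class name CharsetHintSketch, its methods, the restriction to byte-order marks, and the choice to let a byte-order mark take precedence over the declared charset are all assumptions made for illustration.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;
import java.util.Locale;

// Hypothetical illustration only; not part of Tika and not TXTParser's algorithm.
public class CharsetHintSketch {

    // Returns the charset indicated by a byte-order mark at the start of
    // the data, or null when no known mark is present.
    // Simplification: FF FE is treated as UTF-16LE (UTF-32LE is ignored).
    static Charset fromByteOrderMark(byte[] head) {
        if (head.length >= 3 && (head[0] & 0xFF) == 0xEF
                && (head[1] & 0xFF) == 0xBB && (head[2] & 0xFF) == 0xBF) {
            return StandardCharsets.UTF_8;
        }
        if (head.length >= 2 && (head[0] & 0xFF) == 0xFE && (head[1] & 0xFF) == 0xFF) {
            return StandardCharsets.UTF_16BE;
        }
        if (head.length >= 2 && (head[0] & 0xFF) == 0xFF && (head[1] & 0xFF) == 0xFE) {
            return StandardCharsets.UTF_16LE;
        }
        return null;
    }

    // Extracts the charset parameter from a value such as
    // "text/plain; charset=ISO-8859-1"; returns null when the parameter is
    // absent or names a charset this JVM does not know.
    static Charset fromContentType(String contentType) {
        if (contentType == null) {
            return null;
        }
        for (String part : contentType.split(";")) {
            String p = part.trim();
            if (p.toLowerCase(Locale.ROOT).startsWith("charset=")) {
                String name = p.substring("charset=".length()).trim().replace("\"", "");
                try {
                    return Charset.forName(name);
                } catch (IllegalArgumentException e) {
                    // Illegal or unsupported charset name: treat as no hint.
                    return null;
                }
            }
        }
        return null;
    }

    // Assumed precedence for this sketch: byte-order mark first, then the
    // declared charset, then the caller's fallback.
    static Charset guess(byte[] head, String contentType, Charset fallback) {
        Charset c = fromByteOrderMark(head);
        if (c == null) {
            c = fromContentType(contentType);
        }
        return c != null ? c : fallback;
    }

    // Builds a Content-Type value in the "text/plain; charset=..." shape.
    static String outputContentType(Charset charset) {
        return "text/plain; charset=" + charset.name();
    }

    public static void main(String[] args) {
        byte[] head = {'h', 'i'};
        Charset c = guess(head, "text/plain; charset=ISO-8859-1", StandardCharsets.US_ASCII);
        System.out.println(outputContentType(c));
    }
}
```

A real detector has to handle text without a byte-order mark as well, which is where statistical inspection of the leading bytes comes in; this sketch deliberately leaves that part out.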
See Also:
Serialized Form
Method Summary
Set<org.apache.tika.mime.MediaType> getSupportedTypes(org.apache.tika.parser.ParseContext context)
void parse(InputStream stream, ContentHandler handler, org.apache.tika.metadata.Metadata metadata, org.apache.tika.parser.ParseContext context)
Methods inherited from class org.apache.tika.parser.AbstractParser:
parse
Methods inherited from class java.lang.Object:
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
TXTParser
public TXTParser()
getSupportedTypes
public Set<org.apache.tika.mime.MediaType> getSupportedTypes(org.apache.tika.parser.ParseContext context)
parse
public void parse(InputStream stream,
ContentHandler handler,
org.apache.tika.metadata.Metadata metadata,
org.apache.tika.parser.ParseContext context)
throws IOException,
SAXException,
org.apache.tika.exception.TikaException
Throws:
IOException
SAXException
org.apache.tika.exception.TikaException
Copyright © 2007-2013 The Apache Software Foundation. All Rights Reserved.