|
||||||||||
| PREV PACKAGE NEXT PACKAGE | FRAMES NO FRAMES | |||||||||
| Interface Summary | |
|---|---|
| HtmlMapper | HTML mapper used to make incoming HTML documents easier to handle by Tika clients. |
| Class Summary | |
|---|---|
| BoilerpipeContentHandler | Uses the boilerpipe library to automatically extract the main content from a web page. |
| DefaultHtmlMapper | The default HTML mapping rules in Tika. |
| HtmlParser | HTML parser. |
| IdentityHtmlMapper | Alternative HTML mapping rules that pass the input HTML as-is without any modifications. |
|
||||||||||
| PREV PACKAGE NEXT PACKAGE | FRAMES NO FRAMES | |||||||||