public class LinkExtractor extends DefaultCallback
| Modifier and Type | Field and Description |
|---|---|
Set<String> |
urls
The URLs resulting from the parsing process.
|
EMPTY_CALLBACK_ARRAY| Constructor and Description |
|---|
LinkExtractor() |
| Modifier and Type | Method and Description |
|---|---|
String |
base()
Returns the URL specified by the
BASE element. |
void |
configure(BulletParser parser)
Configure the parser to parse elements and certain attributes.
|
String |
metaLocation()
Returns the URL specified by
META HTTP-EQUIV elements of location type. |
String |
metaRefresh()
Returns the URL specified by
META HTTP-EQUIV elements of refresh type. |
void |
startDocument()
Receive notification of the beginning of the document.
|
boolean |
startElement(Element element,
Map<Attribute,MutableString> attrMap)
Receive notification of the start of an element.
|
cdata, characters, endDocument, endElement, getInstancepublic void configure(BulletParser parser)
The required attributes are SRC , HREF , HTTP-EQUIV , and CONTENT
.
configure in interface Callbackconfigure in class DefaultCallbackpublic void startDocument()
CallbackThe callback must use this method to reset its internal state so that it can be resued. It must be safe to invoke this method several times.
startDocument in interface CallbackstartDocument in class DefaultCallbackpublic boolean startElement(Element element, Map<Attribute,MutableString> attrMap)
CallbackFor simple elements, this is the only notification that the callback will ever receive.
startElement in interface CallbackstartElement in class DefaultCallbackelement - the element whose opening tag was found.attrMap - a map from Attributes to MutableStrings.public String metaLocation()
META HTTP-EQUIV elements of location type. More
precisely, this method returns a non- null result iff there is at least one META HTTP-EQUIV
element specifying a location URL (if there is more than one, we keep the first one).META HTTP-EQUIV elements of location type, or
null.public String base()
BASE element. More precisely, this method returns a non-
null result iff there is at least one BASE element specifying a derelativisation URL
(if there is more than one, we keep the first one).BASE element, or null.public String metaRefresh()
META HTTP-EQUIV elements of refresh type. More
precisely, this method returns a non- null result iff there is at least one META HTTP-EQUIV
element specifying a refresh URL (if there is more than one, we keep the first one).META HTTP-EQUIV elements of refresh type, or
null.