|
Lightweigth HTML Scanner 2.00 | ||||||
| PREV NEXT | FRAMES NO FRAMES | ||||||
See:
Description
| Packages | |
| be.arci.html | Core package for the Lightweight HTML Scanner. |
| be.arci.pub | In the be.arci.pub package we supply add-on classes and examples for our Java libraries, together with their source code. |
The HTMLScanner scans the document for HTML tags when it's method getTags() is invoked. As parameter to this method the application programmer supplies an array of the tag names he is interested in, or possibly the complete set of possible HTML tag names. A single HTMLScanner object can scan the same HTML document repeatedly for different sets of tag names, in successive calls to getTags().
HTMLScanner.getTags() returns an array of HTMLTag objects. You can regard these HTMLTags as substrings of the HTML document, with an index (ID) into the array of tag names to identify the type of tag (e.g. <IMG> tag or <BODY> tag). Some of these HTMLTag objects will represent HTML text content; they have an ID of 0 (zero). Closing tags (e.g. </BODY> tag) are given the negative of the ID of the opening tag.
The HTMLTag class has 2 methods of interest:
|
Lightweigth HTML Scanner 2.00 | ||||||
| PREV NEXT | FRAMES NO FRAMES | ||||||