HtmlDocument
Last updated at 1:34 pm UTC on 14 February 2003
Subclass of HtmlEntity.
An HtmlDocument object represents the object tree of an HTML document. It is created by the class HtmlParser.
An HtmlDocument object has two sub-entities: an HtmlHead object and an HtmlBody object.
To get all anchors a document contains, define the following method in HtmlDocument:
allAnchors
	"Answer a Set of all HtmlAnchor entities contained in the receiver."
	| a |
	a := Set new.
	self allSubentitiesDo: [ :e |
		e class == HtmlAnchor ifTrue: [a add: e]].
	^a
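
Once the method is compiled, it might be used from a workspace roughly as follows. This is only a sketch: it assumes that allAnchors has been added to HtmlDocument as shown, and that HtmlParser answers an HtmlDocument when sent parse: with a ReadStream on HTML source. The sample markup and the variable names doc and anchors are made up for illustration.

```smalltalk
"Workspace sketch. Assumes allAnchors is defined in HtmlDocument and that
 HtmlParser class>>parse: accepts a ReadStream (check this in your image)."
| doc anchors |
doc := HtmlParser parse: (ReadStream on:
	'<html><head><title>Demo</title></head><body><a href="http://www.squeak.org/">Squeak</a></body></html>').
anchors := doc allAnchors.
"Print each anchor entity found in the document to the Transcript."
anchors do: [ :each | Transcript show: each printString; cr ]
```

Because allAnchors answers a Set, the anchors come back in no particular order. Collecting into an OrderedCollection instead would preserve document order, if allSubentitiesDo: visits entities in that order.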