Configuration
Fine-tune parsing to DOM tree.
LagartoDom configuration is specified in LagartoDomBuilderConfig
class. Among new properties, there is also the instance of the LagartoParser configuration, of the parser that is used internally.
In most cases, you will just use the predefined modes. Here is the list of properties that you can configure.
ignoreWhitespacesBetweenTags
This flag is used for XML mode, to ignore all whitespace content between two starting or two ending tags. Whitespace content between one open and one closed tag is still not ignored.
ignoreComments
This flag simply defines if the resulting DOM tree should contain comments or not.
enabledVoidTags
Flag to enable/disable void tags.
selfCloseVoidTags
When an element is a void element, this flag defines if it can be self-closed or if it should have the standard end tag.
impliedEndTags
Enables rules for implicit end tags. There are a number of tags that do not require the use of a closing tag for valid HTML (body
, li
, dd
, dt
, p
, td
, tr
,...). When this flag is on, these tags are implicitly closed if needed and no error/warning is logged.
This feature somewhat slows down the parsing. If you know that all tags are closed in input HTML, consider switching this feature off, to improve performances.
condCommentIEVersion
The version of conditional comments.
errorLogger & debugLogger
Custom loggers.
Last updated