The main components of the rule expression are blocks. Each block specifies a single extraction query.
The syntax of the block is as follows:
{{TYPE[option1=value;option2=value...]:query}}
A block starts with {{
and ends with }}
. Inside the brackets, it starts with the block type followed by a colon (:
). Optionally, the block can have related settings. The settings are key value pairs inside a square bracket separated by semi-colon (;
). The actual query is specified after the colon.
A block can be one of the following types:
- DOM Block
- XPATH
- JQUERY
- String Block
- REGEX
- LITERAL