| PageBuilder | Allows a TagRule to add information to a Page object. | code | html |
| BodyTagRule | code | html | |
| ContentBlockExtractingRule | code | html | |
| FramesetRule | code | html | |
| HeadExtractingRule | code | html | |
| HtmlAttributesRule | code | html | |
| MSOfficeDocumentPropertiesRule | Extracts the extra properties saved in HTML from MS Office applications (Word and Excel), such as Author, Company, Version, etc. | code | html |
| MetaTagRule | code | html | |
| ParameterExtractingRule | code | html | |
| RegexReplacementTextFilter | TextFilter that substitutes content using a JDK 1.4 regular expression. | code | html |
| TagReplaceRule | Very simple rule for replacing all occurences of one tag with another. | code | html |
| TitleExtractingRule | code | html |