All Classes and Interfaces

Class
Description
 
 
 
Base token interface.
Base class for all tokens.
 
Browser compact XML serializer - creates resulting XML by stripping whitespaces wherever possible, but preserving single whitespace where at least one exists.
 
Properties defining cleaner's behaviour
Contains transformation collection.
 
Command line usage class.
HTML comment token.
Compact HTML serializer - creates resulting HTML by stripping whitespaces wherever possible.
Compact XML serializer - creates resulting XML by stripping whitespaces.
Configuration file tag provider - reads XML file in specified format and creates a Tag Provider.
HTML text token.
 
This is the default tag provider for HTML Cleaner. Note that it is no longer generated from XML; see https://sourceforge.net/p/htmlcleaner/bugs/81/
Most HTML 4 elements permitted within the BODY are classified as either block-level elements or inline elements.
HTML doctype token.
 
DOM serializer - creates xml DOM.
HTML tag end token.
Possible error codes (in effect, messages) that the cleaner uses to inform clients about the reasons for a modification and the actions it involves.
 
 
Main HtmlCleaner class.
General HtmlCleaner runtime exception.
Support for ANT.
Implementors can be registered on CleanerProperties to receive notifications about modifications made by html cleaner.
 
Marker interface denoting nodes of the document tree
Abstract HTML serializer - contains common logic for descendants.
Main HTML tokenizer.
Provides set of TagInfo instances.
Used as base for different node checkers.
JDom serializer - creates xml JDom instance out of the TagNode.
Contains the MathML tags to use with HTML5 tags.
 
Pretty HTML serializer - creates resulting HTML with indenting lines.
Pretty XML serializer - creates resulting XML with indenting lines.
Basic abstract serializer - contains common logic for descendants (methods writeXXX()).
Simple HTML serializer - creates resulting HTML without indenting and/or compacting.
Simple XML serializer - creates resulting XML without indenting lines.
This class contains a map of the special entities used in HTML and their Unicode code points.
 
All nodes.
Class contains information about single HTML tag.
It also contains rules for tag balancing.
XML node tag - basic node of the cleaned HTML tree.
Checks if node contains specified attribute.
Checks if node has specified attribute with specified value.
Checks if node has specified attribute with specified value.
Remove empty autogenerated nodes.
Checks if node is an inline or block element and has empty contents or white/non-breakable spaces only.
Checks if node is an insignificant br tag -- is placed at the end or at the start of a block.
Checks if node has specified name.
Defines action to be performed on TagNodes
HTML tag token - descendants are start (TagNode) and end token (EndTagToken).
Describes how a specified tag is transformed to another one, or is ignored during parsing.
A traversal-based serializer for DOM; used to avoid recursion and stack overflow for large HTML documents.
Common utilities.
Abstract XML serializer - contains common logic for descendants.
Depth-first node traversor.
Node visitor interface.
Utility for searching cleaned document tree with XPath expressions.
Exception that could occur during XPather evaluation.
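
The browser compact XML serializer entry above describes a whitespace rule: strip whitespace wherever possible, but keep a single whitespace where at least one exists. The following is a minimal sketch of that rule only, written in plain Java with no dependency on the library. The class name `WhitespaceCollapseSketch` and the method `collapse` are hypothetical names chosen for illustration; this is not the serializer's actual implementation, which may treat tag boundaries and special content differently.

```java
// Illustrative sketch only: hypothetical class, not part of the library.
final class WhitespaceCollapseSketch {

    // Replaces every run of one or more whitespace characters with a single
    // space. Text that contains no whitespace is returned unchanged.
    static String collapse(String text) {
        StringBuilder out = new StringBuilder(text.length());
        boolean inRun = false; // true while inside a whitespace run
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (Character.isWhitespace(c)) {
                if (!inRun) {
                    out.append(' '); // keep exactly one space per run
                    inRun = true;
                }
            } else {
                out.append(c);
                inRun = false;
            }
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // prints "[ Hello world ]"
        System.out.println("[" + collapse("  Hello \n\t world  ") + "]");
    }
}
```

Collapsing rather than removing whitespace keeps adjacent words separated, which matters for text a browser would render inline.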