Class TextExtractingVisitor


  • public class TextExtractingVisitor
    extends NodeVisitor
    Extracts text from a web page. Usage: Parser parser = new Parser(...); TextExtractingVisitor visitor = new TextExtractingVisitor(); parser.visitAllNodesWith(visitor); String textInPage = visitor.getExtractedText();
    • Constructor Detail

      • TextExtractingVisitor

        public TextExtractingVisitor()
    • Method Detail

      • getExtractedText

        public java.lang.String getExtractedText()
      • visitStringNode

        public void visitStringNode​(Text stringNode)
        Description copied from class: NodeVisitor
        Called for each StringNode visited.
        Overrides:
        visitStringNode in class NodeVisitor
        Parameters:
        stringNode - The string node being visited.
      • visitTag

        public void visitTag​(Tag tag)
        Description copied from class: NodeVisitor
        Called for each Tag visited.
        Overrides:
        visitTag in class NodeVisitor
        Parameters:
        tag - The tag being visited.
      • visitEndTag

        public void visitEndTag​(Tag tag)
        Description copied from class: NodeVisitor
        Called for each Tag visited that is an end tag.
        Overrides:
        visitEndTag in class NodeVisitor
        Parameters:
        tag - The end tag being visited.