Package org.htmlparser.visitors
Class TextExtractingVisitor
java.lang.Object
  org.htmlparser.visitors.NodeVisitor
    org.htmlparser.visitors.TextExtractingVisitor
public class TextExtractingVisitor extends NodeVisitor
Extracts text from a web page. Usage:
    Parser parser = new Parser(...);
    TextExtractingVisitor visitor = new TextExtractingVisitor();
    parser.visitAllNodesWith(visitor);
    String textInPage = visitor.getExtractedText();
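This page does not show how the class collects text internally. The following is only a minimal, self-contained sketch of the visitor pattern that the usage above relies on: a driver calls one callback per node, and the visitor accumulates the text it receives. All types here (`TextExtractionSketch`, `SketchVisitor`, `CollectingVisitor`, `Token`, `Kind`) are hypothetical stand-ins invented for illustration; they are not part of org.htmlparser and may differ from what TextExtractingVisitor actually does.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical stand-ins for illustration only; none of these types belong to org.htmlparser.
final class TextExtractionSketch {

    /** Minimal visitor with no-op callbacks, loosely in the style of NodeVisitor. */
    static class SketchVisitor {
        void visitTag(String tagName) { }
        void visitStringNode(String text) { }
        void visitEndTag(String tagName) { }
    }

    /** Appends every string node it is shown and ignores tags (an assumption of this sketch). */
    static final class CollectingVisitor extends SketchVisitor {
        private final StringBuilder collected = new StringBuilder();

        @Override
        void visitStringNode(String text) {
            collected.append(text);
        }

        String getExtractedText() {
            return collected.toString();
        }
    }

    /** Node kinds in the flat token stream that stands in for a parsed page. */
    enum Kind { TAG, TEXT, END_TAG }

    static final class Token {
        final Kind kind;
        final String value;

        Token(Kind kind, String value) {
            this.kind = kind;
            this.value = value;
        }
    }

    /** Dispatches each token to the matching callback, as a parser driving a visitor would. */
    static void visitAll(List<Token> tokens, SketchVisitor visitor) {
        for (Token token : tokens) {
            switch (token.kind) {
                case TAG:
                    visitor.visitTag(token.value);
                    break;
                case TEXT:
                    visitor.visitStringNode(token.value);
                    break;
                case END_TAG:
                    visitor.visitEndTag(token.value);
                    break;
            }
        }
    }

    /** Runs a fresh CollectingVisitor over the tokens and returns the gathered text. */
    static String extract(List<Token> tokens) {
        CollectingVisitor visitor = new CollectingVisitor();
        visitAll(tokens, visitor);
        return visitor.getExtractedText();
    }

    public static void main(String[] args) {
        // Stands in for <p>Hello, <b>world</b></p>
        List<Token> page = Arrays.asList(
                new Token(Kind.TAG, "p"),
                new Token(Kind.TEXT, "Hello, "),
                new Token(Kind.TAG, "b"),
                new Token(Kind.TEXT, "world"),
                new Token(Kind.END_TAG, "b"),
                new Token(Kind.END_TAG, "p"));
        System.out.println(extract(page)); // prints "Hello, world"
    }
}
```

In the sketch, only the string-node callback contributes to the result; the tag callbacks are left as no-ops, which is why the markup disappears from the extracted text.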
Constructor Summary
Constructor                Description
TextExtractingVisitor()
Method Summary
Modifier and Type    Method                             Description
java.lang.String     getExtractedText()
void                 visitEndTag(Tag tag)               Called for each Tag visited that is an end tag.
void                 visitStringNode(Text stringNode)   Called for each StringNode visited.
void                 visitTag(Tag tag)                  Called for each Tag visited.
Methods inherited from class org.htmlparser.visitors.NodeVisitor
beginParsing, finishedParsing, shouldRecurseChildren, shouldRecurseSelf, visitRemarkNode
Method Detail
getExtractedText
public java.lang.String getExtractedText()
visitStringNode
public void visitStringNode(Text stringNode)
Description copied from class: NodeVisitor
Called for each StringNode visited.
Overrides: visitStringNode in class NodeVisitor
Parameters: stringNode - The string node being visited.
visitTag
public void visitTag(Tag tag)
Description copied from class: NodeVisitor
Called for each Tag visited.
Overrides: visitTag in class NodeVisitor
Parameters: tag - The tag being visited.
visitEndTag
public void visitEndTag(Tag tag)
Description copied from class: NodeVisitor
Called for each Tag visited that is an end tag.
Overrides: visitEndTag in class NodeVisitor
Parameters: tag - The end tag being visited.