Class WikiCapturer
java.lang.Object
    org.htmlparser.parserapplications.SiteCapturer
        org.htmlparser.parserapplications.WikiCapturer
public class WikiCapturer extends SiteCapturer
Saves a wiki (WikiWikiWeb) locally. This is an illustrative program.
Field Summary
Fields inherited from class org.htmlparser.parserapplications.SiteCapturer
mCaptureResources, mCopied, mFilter, mFinished, mImages, mPages, mParser, mSource, mTarget, TRANSFER_SIZE
Constructor Summary
Constructor       Description
WikiCapturer()    Create a wikicapturer.
Method Summary
Modifier and Type    Method and Description
protected boolean    isToBeCaptured(java.lang.String link)
                     Returns true if the link is one we are interested in.
static void          main(java.lang.String[] args)
                     Mainline to capture a web site locally.
Methods inherited from class org.htmlparser.parserapplications.SiteCapturer
capture, copy, decode, getCaptureResources, getFilter, getSource, getTarget, isHtml, makeLocalLink, process, setCaptureResources, setFilter, setSource, setTarget
Method Detail
isToBeCaptured
protected boolean isToBeCaptured(java.lang.String link)
Returns true if the link is one we are interested in.
Overrides:
isToBeCaptured in class SiteCapturer
Parameters:
link - The link to be checked.
Returns:
true if the link has the source URL as a prefix and doesn't contain '?' or '#'. Links with '?' are skipped because server-side queries cannot be handled in the static target directory structure; links with '#' are skipped because the full page for that reference has presumably already been captured. The comparison is case-insensitive, which is cheating really, but it's cheap.
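The rule described above can be sketched as a standalone check. This is a minimal illustration written from the description only, not the library's actual implementation; the class name LinkFilterSketch, the explicit source parameter, and the example.org URLs are assumptions made for the example (in WikiCapturer itself the source URL comes from the inherited getSource).

```java
public class LinkFilterSketch {

    /**
     * Hypothetical stand-in for the documented rule: the link must start with
     * the source URL (compared without regard to case) and must not contain
     * '?' or '#'.
     */
    public static boolean isToBeCaptured(String source, String link) {
        // regionMatches with ignoreCase=true is a case-insensitive prefix test;
        // it returns false when the link is shorter than the source.
        boolean hasPrefix = link.regionMatches(true, 0, source, 0, source.length());
        boolean hasQuery = link.indexOf('?') != -1;     // server-side query
        boolean hasFragment = link.indexOf('#') != -1;  // in-page reference
        return hasPrefix && !hasQuery && !hasFragment;
    }

    public static void main(String[] args) {
        String source = "http://example.org/wiki/"; // assumed example URL
        System.out.println(isToBeCaptured(source, "http://EXAMPLE.org/wiki/FrontPage"));        // true
        System.out.println(isToBeCaptured(source, "http://example.org/wiki/Page?action=edit")); // false
        System.out.println(isToBeCaptured(source, "http://example.org/wiki/Page#section"));     // false
        System.out.println(isToBeCaptured(source, "http://other.org/"));                        // false
    }
}
```

The case-insensitive prefix test accepts links that differ from the source only in case, which matches the "cheap" comparison the description mentions.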
main
public static void main(java.lang.String[] args) throws java.net.MalformedURLException, java.io.IOException
Mainline to capture a web site locally.
Parameters:
args - The command line arguments. There are three arguments: the web site to capture, the local directory to save it to, and a flag (true or false) to indicate whether resources such as images and video are to be captured as well. These are requested via dialog boxes if not supplied.
Throws:
java.net.MalformedURLException - If the supplied URL is invalid.
java.io.IOException - If an error occurs reading the pages or resources.
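As a rough illustration of the three arguments described above, the following sketch interprets an argument array in the same order: web site, target directory, resource flag. It is not the library's code. The class CaptureArgsSketch and its fields are hypothetical, and where the real mainline asks for missing values via dialog boxes, this sketch simply rejects an incomplete argument list.

```java
import java.net.MalformedURLException;
import java.net.URL;

public class CaptureArgsSketch {
    public final URL source;               // the web site to capture
    public final String target;            // the local directory to save it to
    public final boolean captureResources; // whether images, video, etc. are captured

    private CaptureArgsSketch(URL source, String target, boolean captureResources) {
        this.source = source;
        this.target = target;
        this.captureResources = captureResources;
    }

    /** Hypothetical helper: reads the three arguments in the documented order. */
    @SuppressWarnings("deprecation") // new URL(String) is deprecated in recent JDKs but still available
    public static CaptureArgsSketch parse(String[] args) throws MalformedURLException {
        if (args.length < 3) {
            // Assumption: the real program would prompt with dialog boxes here.
            throw new IllegalArgumentException(
                "expected: <url> <target directory> <true|false>");
        }
        URL source = new URL(args[0]); // throws MalformedURLException for an invalid URL
        boolean captureResources = Boolean.parseBoolean(args[2]);
        return new CaptureArgsSketch(source, args[1], captureResources);
    }

    public static void main(String[] args) throws MalformedURLException {
        CaptureArgsSketch parsed = parse(
            new String[] {"http://example.org/wiki/", "wiki-copy", "true"});
        System.out.println(parsed.source + " -> " + parsed.target
            + " (resources: " + parsed.captureResources + ")");
    }
}
```

With the HTML Parser library on the classpath, the same three strings would be passed directly to WikiCapturer.main.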