websphinx
Class Mirror
java.lang.Object
|
+--websphinx.HTMLTransformer
|
+--websphinx.LinkTransformer
|
+--websphinx.Mirror
- public class Mirror
- extends LinkTransformer
Offline mirror of a Web site. Web pages written to
a mirror are stored as files on the local disk in a directory
structure mirroring their URLs.
Field Summary |
(package private) java.lang.String |
defaultFilename
|
(package private) java.util.Vector |
files
|
(package private) boolean |
needRewrite
|
(package private) int |
nPages
|
(package private) java.lang.String |
root
|
Constructor Summary |
Mirror(java.lang.String directory)
Make a new Mirror. |
Method Summary |
private static java.lang.String |
canonicalDir(java.lang.String dir)
|
void |
close()
Close the mirror. |
private static java.lang.String |
encode(java.lang.String component)
|
java.lang.String |
getDefaultFilename()
Get the filename used for directory URLs. |
int |
getPageCount()
Get number of pages written to this mirror. |
java.lang.String |
lookupDir(java.net.URL base,
java.net.URL url)
Lookup the local directory to which a remote directory
URL maps. |
static void |
main(java.lang.String[] args)
|
void |
mapDir(java.net.URL url,
java.lang.String dir)
Map a directory URL (of the form http://host/path/) to
a local directory. |
void |
rewrite()
Rewrite the mirror to make local links consistent. |
void |
setDefaultFilename(java.lang.String filename)
Set the filename used for directory URLs. |
private java.lang.String |
toLocalDirURL(java.net.URL remoteURL)
|
private java.lang.String |
toLocalFileURL(java.net.URL remoteURL)
|
void |
write(Region region)
Writes a chunk of HTML through the HTML transformer. |
void |
write(java.lang.String string)
Writes a literal string through the HTML transformer
(without parsing it or transforming it). |
void |
writePage(Page page)
Write a page to the mirror. |
Methods inherited from class websphinx.LinkTransformer |
getBase, getEmitBaseElement, handleBase, handleElement, handleLink, isMapped, lookup, map, map, setBase, setEmitBaseElement |
Methods inherited from class websphinx.HTMLTransformer |
emit, emit, emitInternal, emitPendingRegion, finalize, flush, getFilePointer, getOutput, getRandomAccessFile, openFile, processElementsInRegion, seek, setOutput, setOutput, setRandomAccessFile, transformContents, transformElement, writeStream |
Methods inherited from class java.lang.Object |
, clone, equals, getClass, hashCode, notify, notifyAll, registerNatives, toString, wait, wait, wait |
root
java.lang.String root
files
java.util.Vector files
needRewrite
boolean needRewrite
defaultFilename
java.lang.String defaultFilename
nPages
int nPages
Mirror
public Mirror(java.lang.String directory)
throws java.io.IOException
- Make a new Mirror.
- Parameters:
directory
- Root directory (on local disk
relative to which the mirror pages are stored)
getDefaultFilename
public java.lang.String getDefaultFilename()
- Get the filename used for directory URLs.
For example, if the default filename is "index.html",
then the remote URL "http://www.xxx.com/path/" would
map to the local pathname "www.xxx.com/path/index.html".
- Returns:
- default filename. Default is "index.html".
setDefaultFilename
public void setDefaultFilename(java.lang.String filename)
- Set the filename used for directory URLs.
For example, if the default filename is "index.html",
then the remote URL "http://www.xxx.com/path/" would
map to the local pathname "www.xxx.com/path/index.html".
- Parameters:
filename
- Default filename.
getPageCount
public int getPageCount()
- Get number of pages written to this mirror.
- Returns:
- number of calls to writePage() on this mirror
write
public void write(Region region)
throws java.io.IOException
- Description copied from class:
HTMLTransformer
- Writes a chunk of HTML through the HTML transformer.
- Overrides:
write
in class HTMLTransformer
- Following copied from class:
websphinx.HTMLTransformer
- Parameters:
region
- Region to write
write
public void write(java.lang.String string)
throws java.io.IOException
- Description copied from class:
HTMLTransformer
- Writes a literal string through the HTML transformer
(without parsing it or transforming it).
- Overrides:
write
in class HTMLTransformer
- Following copied from class:
websphinx.HTMLTransformer
- Parameters:
string
- String to write
writePage
public void writePage(Page page)
throws java.io.IOException
- Write a page to the mirror. Stores the page on the local
disk, fixing up its links to point to the local
copies of any pages already stored to this mirror.
- Overrides:
writePage
in class LinkTransformer
- Parameters:
page
- Page to write
close
public void close()
throws java.io.IOException
- Close the mirror. Makes sure that links point to local versions of
pages wherever possible.
- Overrides:
close
in class HTMLTransformer
- Following copied from class:
websphinx.HTMLTransformer
- Throws:
java.io.IOException
- if an I/O error occurs
rewrite
public void rewrite()
throws java.io.IOException
- Rewrite the mirror to make local links consistent.
toLocalFileURL
private java.lang.String toLocalFileURL(java.net.URL remoteURL)
toLocalDirURL
private java.lang.String toLocalDirURL(java.net.URL remoteURL)
mapDir
public void mapDir(java.net.URL url,
java.lang.String dir)
throws java.net.MalformedURLException
- Map a directory URL (of the form http://host/path/) to
a local directory.
- Parameters:
url
- Directory URL. Must end with a slash.dir
- Local directory relative to which descendents of
url should be saved.
lookupDir
public java.lang.String lookupDir(java.net.URL base,
java.net.URL url)
- Lookup the local directory to which a remote directory
URL maps.
- Parameters:
base
- local file URL to use as a base. If non-null,
then the returned pathname is relative to this URL. If
null, the returned pathname is an absolute URL (file:/path/).url
- remote directory URL to look up. Must end in slash.
canonicalDir
private static java.lang.String canonicalDir(java.lang.String dir)
encode
private static java.lang.String encode(java.lang.String component)
main
public static void main(java.lang.String[] args)
throws java.lang.Exception