To follow the material below, it helps to define some of the jargon used
with WebMaven.
Term
Definition
URI
WebMaven uses the term URI to encompass URLs along with other
associated data necessary to reference a file via the HTTP protocol.
From RFC 2068:
URIs have been known by many names: WWW addresses, Universal Document
Identifiers, Universal Resource Identifiers, and finally the
combination of Uniform Resource Locators (URL) and Names (URN). As
far as HTTP is concerned, Uniform Resource Identifiers are simply
formatted strings which identify -- via name, location, or any other
characteristic -- a resource.
Local path
A remote site, be it a complete domain (e.g. www.cfsrexx.com) or one
or more paths within a remote domain (e.g. www.cfsrexx.com/pub/ and
www.cfsrexx.com/WebMaven/), is downloaded into the local path directory.
Remote path is a domain name, with or without a path. The remote path
corresponds to the local path on a one-to-one basis when only one
remote path is specified.
Enterprise-enabled versions of WebMaven could specify remote path values of
Baseball/American/Red Sox and Baseball/National/Dodgers and a local path of
MLB, which would result in the following tree structure on the local hard
drive:
MLB
|
*-- American
|   |
|   *-- Red Sox
|       |
|       *-- Pawtucket
|       |
|       *-- Trenton
|
*-- National
    |
    *-- Dodgers
        |
        *-- Albuquerque
        |
        *-- San Antonio
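The mapping from remote paths to local directories in the MLB example can be sketched as follows. This is a minimal illustration in Python, not WebMaven's actual implementation: the function name `map_remote_to_local` is hypothetical, and the rule it applies (drop the leading components shared by every remote path and place the remainder under the local path) is an assumption inferred from the example, not documented behavior.

```python
from pathlib import PurePosixPath


def map_remote_to_local(local_path, remote_paths):
    """Return {remote path: local directory} under an assumed mapping rule.

    Assumption (inferred from the MLB example, not documented behavior):
    the leading components common to every remote path are replaced by
    the local path.
    """
    split = [PurePosixPath(p).parts for p in remote_paths]
    # Count the leading components shared by all remote paths.
    shared = 0
    for components in zip(*split):
        if len(set(components)) != 1:
            break
        shared += 1
    root = PurePosixPath(local_path)
    # Hang whatever follows the shared prefix under the local path.
    return {
        remote: str(root.joinpath(*parts[shared:]))
        for remote, parts in zip(remote_paths, split)
    }


mapping = map_remote_to_local(
    "MLB", ["Baseball/American/Red Sox", "Baseball/National/Dodgers"]
)
for remote, local in mapping.items():
    print(f"{remote} -> {local}")
```

Under this assumed rule the two remote paths land in MLB/American/Red Sox and MLB/National/Dodgers, and a single remote path maps directly onto the local path, which matches the one-to-one correspondence described above.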
Out of tree designates HTML links that are not within the remote
path(s). The handling of out of tree links is detailed in the Protocol
processing table.
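One way to picture the out of tree test is as a host and path-prefix comparison against each remote path. The sketch below is an illustration only: the function name `is_out_of_tree` is hypothetical, it assumes remote paths are written without a scheme (as in the www.cfsrexx.com examples), and it assumes links have already been resolved to absolute URLs. How WebMaven itself performs the check is not stated here.

```python
from urllib.parse import urlsplit


def is_out_of_tree(link, remote_paths):
    """Return True if an absolute link falls outside every remote path.

    Assumption: each remote path has the form 'host' or 'host/path/',
    with no scheme. Relative links must be resolved before calling this;
    a link without a host is reported as out of tree.
    """
    parts = urlsplit(link)
    host = (parts.hostname or "").lower()
    path = parts.path or "/"
    for remote in remote_paths:
        remote_host, _, remote_dir = remote.partition("/")
        # Normalize the remote directory to the form '/dir/' ('/' for a bare host).
        prefix = "/" + remote_dir
        if not prefix.endswith("/"):
            prefix += "/"
        same_host = host == remote_host.lower()
        # Match files below the directory, or the directory named without a slash.
        under_path = path.startswith(prefix) or path + "/" == prefix
        if same_host and under_path:
            return False
    return True


remotes = ["www.cfsrexx.com/pub/", "www.cfsrexx.com/WebMaven/"]
print(is_out_of_tree("http://www.cfsrexx.com/pub/index.html", remotes))
print(is_out_of_tree("http://www.example.com/pub/index.html", remotes))
```

The first link is inside the tree and the second is not, because its host differs. The file name index.html and the host www.example.com are made up for the illustration.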
A WebMaven.LOG file is created or extended in the local path directory
each time WebMaven is run. The contents of the .LOG file vary depending
upon user-selected options.
WebMaven.!!!
This sentinel file exists in the local path while WebMaven is running.
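Because the sentinel file is present only while WebMaven is running, an external script could look for it before touching the local path. The following Python sketch shows the idea; the function name `webmaven_appears_active` is hypothetical. Whether a stale sentinel can be left behind after an abnormal exit is not stated here, so the result is best treated as a hint rather than a guarantee.

```python
import os

SENTINEL_NAME = "WebMaven.!!!"


def webmaven_appears_active(local_path):
    """Return True if the sentinel file is present in local_path.

    Assumption: the presence of the file is used as the only signal;
    no attempt is made to detect a stale sentinel.
    """
    return os.path.isfile(os.path.join(local_path, SENTINEL_NAME))


if __name__ == "__main__":
    # "MLB" is the local path from the example above.
    if webmaven_appears_active("MLB"):
        print("WebMaven appears to be running; leave the local path alone.")
    else:
        print("No sentinel file found.")
```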