Macromedia colfusion mx 7 Manual

Descargar
Página de 170
Path and URL options
125
-https
Type: Web crawling only
Lets you index SSL-enabled websites.
Note: You must have the Verity SSL Option Pack installed to use the 
-https
 option. The Verity SSL 
Option Pack is a Verity Spider add-on available separately from a Verity salesperson.
-jumps
Type: Web crawling only
Syntax
-jumps num_jumps
Specifies the maximum number of levels an indexing job can go from the starting URL. Specify a 
number between 0 and 254.
The default value is unlimited. If you see extremely large numbers of documents in a collection 
where you do not expect them, consider experimenting with this option, in conjunction with the 
Content options, to pare down your collection.
-nodocrobo
Specifies to ignore ROBOT META tag directives.
In HTML 3.0 and earlier, robot directives could only be given as the file robots.txt under the root 
directory of a website. In HTML 4.0, every document can have robot directives embedded in the 
META field. Use this option to ignore them. Use this option with discretion.
-nofollow
Type: Web crawling only
Syntax
-nofollow "exp"
Specifies that Verity Spider cannot follow any URLs that match the exp expression. If you do not 
specify an exp value for the 
-nofollow
 option, Verity Spider assumes a value of "*", where no 
documents are followed.
You can use wildcard expressions, where the asterisk (*) is for text strings and the question mark 
(?) is for single characters. Always encapsulate the exp values in double-quotation marks to ensure 
that they are properly interpreted. 
If you use backslashes, you must double them so that they are properly escaped; for example:
C:\\test\\docs\\path
To use regular expressions, also specify the 
Earlier versions of Verity Spider did not allow the use of an expression. This meant that for each 
starting point URL, only the first document would be indexed. With the addition of the 
expression functionality, you can now selectively skip URLs, even within documents.
See also