Skip to content

exclude_patterns

The crawler will ignore a URL if it matches any of these exclude patterns.

Key: exclude_patterns
Type: List<String>
Can be set in: collection.cfg

Description

This option is a comma-separated list of substrings that is used by the crawler to determine if it will process a web page. If the page's URL contains one of the substrings the crawler will not process the page.

Use of this option will allow the search administrator to exclude an individual web page or to exclude a site as a whole.

See: include and exclude patterns for a description on how include and exclude patterns work and advanced techniques such as the use of a regular expression exclude pattern.

Default Value

exclude_patterns=/cgi-bin,/vti,/_vti,calendar,SQ_DESIGN_NAME=print,SQ_ACTION=logout,SQ_PAINT_LAYOUT_NAME=,%3E%3C/script%3E,google-analytics.com

Examples

Ignore standard paths, plus a local sales folder.

exclude_patterns=/cgi-bin,/vti,/_vti,calendar,/sales

Note: The parameter is stored in the collection.cfg file in the form:

See Also

top

Funnelback logo
v15.22.0