Skip to content

Padre Cooler Options

Description

This page describes the possible options for tuning the ranking using the cool query processor option. For more information about how raking works, see Funnelback_Ranking_Algorithms.

Those options can either be set in Query processor options (collection.cfg) or using CGI parameters (e.g. ...&cool.2=12&cool.3=34...).

List of cooler options

NumberDescription
0content: content weight
1onlink: onsite link weight
2offlink: offsite link weight
3urllen: URL length weight
4qie: external evidence (qie) weight
5date_proximity: proximity to current date weight
6urltype: URL attractiveness (Homepages favoured. Copyright pages and URLS with lots of punctuation deprecated.)
7annie: annotation weight (annie)
8domain_weight: weight associated with this domain
9geoprox: geographical proximity to origin
10nonbin: non-binariness (1 for html, xml, txt, 0 otherwise)
11no_ads: freedom from ads
12imp_phrase: implicit phrase match score
13consistency: consistency of evidence. (Extra reward for docs with non-zero scores on both content and annie.)
14log_annie: logarithm of annotation weight (log(annie))
15anlog_annie: absolute-normalised logarithm of annotation weight.
16annie_rank: annotation rank = (k - rank)/ k. where k = 2 x highest rank requested - if rank > k, rank = k
17BM25F: field-weighted Okapi score
18an_okapi: absolute-normalised Okapi score.
19BM25F_rank: field-weighted Okapi rank.
20mainhosts: bias in favour of principal servers (web search only).
21comp_wt: component collection weighting. (meta collections only).
22document_number: document number in the crawl. An early position in the crawl may correlate with importance
23host_incoming_link_score
24host_click_score
25host_linking_hosts_score
26host_linked_hosts_score
27host_rank_in_crawl_order_score
28host_domain_shallowness_score
29doc_matches_regex: document matches administrator supplied regex
30doc_does_not_match_regex: document does not match administrator supplied regex
31titleWords: number of words in title
32contentWords: number of indexed words in document
33compressionFactor: compressibility of document text
34entropy: entropy of document
35stopwordFraction: fraction of stopwords in the document
36stopwordCover: fraction of stopword list present in the document
37averageTermLen: average term length
38distinctWords: number of distinct words in the document
39maxFreq: frequency of most frequently occurring term
40titleWords_neg: Neg number of words in title
41contentWords_neg: Neg number of indexed words in document
42compressionFactor_neg: Neg compressibility of document text
43entropy_neg: Neg entropy of document
44stopwordFraction_neg: Neg fraction of stopwords in the document
45stopwordCover_neg: Neg fraction of stopword list present in the document
46averageTermLen_neg: Neg average term length
47distinctWords_neg: Neg number of distinct words in the document
48maxFreq_neg: Neg frequency of most frequently occurring term
49titleWords_abs: Abs number of words in title
50contentWords_abs: Abs number of indexed words in document
51compressionFactor_abs: Abs compressibility of document text
52entropy_abs: Abs entropy of document
53stopwordFraction_abs: Abs fraction of stopwords in the document
54stopwordCover_abs: Abs fraction of stopword list present in the document
55averageTermLen_abs: Abs average term length
56distinctWords_abs: Abs number of distinct words in the document
57maxFreq_abs: Abs frequency of most frequently occurring term
58titleWords_abs_neg: Abs number of words in title
59contentWords_abs_neg: Neg abs number of indexed words in document
60compressionFactor_abs_neg: Neg abs compressibility of document text
61entropy_abs_neg: Neg abs entropy of document
62stopwordFraction_abs_neg: Neg abs fraction of stopwords in the document
63stopwordCover_abs_neg: Neg abs fraction of stopword list present in the document
64averageTermLen_abs_neg: Neg abs average term length
65distinctWords_abs_neg: Neg abs number of distinct words in the document
66maxFreq_abs_neg: Neg abs frequency of most frequently occurring term
67lexical_span_score
68doc_matches_cgscope1: Documents which match gscope defined by -cgscope1 (if defined)
69doc_matches_cgscope2: Documents which match gscope defined by -cgscope2 (if defined)
70doc_does_not_match_cgscope1: Documents which do not match gscope defined by -cgscope1 (if defined)
71doc_does_not_match_cgscope2: Documents which do not match gscope defined by -cgscope2 (if defined)
72raw_annie: Untransformed annie score linealry scaled to 0..1

Values

Values are unbounded, but typical weights range from 0-100.

Example

To set the query processor to ignore URL length, but give a high weight to phrase matches implied by the query:

 query_processor_options=-cool.3=0 -cool.12=100

See also

top

Funnelback logo
v15.20.0