Metadata Classes

Funnelback allows metadata classes to be defined as ASCII alphanumeric strings up to 64 characters long, which do NOT start with upper or lower case FUN. Funnelback has some predefined metadata classes which should be used when possible.

Reserved Classes

The following metadata classes are reserved for internal use, and should not normally be used for other purposes.

Predefined Metadata ClassExplanation of Reservation
hOutgoing link target information.
iImage information (alt and src attributes of img tags).
kAnchor text referring to the document (text within a tags).
mEmail addresses within the document (a tags using mailto: in href attributes).
uURL hostname information.
vURL path and filename information.
KUser click information referring to the document.

Any metadata class that starts with FUN or any upper or lower case variation is also reserved.

Special Classes

The following metadata classes are treated specially by Funnelback. It may be appropriate to map metadata into them, but they will be treated differently internally as described below.

Metadata ClassExplanationDefault mappings
dUsed for document date information. Date sources may be mapped in metamap.cfg or xml.cfg, and will be used when the document date is displayed and for recency related ranking.dc.date (and qualifications like dc.data.published not mentioned thereafter), dc.date.modified, dc.date.created, dc.date.issued, Last Modified Date (from HTTP headers), dc.date.expires, dc.date.valid, in order of decreasing priority. See supported date formats for more information.
fUsed for file format information. Will be used as the original type of a file (e.g. HTML, PDF, Word Document) where this information is displayed.dc.format, funnelback.format, text/html
tUsed for title information. Title sources may be mapped in metamap.cfg or xml.cfg. The first title found will be used when the document title is displayed and all title content will be up-weighted by default in ranking. For html documents the title in

Predefined Classes

Funnelback has predefined the following classes, this allows Funnelback to look for some metadata within html documents and display this data on the search results page without heavy customisation.

Metadata ClassExplanationMetadata fields included
*****Anywhere. In any metadata field or in the page content.N/A
aAuthorAuthor, DC.Creator, DC.Author, DC.Contributor, from: (email)
bRightsDC.Rights
cDescriptionDC.Description
eTypeDC.Type
fFormatDC.Format
gRelationDC.Relation
jAvailability/IdentifierDC.Identifier, AGLS.Availability
lLanguageDC.Language
nSourceDC.Source
oCoverageDC.Coverage
pPublisherDC.Publisher
qFunctionAGLS.Function
rRecipientsto: (email),AGLS.audience
sSubject/Keywordskeywords, DC.Subject, subject: (in the case of email)
w-AGLS.Mandate
SUsed for document security information in DLS-enabled collections.-

Listing classes

In a number of situations, for example some query processor options, Funnelback supports providing a list of metadata classes.

The standard syntax for such a list is a comma separated list of class names within square brackets, for example:

[class1,class2...,classN]

See also

top