Skip to content

filter.text-cleanup.ranges-to-replace

Specify Unicode blocks for replacement during filtering (to avoid 'corrupt' character display).

Key: filter.text-cleanup.ranges-to-replace
Type: List<String>
Can be set in: collection.cfg

Description

This parameter lists the Unicode blocks of characters which should be removed by the TextCleanupFilterProvider when it is used in the filter.classes config setting.

The list is expected to be comma separated, and contain names as listed in the defined unicode block names. Block names are case-insensitive (so PLAYING_CARDS and playing_cards are both permitted and are equivalent).

Default Value

filter.text-cleanup.ranges-to-replace=private_use_area

Examples

If both private use and playing card characters occur in documents but should not be shown in search results or cached copies:

filter.text-cleanup.ranges-to-replace=private_use_area,playing_cards

See Also

top

Funnelback logo
v15.24.0