filter.tika.types (collection.cfg setting)

Description

This option specifies which file types the TikaFilterProvider should filter. Tika supports a wide range of file types and would normally try to filter as many types of input as it can. This whitelist restricts Tika to the listed types, so that other filters can handle any file type left off the list.

Default value

filter.tika.types=doc,dot,ppt,xls,rtf,docx,pptx,xlsx,pdf,png,gif,jpg,jpeg,tif,tiff,epub,vsd,msg,odt,odp,ods,odg

Examples

Remove TIFF files (tif and tiff) from the list so that they can be filtered by a custom filter added to the filter.classes setting:

filter.tika.types=doc,dot,ppt,xls,rtf,docx,pptx,xlsx,pdf,png,gif,jpg,jpeg,epub,vsd,msg,odt,odp,ods,odg
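
Editing a long comma-separated value by hand is error-prone, so the value above can also be derived from the default list. The following is a minimal Python sketch of that step. The helper name remove_types is hypothetical and not part of Funnelback; it only assumes that the setting is a plain comma-separated list of types, as shown in the default value.

```python
# Default value of filter.tika.types, copied from the "Default value" section.
DEFAULT_TYPES = (
    "doc,dot,ppt,xls,rtf,docx,pptx,xlsx,pdf,png,gif,jpg,jpeg,"
    "tif,tiff,epub,vsd,msg,odt,odp,ods,odg"
)


def remove_types(value, to_remove):
    """Return the comma-separated list without the given types, keeping order.

    Hypothetical helper for illustration only; not a Funnelback API.
    """
    drop = {ext.strip().lower() for ext in to_remove}
    kept = [ext for ext in value.split(",") if ext.strip().lower() not in drop]
    return ",".join(kept)


# Drop both TIFF entries so that Tika no longer handles those files.
new_value = remove_types(DEFAULT_TYPES, ["tif", "tiff"])
print("filter.tika.types=" + new_value)
```

The printed line can then be pasted into collection.cfg. The custom filter itself still has to be added to filter.classes separately.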

v15.16.0