Skip to content

filter.tika.types

Specifies which file types to filter using the TikaFilterProvider.

Key: filter.tika.types
Type: List<String>
Can be set in: collection.cfg

Description

This option specifies which filetypes the TikaFilterProvider should filter. This filter has support for a wide range of file types and would normally try to filter as many types of input as it can. To allow other filters to work on specific file types we use this "white list" to control what Tika does/doesn't filter.

Default Value

filter.tika.types=doc,dot,ppt,xls,rtf,docx,docm,pptx,pptm,xlsx,xlsm,pdf,png,gif,jpg,jpeg,tif,tiff,epub,vsd,msg,odt,odp,ods,odg,dwg,key,pages,numbers

Examples

Remove TIFF files from the list to filter those files using custom filter added to filter.classes setting:

filter.tika.types=doc,dot,ppt,xls,rtf,docx,pptx,xlsx,pdf,png,gif,jpg,jpeg,epub,vsd,msg,odt,odp,ods,odg

See Also

:

top

Funnelback logo
v15.24.0