crawler.accept_files (collection.cfg setting)
This is a comma-separated list of file extensions that will be downloaded by the crawler. It is normally left empty, so that the crawler will accept all valid content regardless of the suffix.
This means there are no restrictions on what files will be downloaded.
In this example a specific list of filetypes (based on suffix) is listed - only files of these types will be downloaded.