Tag: Files

How does Cludo index files?

09 Feb, 2023 FAQ 0

As long as a file is machine-readable (not an image), Cludo is able to crawl its content along with the information sent with the HTTP headers. How to enable or disable file indexing By default, the crawler is configured to index files for the specified domain. You can enable or disable . . . Read more

Why is this file not indexed?

31 Jan, 2023 FAQ 0

Once a crawler has crawled the defined domain(s), you may experience a specific file not being added to the search index. This will typically be due to one of the following reasons:

What is the maximum file size Cludo can index?

30 Jan, 2023 FAQ 0

Cludo’s crawlers can index files up to 50 MB. Anything larger can be pushed directly via Cludo’s API. The extraction of files removes the size of images and other irrelevant information prior to looking at the file size. For reference, the raw text of the entire Bible is around 5MB.

What file types does Cludo index?

30 Jan, 2023 FAQ 0

The indexability of a file is not defined by its extension (e.g. “.pdf”), but rather by the content type, as returned in the HTTP headers. In the list below, we have added extensions as examples. Supported file types

What are you looking for?

Explore topics

Tag: Files

How does Cludo index files?

Why is this file not indexed?

What is the maximum file size Cludo can index?

What file types does Cludo index?