Category: FAQ

What are search categories?

31 Jan, 2023

The “Category” field is a standard crawler field that can be set up to identify a specific type of content during crawling. This is useful when implementing the template, because it makes filtering on that category straightforward. The example above is just one . . . Read more
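As a rough illustration of filtering on a category, the sketch below works on a hypothetical list of indexed pages. The record layout, the category values ("Page", "Document"), and the helper name `filter_by_category` are assumptions made for this example and do not reflect Cludo's actual data model or API.

```python
from typing import Dict, List

# Hypothetical indexed pages; "Category" mirrors the crawler field named above.
RESULTS: List[Dict[str, str]] = [
    {"Title": "Opening hours", "Category": "Page"},
    {"Title": "Annual report 2022", "Category": "Document"},
    {"Title": "Contact us", "Category": "Page"},
]


def filter_by_category(results: List[Dict[str, str]], category: str) -> List[Dict[str, str]]:
    """Keep only the results whose Category field equals the given value."""
    return [r for r in results if r.get("Category") == category]


if __name__ == "__main__":
    pages = filter_by_category(RESULTS, "Page")
    print([r["Title"] for r in pages])
```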

How to set up Cludo for intranets

31 Jan, 2023

You may wish to implement Cludo on an intranet solution that is otherwise closed to the public. For this, you will want to consider how the crawler should access the site as well as how secure the implementation needs to be.

Ways to allow crawling behind login

In order to . . . Read more

How do I change the language in MyCludo?

31 Jan, 2023

The MyCludo interface supports both English and Danish. This guide will explain how to change the interface language for a user. Please note this language change does not affect any engines, visitors, or even other users in MyCludo. If the changed user is the currently logged-in user, the entire interface . . . Read more

What is Intelligent Autocomplete?

31 Jan, 2023

Intelligent Autocomplete is a feature for suggesting search terms as the visitor is typing. It uses machine learning to predict the most relevant suggestions for the specific engine. Based on previous visitor behavior and successful searches combined with result titles, Intelligent Autocomplete will predict the most likely search term for . . . Read more

What is XPath?

31 Jan, 2023

XPath (XML Path Language) is a language for writing path expressions that select a specific subset of XML or HTML markup. In MyCludo, XPath can be used in the crawler configuration to define a specific part of the HTML of the page to use for a field. Using the HTML markup, different . . . Read more
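To make the idea concrete, here is a minimal sketch of selecting part of a page with a path expression. It uses Python's standard `xml.etree.ElementTree`, which supports only a subset of XPath and requires well-formed markup. The sample page, the `content` class name, and the helper `extract_field` are hypothetical, and the expressions MyCludo accepts may differ.

```python
import xml.etree.ElementTree as ET

# Hypothetical, well-formed page markup used only for illustration.
PAGE = """
<html>
  <head><title>Opening hours</title></head>
  <body>
    <div class="breadcrumb">Home</div>
    <div class="content">
      <h1>Opening hours</h1>
      <p>We are open every weekday.</p>
    </div>
  </body>
</html>
"""


def extract_field(markup: str, xpath: str) -> str:
    """Return the text of the first element matching xpath, or '' if none matches."""
    root = ET.fromstring(markup)
    node = root.find(xpath)
    if node is None:
        return ""
    # Join the text of the element and all of its descendants.
    return "".join(node.itertext()).strip()


if __name__ == "__main__":
    # Select the heading and the paragraph inside the content container only.
    print(extract_field(PAGE, ".//div[@class='content']/h1"))
    print(extract_field(PAGE, ".//div[@class='content']/p"))
```

Scoping the expression to the `content` container is what keeps the breadcrumb text out of the extracted field.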

Setting up time-scheduled crawling

31 Jan, 2023

The crawler can be scheduled to run at a specific time of day. Currently, this schedule cannot be set via MyCludo. To configure time-scheduled crawling, submit a support ticket stating your time zone and at which time of . . . Read more

What is async crawling?

31 Jan, 2023

Async crawling is meant for websites where content is loaded asynchronously (AJAX-generated content). AJAX lets a web page exchange data with the server and update its content without reloading the page. For example, if you hit a “Submit” button on the page, AJAX processes the information and updates the content . . . Read more

What are the crawlers’ user agent and IP addresses?

30 Jan, 2023

In some cases, the crawler may be blocked from indexing your website. To fix this, you may need to whitelist our IP address to allow the crawler to access the site. Our crawler’s user agent is: Our crawler’s user agent can be referred to simply as cludo.

User-agent: cludo
Allow: *

Our . . . Read more

Where can I find the API documentation?

30 Jan, 2023

You can find developer resources, including Cludo’s API documentation, here.

What is the maximum file size Cludo can index?

30 Jan, 2023

Cludo’s crawlers can index files up to 15 MB. Anything larger can be pushed directly via Cludo’s API. File extraction strips out images and other irrelevant information before the file size is measured. For reference, the raw text of the entire Bible is around 5 MB.
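The size rule can be sketched as a simple check. This is only an illustration: it assumes 1 MB means 1,000,000 bytes and that the size passed in is the size of the extracted content, and the function name and the return labels "crawler" and "api" are made up for this example.

```python
# Assumption: 1 MB = 1,000,000 bytes; the source only states "15 MB".
MAX_CRAWL_BYTES = 15 * 1000 * 1000


def indexing_route(extracted_size_bytes: int) -> str:
    """Suggest how a file could be indexed, based on its extracted size.

    Returns "crawler" when the size is within the 15 MB limit and "api"
    when the file is larger and would need to be pushed instead.
    """
    if extracted_size_bytes < 0:
        raise ValueError("size must be non-negative")
    return "crawler" if extracted_size_bytes <= MAX_CRAWL_BYTES else "api"


if __name__ == "__main__":
    print(indexing_route(5_000_000))   # roughly the raw text size cited above
    print(indexing_route(40_000_000))  # larger than the crawl limit
```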
