Category: FAQ

Connecting Cludo to your cookie consent functionality

It is possible to add development to have the cookies set by Cludo connect with your general cookie consent formula. For general information on cookies set by Cludo, read this article. Technical Overview At the moment Cludo stores session information in local storage that persists across end-user visits, we will . . . Read more

How does the crawler index and delete pages?

Cludo’s strategy for crawling sites is based on finding as many pages as possible within the user-defined domains, indexing, and storing their content. Cludo’s crawlers run every 24 hours, plus the time it takes to complete the most recent crawl. The step-by-step process can be seen in detail in the . . . Read more

How are compound words treated in searches?

What is a compound word? A compound word is a word that consists of two or more nouns that together form a word with its own meaning, which is very typical in some languages, like the Scandinavian languages. How is a compound word treated in a search? When searching, a . . . Read more

How does Cludo index files?

As long as a file is machine-readable (not an image), Cludo is able to crawl its content along with the information sent with the HTTP headers. How to enable or disable file indexing By default, the crawler is configured to index files for the specified domain. You can enable or disable . . . Read more

How to avoid duplicate results?

When searching, you may experience the same content appearing more than once in the results. Since a crawler is unable to index the same URL twice, this will always be due to the same content existing on multiple URLs. Note: Having two crawlers that index the same pages added to . . . Read more

Why is this file not indexed?

Once a crawler has crawled the defined domain(s), you may experience a specific file not being added to the search index. This will typically be due to one of the following reasons:

How to set up Cludo for intranets

You may wish to implement Cludo on an intranet solution that is otherwise closed to the public. For this, you will want to consider how the crawler should access the site as well as how secure the implementation needs to be. Ways to allow crawling behind login In order to . . . Read more

How do I change the language in MyCludo?

The MyCludo interface supports both English and Danish. This guide will explain how to change the interface language for a user. Please note this language change does not affect any engines, visitors, or even other users in MyCludo. If the changed user is the currently logged-in user, the entire interface . . . Read more

How does Autocomplete work?

Intelligent Autocomplete is a feature for suggesting search terms as the visitor is typing. It uses machine learning to predict the most relevant suggestions for the specific engine. Based on previous visitor behavior and successful searches combined with result titles, Intelligent Autocomplete will predict the most likely search term for . . . Read more

What is XPath?

XPath (XML Path Language) is a path expression, used to refer to a specific subset of XML or HTML markup. In MyCludo, XPath can be used in the crawler configuration to define a specific part of the HTML of the page to use for a field. Using the HTML markup, different . . . Read more