Author: Cludo

Connecting Cludo to your cookie consent functionality

You can add custom development so that the cookies set by Cludo respect your site's general cookie consent setup. For general information on cookies set by Cludo, read this article. Technical Overview: At the moment, Cludo stores session information in local storage that persists across end-user visits, we will . . . Read more

How does the crawler index and delete pages?

Cludo’s strategy for crawling sites is based on finding as many pages as possible within the user-defined domains, indexing, and storing their content. The step-by-step process can be seen in detail in the diagram at the end of the article and will be explained further below: Crawling: Step-by-step process 1: Sites . . . Read more

Cludo’s API Analytics script

Cludo has developed a script specifically for API implementations. With this, you won’t have to manually set up tracking for your API implementation. Introduction: The Cludo API Analytics script is a JavaScript snippet to be placed on your search results page that will send query and click data to Cludo’s . . . Read more

How are compound words treated in searches?

What is a compound word? A compound word is a word that consists of two or more nouns that together form a word with its own meaning, which is very typical in some languages, like the Scandinavian languages. How is a compound word treated in a search? When searching, a . . . Read more

Filtering searches

If you would like an existing engine to only show results for a specific area, this can be done by adding a filter in the script. Scoped search allows you to limit search results to a specific section or type of content within the website instead of searching across the whole . . . Read more

How does Cludo index files?

As long as a file is machine-readable (not an image), Cludo is able to crawl its content along with the information sent with the HTTP headers. How to enable or disable file indexing: By default, the crawler is configured to index files for the specified domain. You can enable or disable . . . Read more

Page Inventory

If you’re ever wondering about the number of pages in your search results or find the need to check up on any indexed content, Page Inventory is here to help. Page Inventory will provide you with an overview of indexed content for all your crawlers to provide you with a . . . Read more

How to avoid duplicate results?

When searching, you may experience the same content appearing more than once in the results. Since a crawler is unable to index the same URL twice, this will always be due to the same content existing on multiple URLs. Note: Having two crawlers that index the same pages added to . . . Read more
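
As a rough illustration of the idea that duplicates come from identical content living on more than one URL, the sketch below groups URLs by a hash of their text. It is not Cludo's implementation or API: the function name find_duplicate_urls, the sample URLs, and the whitespace and case normalization are all assumptions made for this example.

```python
import hashlib
from collections import defaultdict


def find_duplicate_urls(pages: dict[str, str]) -> list[list[str]]:
    """Group URLs whose text content is identical after light normalization.

    `pages` maps a URL to the text extracted from that page (hypothetical input).
    """
    groups: dict[str, list[str]] = defaultdict(list)
    for url, content in pages.items():
        # Collapse whitespace and ignore case so trivial differences do not matter.
        normalized = " ".join(content.split()).lower()
        digest = hashlib.sha256(normalized.encode("utf-8")).hexdigest()
        groups[digest].append(url)
    # Keep only the groups where the same content appears on several URLs.
    return [sorted(urls) for urls in groups.values() if len(urls) > 1]


# Hypothetical sample data: the same text is served on two different URLs.
pages = {
    "https://example.com/about": "About our company",
    "https://example.com/about?ref=footer": "About   our company",
    "https://example.com/contact": "Contact us",
}
duplicates = find_duplicate_urls(pages)
```

Running a check like this against a list of crawled pages can help identify which URLs serve the same content before deciding how to handle them.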

Why is this file not indexed?

Once a crawler has crawled the defined domain(s), you may experience a specific file not being added to the search index. This will typically be due to one of the following reasons:

Web Content Highlighter

What is Web Content Highlighter? Web Content Highlighter is a feature that highlights the search query on the specific landing page the visitor has selected from the search results. This makes it easier for the visitor to find exactly what they were searching for. This way, the feature helps to . . . Read more