Tag: Crawling

Why is this page not indexed?

Once a crawler has crawled the defined domain(s), you may experience a specific page not being added to the search index. This will typically be due to one of the following reasons: Feel free to contact support if you have further questions on why a page was not indexed as . . . Read more

How to configure a crawl delay

A crawl delay will limit the frequency at which the crawler will request pages on a website. This results in an overall slower crawl but can prevent overloading the website with too many requests at once. This is rarely needed, but it can be useful for “sensitive” servers that don’t . . . Read more

How to set up async crawling

The async crawling feature cannot currently be set up using MyCludo and requires interference from a trained Cludo employee, as this is always configured manually depending on the site structure. In order to have async crawling configured, please reach out to support.

How to review the crawler log

The crawler log offers insights into the activity of the crawler. It allows for further investigation into which pages were indexed and which were not and why.

Exclude pages

It is possible to exclude pages in different ways, each with their own set of pros and cons. It is recommended to read and understand all the ways of excluding pages before setting up the exclusion logic to ensure the best setup for your needs. How to exclude pages via . . . Read more

Best practice for async crawling

If your website has content that is rendered asynchronously, you may want to look into async crawling. However, before deciding if there is a need for this, it is recommended to consider the following: As async crawling requires more resources, it is recommended to first assess if there are easy-to-implement . . . Read more