How to use cludooff/cludoon

If there are certain parts of the content on a page that you would like the crawler to ignore, this can be achieved using cludooff/cludoon.

The cludooff/cludoon feature essentially helps you:

  • Exclude unwanted or unpredictable content from being indexed and become searchable, such as comment sections and user-generated content, or modules that appear on all pages, such as ratings (\”Was this page useful?\”), commands (share/print), navigation elements, or notices (such as a cookie notice)
  • Improve the relevancy of your search
  • Suppress certain information from being highlighted in a result

How and when to use cludooff/cludoon

When setting up a crawler, it is generally recommended to ensure that it doesn’t pick up elements that shouldn’t be part of the information for a page, such as the header, the footer, side menus, or breadcrumbs. However, the structure of a page doesn’t always allow you to refine the indexed content as much as we’d like it to. On top of that, there may be text or content that you do not want to index within the content that you’d otherwise like indexed, such as image descriptions. In any case, cludooff/cludoon will help you.

If you’re seeing that the crawler is picking up content that you do not want to be searchable and/or appear in the results, use the cludooff/cludoon functionality simply by adding <!--cludooff: index--> before the content that you do not want to be indexed. Then, add <!--cludoon: index--> where the crawler is allowed to index content again.

Example

<body>
  <p>Content I want indexable</p> 
  <!--cludooff: index-->
  <p> Content I don't want indexed</p>
  <!--cludoon: index-->
  <p>More content I want indexable</p>
</body>


The text that appears between the <!--cludooff: index--> and <!--cludoon: index--> tags will not be indexed by Cludo’s crawler.

If you need help setting up cludooff/cludoon, don’t hesitate to get in touch with support.

Tags: