Exclude Pages from Indexing



  • Question

    What is the best way to exclude multiple pages from search when they are not in the same folder structure?

    Environment: Production
    Reported product version: M'21
    Resolved in version: M'21
    Module: Content Sources

    Answer

    To exclude specific pages during crawling, add their URLs to the "Should not crawl" section of the content source. Each URL is listed individually, so the pages do not need to share a folder structure. Follow these steps:

    1. Navigate to Content Sources in the Admin Panel.
    2. Edit the website-type content source from which you want to exclude the pages.
    3. Open the Rule tab and click "By Filter".
    4. Add the page URLs in the "Should not crawl" section. This prevents these pages from being indexed and from appearing on search results pages.


    Note: These URLs are also excluded by automatic crawls that run at the configured frequency. This feature is available only for website-type content sources.
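
    As a conceptual illustration only, the effect of a "Should not crawl" list can be sketched as a filter applied to the crawl queue before any page is fetched. The sketch below is not the product's implementation: the function names, the example.com URLs, and the exact-match normalization rule are all assumptions, and the actual matching behavior of the "By Filter" rule may differ.

```python
from urllib.parse import urlsplit


def normalize(url: str) -> str:
    """Lowercase the scheme and host, drop any fragment and trailing slash
    so that equivalent URLs compare equal (an assumed rule, for illustration)."""
    parts = urlsplit(url.strip())
    path = parts.path.rstrip("/") or "/"
    query = f"?{parts.query}" if parts.query else ""
    return f"{parts.scheme.lower()}://{parts.netloc.lower()}{path}{query}"


def filter_crawl_queue(candidates, should_not_crawl):
    """Return the candidate URLs that are not on the exclusion list."""
    blocked = {normalize(u) for u in should_not_crawl}
    return [u for u in candidates if normalize(u) not in blocked]


# Hypothetical exclusion list: pages from unrelated folders, listed one by one.
should_not_crawl = [
    "https://example.com/docs/legacy/setup",
    "https://example.com/blog/2019/old-announcement/",
    "https://example.com/support/internal-faq",
]

# Hypothetical URLs discovered by a crawl.
candidates = [
    "https://example.com/docs/getting-started",
    "https://example.com/docs/legacy/setup/",
    "https://EXAMPLE.com/blog/2019/old-announcement",
    "https://example.com/support/contact",
]

to_crawl = filter_crawl_queue(candidates, should_not_crawl)
print(to_crawl)
```

    Because the same list is consulted on every run, a scheduled crawl would skip the same pages as a manual one, which mirrors the behavior described in the note above.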


