The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Intermediate & Advanced SEO
    4. How to prevent Google from crawling our product filter?

    How to prevent Google from crawling our product filter?

    Intermediate & Advanced SEO
    4 2 2.6k
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • footsteps
      footsteps last edited by

      Hi All,

      We have a crawler problem on one of our sites www.sneakerskoopjeonline.nl.

      On this site, visitors can specify criteria to filter available products. These filters are passed as http/get arguments. The number of possible filter urls is virtually limitless.

      In order to prevent duplicate content, or an insane amount of pages in the search indices, our software automatically adds noindex, nofollow and noarchive directives to these filter result pages. However, we’re unable to explain to crawlers (Google in particular) to ignore these urls.

      We’ve already changed the on page filter html to javascript, hoping this would cause the crawler to ignore it. However, it seems that Googlebot executes the javascript and crawls the generated urls anyway.

      What can we do to prevent Google from crawling all the filter options?

      Thanks in advance for the help.

      Kind regards,

      Gerwin

      1 Reply Last reply Reply Quote 0
      • alexhoug
        alexhoug last edited by

        I would use your robots.txt file to prevent them from crawling the specific strings / pages. Go into your Google Webmaster Tools and you can see all the information Google has on your site and any issues, you can also specify robots.txt information in there. That would be the best route as Google is obedient with what is on the robots.txt file. If you want more information about robots.txt, go here.

        footsteps 2 Replies Last reply Reply Quote 1
        • footsteps
          footsteps @alexhoug last edited by

          The url looks like this;

          http://www.sneakerskoopjeonline.nl/herensneakers?product_brand=

          So just adding;

          User-agent: *
          Disallow: /*?product_brand

          Should do the trick?
          Most important is that herensneakers itself should be indexed, followed and crawled

          1 Reply Last reply Reply Quote 0
          • footsteps
            footsteps @alexhoug last edited by

            The following is added to our robots.txt .. now lets wait and see the results

            User-agent: * Disallow: /admin/
            Disallow: /?
            Allow /?product_date=&product_date2=*
            Disallow /?product_date=&product_date2=&

            To check the working of the robots.txt i found a handy website;

            http://phpweby.com/services/robots

            1 Reply Last reply Reply Quote 0
            • 1 / 1
            • First post
              Last post
            • I want to do tracking of normal product and sample product conversion separately in google adwords
              dsouzac
              dsouzac
              0
              3
              73

            • Prevent Google from crawling Ajax
              Shawn_Huber
              Shawn_Huber
              0
              3
              1.0k

            • Why is my site not getting crawled by google?
              DonnaDuncan
              DonnaDuncan
              0
              4
              162

            • Google Webmaster successfully fetched one of my webpages. Does that mean Google will crawl them or readable by bots?
              Gyorgy
              Gyorgy
              0
              2
              72

            • Google and Product Description Tabs
              ReferralCandy
              ReferralCandy
              0
              4
              278

            • Is there a way to contact Google besides the google product forum?
              trophycentraltrophiesandawards
              trophycentraltrophiesandawards
              0
              3
              377

            • How to stop Google crawling after 301 redirect?
              irvingw
              irvingw
              0
              2
              997

            • Is there any delay between crawling a page by google and displaying of the ratings in rich snippet of the results in google?
              NEWCRAFT
              NEWCRAFT
              0
              3
              545

            Get started with Moz Pro!

            Unlock the power of advanced SEO tools and data-driven insights.

            Start my free trial
            Products
            • Moz Pro
            • Moz Local
            • Moz API
            • Moz Data
            • STAT
            • Product Updates
            Moz Solutions
            • SMB Solutions
            • Agency Solutions
            • Enterprise Solutions
            • Digital Marketers
            Free SEO Tools
            • Domain Authority Checker
            • Link Explorer
            • Keyword Explorer
            • Competitive Research
            • Brand Authority Checker
            • Local Citation Checker
            • MozBar Extension
            • MozCast
            Resources
            • Blog
            • SEO Learning Center
            • Help Hub
            • Beginner's Guide to SEO
            • How-to Guides
            • Moz Academy
            • API Docs
            About Moz
            • About
            • Team
            • Careers
            • Contact
            Why Moz
            • Case Studies
            • Testimonials
            Get Involved
            • Become an Affiliate
            • MozCon
            • Webinars
            • Practical Marketer Series
            • MozPod
            Connect with us

            Contact the Help team

            Join our newsletter
            Moz logo
            © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
            • Accessibility
            • Terms of Use
            • Privacy