The Moz Q&A Forum

    Why is Roger crawling pages that are disallowed in my robots.txt file?

    Moz Tools
    • MeltButterySpread

      I have specified the following in my robots.txt file:

      Disallow: /catalog/product_compare/

      Yet Roger is crawling these pages, which shows up as 1,357 errors.

      Is this a bug or am I missing something in my robots.txt file?

      Here's one of the URLs that Roger pulled:

      example.com/catalog/product_compare/add/product/19241/uenc/aHR0cDovL2ZyZXNocHJvZHVjZWNsb3RoZXMuY29tL3RvcHMvYWxsLXRvcHM_cD02/

      Please let me know if my problem is in robots.txt or if Roger spaced this one. Thanks!
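One way to sanity-check the rule itself is to run it through a standard robots.txt parser. The sketch below uses Python's built-in urllib.robotparser. The "User-agent: *" line, the "http://" scheme on the URL, and the is_allowed helper name are assumptions added for illustration. It only shows how Python's parser reads the file, not how rogerbot actually interprets it.

```python
from urllib.robotparser import RobotFileParser

# Assumed robots.txt content: the Disallow line from the post under a
# catch-all User-agent group (the group line is an assumption).
ROBOTS_TXT = """\
User-agent: *
Disallow: /catalog/product_compare/
"""

# URL from the post, with an "http://" scheme added so it parses as a full URL.
COMPARE_URL = (
    "http://example.com/catalog/product_compare/add/product/19241/uenc/"
    "aHR0cDovL2ZyZXNocHJvZHVjZWNsb3RoZXMuY29tL3RvcHMvYWxsLXRvcHM_cD02/"
)


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if Python's parser says user_agent may fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)
```

If is_allowed(ROBOTS_TXT, "rogerbot", COMPARE_URL) comes back False, the rule is at least well-formed for a prefix-matching parser, which would point toward the crawler rather than the file.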

      • blu42media

        Have you specified a User-Agent?

          • MeltButterySpread @blu42media

          Yes, it applies to all crawlers (User-agent: *).

          • blu42media

            Digging back through the Q&A... I'm seeing several posts reporting this sort of thing.

            http://www.seomoz.org/dp/rogerbot

            Perhaps you could try specifically blocking rogerbot?  If that doesn't work, an email to the SEOmoz team may do the trick 🙂
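For reference, a rogerbot-specific group could look like the robots.txt text embedded in the sketch below. The file contents, the "http://" scheme, the "SomeOtherBot" name, and the blocked_agents helper are all hypothetical, and the check uses Python's urllib.robotparser, so it says nothing definitive about how rogerbot itself reads the file.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: a group aimed at rogerbot plus the catch-all group.
ROBOTS_TXT_BOTH = """\
User-agent: rogerbot
Disallow: /catalog/product_compare/

User-agent: *
Disallow: /catalog/product_compare/
"""

# Hypothetical variant with only the rogerbot group, for comparison.
ROBOTS_TXT_ROGER_ONLY = """\
User-agent: rogerbot
Disallow: /catalog/product_compare/
"""

# Shortened, assumed URL under the disallowed path.
URL = "http://example.com/catalog/product_compare/add/product/19241/"


def blocked_agents(robots_txt: str, agents: list[str], url: str) -> list[str]:
    """Return the agents that Python's parser says may NOT fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return [agent for agent in agents if not parser.can_fetch(agent, url)]
```

Calling blocked_agents(ROBOTS_TXT_ROGER_ONLY, ["rogerbot", "SomeOtherBot"], URL) shows which crawlers a given file covers, which helps confirm a new rogerbot group was added without dropping the catch-all rule.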

              • MeltButterySpread @blu42media

              Digging in further, I discovered that rogerbot had honored the block for a portion of these URL variations, but 2/3 slipped through. I sent an email to support. Thanks for the suggestion.

