The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Moz Tools
    4. Crawl Errors Confusing Me

    Crawl Errors Confusing Me

    Moz Tools
    7 4 823
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • mjtaylor
      mjtaylor last edited by

      The SEOMoz crawl tool is telling me that I have a slew of crawl errors on the blog of one domain. All are related to the MSNbot. And related to trackbacks (which we do want to block, right?) and attachments (makes sense to block those, too) ... any idea why these are crawl issues with MSNbot and not Google? My robots.txt is here: http://www.wevegotthekeys.com/robots.txt.

      Thanks, MJ

      1 Reply Last reply Reply Quote 0
      • ENSO
        ENSO last edited by

        I have the same problem looks like MSN bot is disallowed from accessing wordpress content. So pages show up as ?page=111 so from what I understand so far anything that shows as below is blocked from MSNbot. I don't have a definite answer for you as to what to do, but I can tell you will need to "allow" msn bot the googlebot is.

        Disallow: /key-west-blog/*?*
        
        1 Reply Last reply Reply Quote 0
        • DanDeceuster
          DanDeceuster last edited by

          In your robots.txt file, you have the Disallow: command under MSNbot and Noindex: under Googlebot. Noindex is not a robots.txt command. Change Noindex: to Disallow: and those pages will be blocked for all bots. Not sure if that is what is causing the issue, but that would explain the discrepancy. If you want to noindex a page, you do it with a meta tag like this:

          You can change follow to nofollow if you want, really doesn't matter much.

          mjtaylor 1 Reply Last reply Reply Quote 0
          • mjtaylor
            mjtaylor @DanDeceuster last edited by

            The robots.txt file DOES contain

            User-agent: Msnbot
            Crawl-delay: 120
            Disallow: /key-west-blog/*?*
            Disallow: /key-west-blog/*.rss
            Disallow: /key-west-blog/*feed
            Disallow: /key-west-blog/*trackback
            Disallow: /key-west-blog/*wp-
            Disallow: /key-west-blog/*login.php
            Disallow: /key-west-blog/tag/
            Disallow: /key-west-blog/search/
            Disallow: /key-west-blog/archives/
            Disallow: /key-west-blog/category/
            Disallow: /key-west-blog/2009
            Disallow: /key-west-blog/2010
            
            But you are saying I should remove the lines with noindex? 
            
            DanDeceuster mjtaylor 2 Replies Last reply Reply Quote 0
            • DanDeceuster
              DanDeceuster @mjtaylor last edited by

              I am saying this:

              User-agent: Googlebot
              Noindex: /key-west-blog/*?*
              Noindex: /key-west-blog/*.rss
              Noindex: /key-west-blog/*feed
              Noindex: /key-west-blog/*trackback
              Noindex: /key-west-blog/*wp-
              Noindex: /key-west-blog/tag/
              Noindex: /key-west-blog/search/
              Noindex: /key-west-blog/archives/
              Noindex: /key-west-blog/category/
              Noindex: /key-west-blog/2009
              Noindex: /key-west-blog/2010
              
              and this:
              
              

              User-agent: Googlebot-Mobile
              Noindex: /key-west-blog/?
              Noindex: /key-west-blog/*.rss
              Noindex: /key-west-blog/*feed
              Noindex: /key-west-blog/*trackback
              Noindex: /key-west-blog/*wp-
              Noindex: /key-west-blog/tag/
              Noindex: /key-west-blog/search/
              Noindex: /key-west-blog/archives/
              Noindex: /key-west-blog/category/
              Noindex: /key-west-blog/2009
              Noindex: /key-west-blog/2010

              
              They use Noindex which is a syntax I am unfamiliar with in robots.txt. So you can check out http://www.robotstxt.org/robotstxt.html for more info on robots.txt and proper syntaxt. I would change Noindex: to Disallow: and that should fix the error in the robots.txt file.
              
              1 Reply Last reply Reply Quote 1
              • mjtaylor
                mjtaylor @mjtaylor last edited by

                Yes, I thought that's what you meant ... thanks!

                1 Reply Last reply Reply Quote 0
                • Cyrus-Shepard
                  Cyrus-Shepard last edited by

                  I'm a little late to the party, but I want to summarize what I see as the answer.

                  1. The "Search Engine Blocked by Robots.txt" is only a warning, and not an error. If you intend for these pages not to get crawled (and it does seem like you have a good reason for this), then there is nothing to worry about.

                  2. The reason the warning appears for MSNbot and not Google is that currently, your robots.txt allows Google to crawl those files. As Daniel pointed out, you would need to add the identical directives to your robots.txt file to make this happen. Does that make sense? Or you could just add all of these files under the * directive to apply to all robots.

                  1 Reply Last reply Reply Quote 1
                  • 1 / 1
                  • First post
                    Last post
                  • Does anyone know the linking of hashtags on Wix sites does it negatively or postively impact SEO. It is coming up as an error in site crawls 'Pages with 404 errors' Anyone got any experience please?
                    Nozzle
                    Nozzle
                    0
                    5
                    82

                  • Since July 1, we've had a HUGE jump in errors on our weekly crawl. We don't think anything has changed on our website. Has MOZ changed something that would account for a large leap in duplicate content and duplicate title errors?
                    KristyFord
                    KristyFord
                    0
                    3
                    77

                  • When I did my first crawl, I was given some errors.
                    immortalgamer
                    immortalgamer
                    0
                    3
                    56

                  • Seo moz has only crawled 2 pages of my site. Ive been notified of a 403 error and need an answer as to why my pages are not being crawled?
                    nitro-digital
                    nitro-digital
                    0
                    9
                    319

                  • Crawl Diagnostic Errors
                    rosstaylor
                    rosstaylor
                    0
                    9
                    923

                  • SEOmoz crawl error questions
                    Malarowski
                    Malarowski
                    0
                    4
                    528

                  • Errors on my Crawl Diagnostics
                    rhutchings
                    rhutchings
                    0
                    2
                    645

                  • Crawl test. Bot crawled only 200 or so links when it should have crawled thousands
                    Ev84
                    Ev84
                    0
                    9
                    1.2k

                  Get started with Moz Pro!

                  Unlock the power of advanced SEO tools and data-driven insights.

                  Start my free trial
                  Products
                  • Moz Pro
                  • Moz Local
                  • Moz API
                  • Moz Data
                  • STAT
                  • Product Updates
                  Moz Solutions
                  • SMB Solutions
                  • Agency Solutions
                  • Enterprise Solutions
                  • Digital Marketers
                  Free SEO Tools
                  • Domain Authority Checker
                  • Link Explorer
                  • Keyword Explorer
                  • Competitive Research
                  • Brand Authority Checker
                  • Local Citation Checker
                  • MozBar Extension
                  • MozCast
                  Resources
                  • Blog
                  • SEO Learning Center
                  • Help Hub
                  • Beginner's Guide to SEO
                  • How-to Guides
                  • Moz Academy
                  • API Docs
                  About Moz
                  • About
                  • Team
                  • Careers
                  • Contact
                  Why Moz
                  • Case Studies
                  • Testimonials
                  Get Involved
                  • Become an Affiliate
                  • MozCon
                  • Webinars
                  • Practical Marketer Series
                  • MozPod
                  Connect with us

                  Contact the Help team

                  Join our newsletter
                  Moz logo
                  © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
                  • Accessibility
                  • Terms of Use
                  • Privacy