The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Technical SEO Issues
    4. GWT False Reporting or GoogleBot has weird crawling ability?

    GWT False Reporting or GoogleBot has weird crawling ability?

    Technical SEO Issues
    5 3 162
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • baldnut
      baldnut last edited by

      Hi I hope someone can help me.

      I have launched a new website and trying hard to make everything perfect. I have been using Google Webmaster Tools (GWT) to ensure everything is as it should be but the crawl errors being reported do not match my site. I mark them as fixed and then check again the next day and it reports the same or similar errors again the next day.

      Example:

      http://www.mydomain.com/category/article/ (this would be a correct structure for the site).

      GWT reports:

      http://www.mydomain.com/category/article/category/article/ 404 (It does not exist, never has and never will) I have been to the pages listed to be linking to this page and it does not have the links in this manner. I have checked the page source code and all links from the given pages are correct structure and it is impossible to replicate this type of crawl.

      This happens accross most of the site, I have a few hundred pages all ending in a trailing slash and most pages of the site are reported in this manner making it look like I have close to 1000, 404 errors when I am not able to replicate this crawl using many different methods.

      The site is using a htacess file with redirects and a rewrite condition.

      Rewrite Condition:

      Need to redirect when no trailing slash

      RewriteCond %{REQUEST_FILENAME} !-f
      RewriteCond %{REQUEST_FILENAME} !.(html|shtml)$
      RewriteCond %{REQUEST_URI} !(.)/$
      RewriteRule ^(.
      )$ /$1/ [L,R=301]

      The above condition forces the trailing slash on folders.

      Then we are using redirects in this manner:

      Redirect 301 /article.html http://www.domain.com/article/

      In addition to the above we had a development site whilst I was building the new site which was http://dev.slimandsave.co.uk now this had been spidered without my knowledge until it was too late. So when I put the site live I left the development domain in place (http://dev.domain.com) and redirected it like so:

      <ifmodule mod_rewrite.c="">RewriteEngine on
        RewriteRule ^ - [E=protossl]
        RewriteCond %{HTTPS} on
        RewriteRule ^ - [E=protossl:s]

      RewriteRule ^ http%{ENV:protossl}://www.domain.com%{REQUEST_URI} [L,R=301]</ifmodule>

      Is there anything that I have done that would cause this type of redirect 'loop' ?

      Any help greatly appreciated.\

      1 Reply Last reply Reply Quote 0
      • Whebb
        Whebb last edited by

        Doesn't sound like GWT is false reporting. May want to check your trailing slash URL rewrite. It seems like there is an issue there as what you are describing sounds like the URLs are being written incorrectly and causing the incorrect URLs to be generated and show up in GWT.

        Your 301 looks ok and if the dev site was spidered and indexed, you should just add the site to GWT and then use the URL removal tool to remove the site from the index, then remove the site and redirect.

        CommT 1 Reply Last reply Reply Quote 1
        • baldnut
          baldnut last edited by

          Sorry I also should add that the url structure that google generates is like this:

          http://www.domain.com/category/article/

          http://www.domain.com/category/article/same-category/differentarticle/

          http://www.domain.com/category/article/same-category/another-different-article/

          http://www.domain.com/category/article/another-different-category/differentarticle/

          etc, it is like it gets to a category article and then moves sideways and somehow adds the move onto the current url without keeping hold of the suffix of the URL

          1 Reply Last reply Reply Quote 0
          • baldnut
            baldnut last edited by

            Anyone any thoughts on this?

            1 Reply Last reply Reply Quote 0
            • CommT
              CommT @Whebb last edited by

              Yeah - do this!

              1 Reply Last reply Reply Quote 0
              • 1 / 1
              • First post
                Last post
              • Googlebot crawl error Javascript method is not defined
                BlueprintMarketing
                BlueprintMarketing
                0
                4
                33

              • Strange Crawl Report
                Neon_Rain
                Neon_Rain
                0
                3
                105

              • Ignore these external links reported in GWT?
                EEE3
                EEE3
                0
                3
                73

              • Can Googlebot crawl the content on this page?
                danatanseo
                danatanseo
                0
                7
                156

              • GWT crawl errors: How big a ranking issue?
                Keszi
                Keszi
                0
                2
                117

              • Crawl Diagnostics Report 500 erorr
                Cyrus-Shepard
                Cyrus-Shepard
                0
                9
                568

              • False Negative Warnings with Crawl Diagnostic Test
                wishack
                wishack
                1
                16
                2.2k

              • Crawl report showing only 1 crawled page
                Mikpam
                Mikpam
                0
                4
                913

              Get started with Moz Pro!

              Unlock the power of advanced SEO tools and data-driven insights.

              Start my free trial
              Products
              • Moz Pro
              • Moz Local
              • Moz API
              • Moz Data
              • STAT
              • Product Updates
              Moz Solutions
              • SMB Solutions
              • Agency Solutions
              • Enterprise Solutions
              • Digital Marketers
              Free SEO Tools
              • Domain Authority Checker
              • Link Explorer
              • Keyword Explorer
              • Competitive Research
              • Brand Authority Checker
              • Local Citation Checker
              • MozBar Extension
              • MozCast
              Resources
              • Blog
              • SEO Learning Center
              • Help Hub
              • Beginner's Guide to SEO
              • How-to Guides
              • Moz Academy
              • API Docs
              About Moz
              • About
              • Team
              • Careers
              • Contact
              Why Moz
              • Case Studies
              • Testimonials
              Get Involved
              • Become an Affiliate
              • MozCon
              • Webinars
              • Practical Marketer Series
              • MozPod
              Connect with us

              Contact the Help team

              Join our newsletter
              Moz logo
              © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
              • Accessibility
              • Terms of Use
              • Privacy