The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Intermediate & Advanced SEO
    4. Googlebot crawling partial URLs

    Googlebot crawling partial URLs

    Intermediate & Advanced SEO
    4 3 1.2k
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • panini
      panini last edited by

      Hi guys,

      I've checked my email this morning and I've got a number of 404 errors over the weekend where Google has tried to crawl some of my existing pages but not found the full URL.

      Instead of hitting 'domain.com/folder/complete-pagename.php' it's hit 'domain.com/folder/comp'.

      This is definitely Googlebot/2.1; http://www.google.com/bot.html (66.249.72.53) but I can't find where it would have found only the partial URL. It certainly wasn't on the domain it's crawling and I can't find any links from external sites pointing to us with the incorrect URL. GoogleBot is doing the same thing across a single domain but in different sub-folders.

      Having checked Webmaster Tools there aren't any hard 404s and the soft ones aren't related and haven't occured since August. I'm really confused as to how this is happening..

      Thanks!

      1 Reply Last reply Reply Quote 0
      • irvingw
        irvingw last edited by

        I'm seeing it too - It looks like it's coming from Superpages but the truncated URLs are not actually hyperlinks, so why is Google following them is a good question.

        http://swbd-out.superpages.com/webresults.htm?qkw=Find+A+Physician&qcat=web

        I'm fixing this on my end with a modrewrite in HTACCESS, all of my sites truncated URL problems either end in ".." or "..." so any URL that ends in those two instances will get 301 redirected to the homepage.

        panini 1 Reply Last reply Reply Quote 0
        • panini
          panini @irvingw last edited by

          @vitalscom - it's at least good to know someone else has experienced this!

          Due to the volume I don't consider doing 301s a permanent solution. Fortunately there is a noindex on our 404 page so Google et al shouldn't take these errors into consideration.

          1 Reply Last reply Reply Quote 0
          • Improvements
            Improvements last edited by

            This is why I love this forum. We recently started seeing these urls in our GWT report. We have hundreds of truncated urls that end in "..." that go nowhere. We can't figure out where these are coming from. We thought it could be G's relatively new privacy policy w/ not passing along the data, but we're not sure. Anyone have any thoughts on that?

            Thanks!

            1 Reply Last reply Reply Quote 0
            • 1 / 1
            • First post
              Last post
            • Suggested Screaming Frog configuration to mirror default Googlebot crawl?
              0
              1
              68

            • 301 vs Canonical - With A Side of Partial URL Rewrite and Google URL Parameters-OH MY
              TStorm
              TStorm
              1
              3
              53

            • When I crawl my website I have urls with (#!162738372878) at the end of my urls
              Matt16
              Matt16
              0
              3
              44

            • Duplicate page url crawl report
              MikeRoberts
              MikeRoberts
              0
              5
              78

            • URL Parameter & crawl stats
              katemorris
              katemorris
              0
              4
              232

            • Lots of incorrect urls indexed - Googlebot found an extremely high number of URLs on your site
              SarahCollins
              SarahCollins
              0
              8
              1.1k

            • Googlebot found an extremely high number of URLs on your site
              Myntra
              Myntra
              0
              5
              2.6k

            • Why do old URL format are still being crawled by Rogerbot?
              Trigun
              Trigun
              0
              9
              858

            Get started with Moz Pro!

            Unlock the power of advanced SEO tools and data-driven insights.

            Start my free trial
            Products
            • Moz Pro
            • Moz Local
            • Moz API
            • Moz Data
            • STAT
            • Product Updates
            Moz Solutions
            • SMB Solutions
            • Agency Solutions
            • Enterprise Solutions
            • Digital Marketers
            Free SEO Tools
            • Domain Authority Checker
            • Link Explorer
            • Keyword Explorer
            • Competitive Research
            • Brand Authority Checker
            • Local Citation Checker
            • MozBar Extension
            • MozCast
            Resources
            • Blog
            • SEO Learning Center
            • Help Hub
            • Beginner's Guide to SEO
            • How-to Guides
            • Moz Academy
            • API Docs
            About Moz
            • About
            • Team
            • Careers
            • Contact
            Why Moz
            • Case Studies
            • Testimonials
            Get Involved
            • Become an Affiliate
            • MozCon
            • Webinars
            • Practical Marketer Series
            • MozPod
            Connect with us

            Contact the Help team

            Join our newsletter
            Moz logo
            © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
            • Accessibility
            • Terms of Use
            • Privacy