The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Technical SEO Issues
    4. Google tries to index non existing language URLs. Why?

    Google tries to index non existing language URLs. Why?

    Technical SEO Issues
    2 2 34
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • TheHecksler
      TheHecksler last edited by

      Hi,

      I am working for a SAAS client. He uses two different language versions by using two different subdomains.
      de.domain.com/company for german and en.domain.com for english. Many thousands URLs has been indexed correctly.

      But Google Search Console tries to index URLs which were never existing before and are still not existing.

      de.domain.com**/en/company
      en.domain.com
      /de/**company

      ... and an thousand more using the /en/ or /de/ in between. We never use this variant and calling these URLs will throw up a 404 Page correctly (but with wrong respond code  -  we`re fixing that 😉 ). But Google tries to index these kind of URLs again and again. And, I couldnt find any source of these URLs. No Website is using this as an out going link, etc.
      We do see in our logfiles, that a Screaming Frog Installation and moz.com w opensiteexplorer were trying to access this earlier.

      My Question: How does Google comes up with that? From where did they get these URLs, that (to our knowledge) never existed?

      Any ideas? Thanks 🙂

      1 Reply Last reply Reply Quote 0
      • NickSamuel
        NickSamuel last edited by

        Hi Hecksler,

        Did you ever resolve this?

        Quick idea from me is to double check ALL version of your website within Google Search Console. You can now register the entire domain property using DNS: https://searchengineland.com/how-to-set-up-google-search-console-domain-verification-for-site-wide-reporting-data-313256

        I found that Google was trying to crawl a very old HTTP sitemap from about five years ago for one of my sites, and thus I was able to delete it.

        There's some mixed comments/feeling within the Search Community about whether or not GoogleBot really "guesses" URLs, so it's probably more than likely they are getting the links from somewhere....https://stackoverflow.com/questions/20855082/googlebot-guesses-urls-how-to-avoid-handle-this-crawling

        Look forward to hearing from you,

        Nick

        1 Reply Last reply Reply Quote 0
        • 1 / 1
        • First post
          Last post
        • Google is indexing bad URLS
          effectdigital
          effectdigital
          0
          8
          255

        • Google Indexing Pages with Made Up URL
          evolvingSEO
          evolvingSEO
          0
          6
          120

        • Vanity URLs are being indexed in Google
          seogirl22
          seogirl22
          1
          3
          879

        • 404 errors on non-existent URLs
          AJ234
          AJ234
          0
          3
          609

        • Wordpress URL weirdness - why is google registering non-pretty URLS?
          peterdbaron
          peterdbaron
          0
          4
          585

        • Non existant URLs being generated in index
          WillBlackburn
          WillBlackburn
          0
          6
          544

        • Canonical for non-exist URL ?
          Theo-NL
          Theo-NL
          0
          2
          738

        • Why google index my IP URL
          DarwinChinaSEO
          DarwinChinaSEO
          0
          6
          1.4k

        Get started with Moz Pro!

        Unlock the power of advanced SEO tools and data-driven insights.

        Start my free trial
        Products
        • Moz Pro
        • Moz Local
        • Moz API
        • Moz Data
        • STAT
        • Product Updates
        Moz Solutions
        • SMB Solutions
        • Agency Solutions
        • Enterprise Solutions
        • Digital Marketers
        Free SEO Tools
        • Domain Authority Checker
        • Link Explorer
        • Keyword Explorer
        • Competitive Research
        • Brand Authority Checker
        • Local Citation Checker
        • MozBar Extension
        • MozCast
        Resources
        • Blog
        • SEO Learning Center
        • Help Hub
        • Beginner's Guide to SEO
        • How-to Guides
        • Moz Academy
        • API Docs
        About Moz
        • About
        • Team
        • Careers
        • Contact
        Why Moz
        • Case Studies
        • Testimonials
        Get Involved
        • Become an Affiliate
        • MozCon
        • Webinars
        • Practical Marketer Series
        • MozPod
        Connect with us

        Contact the Help team

        Join our newsletter
        Moz logo
        © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
        • Accessibility
        • Terms of Use
        • Privacy