The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. International Issues
    4. International Sites and Duplicate Content

    International Sites and Duplicate Content

    International Issues
    4 2 320
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • guidoampollini
      guidoampollini last edited by

      Hello,

      I am working on a project where have some doubts regarding the structure of international sites and multi languages.Website is in the fashion industry. I think is a common problem for this industry. Website is translated in 5 languages and sell in 21 countries.

      As you can imagine this create a huge number of urls, so much that with ScreamingFrog I cant even complete the crawling.

      Perhaps the UK site is visible in all those versions

      http://www.MyDomain.com/en/GB/

      http://www.MyDomain.com/it/GB/

      http://www.MyDomain.com/fr/GB/

      http://www.MyDomain.com/de/GB/

      http://www.MyDomain.com/es/GB/

      Obviously for SEO only the first version is important

      One other example, the French site is available in 5 languages and again...

      http://www.MyDomain.com/fr/FR/

      http://www.MyDomain.com/en/FR/

      http://www.MyDomain.com/it/FR/

      http://www.MyDomain.com/de/FR/

      http://www.MyDomain.com/es/FR/

      And so on...this is creating 3 issues mainly:

      1. Endless crawling - with crawlers not focusing on most important pages

      2. Duplication of content

      3. Wrong GEO urls ranking in Google

      I have already implemented href lang but didn't noticed any improvements. Therefore my question is

      Should I exclude with "robots.txt" and "no index" the non appropriate targeting?

      Perhaps for UK leave crawable just English version i.e. http://www.MyDomain.com/en/GB/, for France just the French version http://www.MyDomain.com/fr/FR/ and so on

      What I would like to get doing this is to have the crawlers more focused on the important SEO pages, avoid content duplication and wrong urls rankings on local Google

      Please comment

      1 Reply Last reply Reply Quote 0
      • antonioaraya
        antonioaraya last edited by

        Don't know why you have a UK oriented site for German and Italian people, I think is not important those languages in a country mainly English speaking (not US for example, there you must have a Spanish version, or in Canada for English and French). The owner must have their reasons.

        Besides this, about your questions:

        • If those non-relevant languages must live there, it's correct to implement HREF LANG (may take some time to show results). Also, if the domain is gTLD, you can validate all the subfolders in Google Search Console and choose the proper International targeting. With the ammount of languages and countries I imagine this might be a pain in the ***.
        • About the crawling, for large sitesI recommend to crawl per language. If neccesary, per language-country. In this instance I recommend to create a sitemap XML per language or language-country for just HTML pages (hopefully dynamically updated by the e-commerce), create a Sitemap Index in the root of the domain and submit them in Google Search Console (better if you validated the languages or language-country). With this you can answer the question if some language or country are being not indexed with the Submited/Indexed stadistics of GSC.
        • Maybe the robots.txt might save your crawl budget, but I'm not a fan of de-index if those folders are truly not relevant (after all, there should be a italian living in UK. If you can't delete the irrelevant langauges for some countries, this can be an option
        1 Reply Last reply Reply Quote 4
        • guidoampollini
          guidoampollini last edited by

          Thank you Antonio, insightful and clear.

          There is really not a need of EN versions of localized sites, I think has been done more as was easier to implement (original site is EN-US).

          Don't you think robots and noindex EN version of localized sites could be the best solution? for sure is the easier one to implement without affecting UX.

          antonioaraya 1 Reply Last reply Reply Quote 0
          • antonioaraya
            antonioaraya @guidoampollini last edited by

            Hey Guido, don't know if it's the best solution, but could be a temporary fix until the best solution is in place. I suggest to move forward with proper HREF LANG tagging or definitely delete those irrelevant languages. Try to do what I said before about validate each country/language and submit a sitemap.xml reflecting that folder to see crawl and index stats pero country/language. Add a sitemap index and obviously validate your entire domain. Just block in the robots.txt unnecessary folders, like images, js libraries, etc. to save crawl budget to your domain.

            Let me know if you have another doubt 🙂

            1 Reply Last reply Reply Quote 0
            • 1 / 1
            • First post
              Last post
            • International SEO & Duplicate Content: ccTLD, hreflang, and relcanonical tags
              NickJasuja
              NickJasuja
              0
              4
              1.3k

            • Duplicate product description ranking problems (off-site duplicate content)
              Dr-Pete
              Dr-Pete
              0
              4
              1.0k

            • International hreflang - will this handle duplicate content?
              katemorris
              katemorris
              1
              10
              8.9k

            • E-Commerce site in 2 languages - Duplicate content or not?
              Highland
              Highland
              0
              9
              355

            • Ranking well internationally, usage of hreflang, duplicate country content
              simon_realbuzz
              simon_realbuzz
              0
              4
              506

            • Impact of Japanese .jp site duplicate content?
              SanketPatel
              SanketPatel
              0
              2
              342

            • I have on site translated into several languages on different TLDs, .com, .de, .co.uk, .no, etc. Is this duplicate content?
              hurtigruten
              hurtigruten
              0
              3
              293

            • Internationally targetted subdomains and Duplicate content
              alexhoug
              alexhoug
              0
              2
              1.1k

            Get started with Moz Pro!

            Unlock the power of advanced SEO tools and data-driven insights.

            Start my free trial
            Products
            • Moz Pro
            • Moz Local
            • Moz API
            • Moz Data
            • STAT
            • Product Updates
            Moz Solutions
            • SMB Solutions
            • Agency Solutions
            • Enterprise Solutions
            • Digital Marketers
            Free SEO Tools
            • Domain Authority Checker
            • Link Explorer
            • Keyword Explorer
            • Competitive Research
            • Brand Authority Checker
            • Local Citation Checker
            • MozBar Extension
            • MozCast
            Resources
            • Blog
            • SEO Learning Center
            • Help Hub
            • Beginner's Guide to SEO
            • How-to Guides
            • Moz Academy
            • API Docs
            About Moz
            • About
            • Team
            • Careers
            • Contact
            Why Moz
            • Case Studies
            • Testimonials
            Get Involved
            • Become an Affiliate
            • MozCon
            • Webinars
            • Practical Marketer Series
            • MozPod
            Connect with us

            Contact the Help team

            Join our newsletter
            Moz logo
            © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
            • Accessibility
            • Terms of Use
            • Privacy