The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Link Explorer
    4. Duplicated content detected with MOZ crawl with canonical applied

    Duplicated content detected with MOZ crawl with canonical applied

    Link Explorer
    13 3 461
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • Eurasmus.com
      Eurasmus.com last edited by

      Hi there!

      I have a slight problem.
      I have a site with Joomla 3.3 that we recently migrated from 2.5.

      Joomla, for some reason that I don´t really get, creates hundreds of weird urls for the site like
      mydomain.com/en -> joomla creates en/home/149-xxx-xxx/xxxxxx-xxxxxx that links to the first one.
      The new version 3.3 knows this bug and applies a rel=canonical to the ones created "artificially", so they should not be identified as duplicated.

      Sample piece of code:  en/home/149-all-en/xxxxxxx-xxxxxx" rel="canonical" /

      MOZ crawler identifies this as duplicated and like this I have thousands of pages duplicated all with titles, content etc... all the ones created by joomla. Still my site has good SEO results and I can not see any penalties but I am a bit concerned they may come in the future....

      Can anyone explain me what is happening?

      Thank you in advance for your time,

      1 Reply Last reply Reply Quote 0
      • Hutch42
        Hutch42 last edited by

        Do you have a link to one of these pages so we can look at how it is deploying the canonical onto the page.

        1 Reply Last reply Reply Quote 0
        • Eurasmus.com
          Eurasmus.com last edited by

          Hey,

          Yes, sure.

          This is the duplicated from the /en

          http://www.spain-internship.com/en/home/149-all-en/placement-spain

          Thanks!

          Hutch42 1 Reply Last reply Reply Quote 0
          • Hutch42
            Hutch42 @Eurasmus.com last edited by

            Do you have an example of one of these generated pages as well, everything looks fine on the main page.

            Eurasmus.com 1 Reply Last reply Reply Quote 0
            • Eurasmus.com
              Eurasmus.com @Hutch42 last edited by

              I will provide few so you can look!

              Detected as duplicated:
              http://www.spain-internship.com/en
              http://www.spain-internship.com/en/home/149-all-en/placement-spain

              Same here:
              http://www.spain-internship.com/fi
              http://www.spain-internship.com/fi/etusivu/186-all-fi/home-page-fi

              http://www.spain-internship.com/en/internships-in-salamanca
              http://www.spain-internship.com/es/internships-in-salamanca

              First one is the original. The rest one have canonical. Still detected as duplicated.

              Hutch42 1 Reply Last reply Reply Quote 0
              • Hutch42
                Hutch42 @Eurasmus.com last edited by

                Ok, the problem is your pages are all canonical to themselves, the canonical tag should point at the main page for the content, not to every page.  For your first example, all pages that get their content from http://www.spain-internship.com/en need to have canonical tags to that page, instead the copy page has this:

                href="http://www.spain-internship.com/fi/etusivu/186-all-fi/home-page-fi" rel="canonical" />

                it should have

                href="http://www.spain-internship.com/fi/" rel="canonical" />

                1 Reply Last reply Reply Quote 2
                • Eurasmus.com
                  Eurasmus.com last edited by

                  Hi there,

                  Thanks for the update. Now that you told me the problem I found out this is a known bug for joomla and I am working on it.
                  I found a plugin http://styleware.eu/store/item/26-styleware-content-canonical-plugin that sends all the duplicated urls, generated automatically with a canonical to the home.

                  Sample:
                  http://www.spain-internship.com/en/home/149-all-en/placement-spain
                  Now with the link http://www.spain-internship.com" rel="canonical" />.

                  This solves the problem of the core canonical bug. 
                  Would this be a proper solution?

                  Now I only have to change all the ones duplicated due to languages config, block then in robots or canonical but as far as I control it, it is ok.

                  Please, let me know if this would be a proper solution.

                  Thank you in advance for your help, if I can help you in some moment with something here we are!

                  KaneJamison 2 Replies Last reply Reply Quote 0
                  • KaneJamison
                    KaneJamison @Eurasmus.com last edited by

                    Hey Jose,

                    If you have an /es/ subfolder then ideally you would be translating that content to Spanish, not canonicalizing that content back to the English version.

                    I can see from http://www.spain-internship.com/es/internships-in-salamanca that not all /es/ pages are translated - is this true across the entire website?

                    If you don't have any Spanish content, then you should just kill off the /es/ version entirely.

                    1 Reply Last reply Reply Quote 0
                    • KaneJamison
                      KaneJamison @Eurasmus.com last edited by

                      Also, if you decide to keep the /es/ section of the website then you'll need to look into hreflang instead of canonical tags, because /es/ and /en/ will not be duplicate content once they're translated.

                      Read this Q&A from Google for details - https://sites.google.com/site/webmasterhelpforum/en/faq-internationalisation#q20

                      1 Reply Last reply Reply Quote 0
                      • Eurasmus.com
                        Eurasmus.com last edited by

                        Thank you Kane.

                        I would like to keep the content in all the languages, ,as I think it is useful for customers to enter easily certain areas.
                        The problem that I am always having is the implementation...

                        There are not real good canonical plugins (that would allow me to do a bulk import), and I am not that advanced as for doing an htaccess redirect with 301... still, I would like that if someone from NL or FI version would like to find the area barcelona could see it....

                        Anything on mind!? Just to say, I tried SH404, does all the work but rewrites the whole url structure (not possible), I tried canonical http://www.cmsplugin.com/products/components/4-canonical-url which solves the duplication by languages but not the random urls created by 3.3...
                        Then I decided to leave the plugin I mentioned before, it deletes all the duplicated urls generated automatically but does not solve the language problem...

                        So, here I am 🙂

                        Any suggestion?

                        KaneJamison 1 Reply Last reply Reply Quote 0
                        • KaneJamison
                          KaneJamison @Eurasmus.com last edited by

                          I personally would not generate new language sections unless the content has been translated and localized on those pages. Right now your Spanish homepage has English content in the body, so I would view this as incomplete. Ideally you'd translate the entire page for those sections.

                          When you do that, you'll want to use hreflang, not canonicals, to indicate different versions of the same content.

                          So, my recommendation is (A) get rid of the Spanish content sections which would solve the duplication problem, or (B) finish translating the content and then install hreflang code, which would also solve the duplication problem.

                          Unfortunately I don't know of a good hreflang tool for Joomla specifically.

                          Let me know if that makes sense?

                          1 Reply Last reply Reply Quote 0
                          • Eurasmus.com
                            Eurasmus.com last edited by

                            Thank you for your time!

                            We are going to setup 301 redirects (one colleague suggested importing those directly in the DB of redirects) from those duplicated pages until joomla has a native solution and we have the time to make all unique content, to avoid penalties.

                            At least, we would solve temporaly the problem, it will take 2 weeks to make all the unique content.

                            Would that make sense?

                            Have a nice weekend!

                            KaneJamison 1 Reply Last reply Reply Quote 0
                            • KaneJamison
                              KaneJamison @Eurasmus.com last edited by

                              If it's a period of 2 weeks and you're going to do it anyways, I would just make the new content and not go to the expense of setting up redirects and then taking them down, which can cause issues when you plan on recreating a URL.

                              1 Reply Last reply Reply Quote 0
                              • 1 / 1
                              • First post
                                Last post
                              • Why my website Backlinks not getting Crawl by Moz?
                                eli.myers
                                eli.myers
                                0
                                2
                                29

                              • Why moz is not crawling my Backlinks & Keyword ?
                                Williamh20
                                Williamh20
                                1
                                5
                                85

                              • Moz can't crawl our site
                                Natalie-Alexis
                                Natalie-Alexis
                                0
                                2
                                22

                              • Does the Moz Pro site crawl, crawl password protected sites?
                                tawnycase
                                tawnycase
                                0
                                4
                                114

                              • How Do You Deal With Duplicate Content On A Retail Site
                                nick_HandCo
                                nick_HandCo
                                1
                                7
                                143

                              • Why is moz telling me I have duplicate content, but neither the content nor the urls are duplicates?
                                ChiarynMiranda
                                ChiarynMiranda
                                0
                                4
                                276

                              • Moz Crawl Canonicals and Duplicates
                                Eurasmus.com
                                Eurasmus.com
                                0
                                5
                                150

                              • How Is a Page Crawled by Moz When Moz Says 'No Links'?
                                xerox432
                                xerox432
                                0
                                9
                                153

                              Get started with Moz Pro!

                              Unlock the power of advanced SEO tools and data-driven insights.

                              Start my free trial
                              Products
                              • Moz Pro
                              • Moz Local
                              • Moz API
                              • Moz Data
                              • STAT
                              • Product Updates
                              Moz Solutions
                              • SMB Solutions
                              • Agency Solutions
                              • Enterprise Solutions
                              • Digital Marketers
                              Free SEO Tools
                              • Domain Authority Checker
                              • Link Explorer
                              • Keyword Explorer
                              • Competitive Research
                              • Brand Authority Checker
                              • Local Citation Checker
                              • MozBar Extension
                              • MozCast
                              Resources
                              • Blog
                              • SEO Learning Center
                              • Help Hub
                              • Beginner's Guide to SEO
                              • How-to Guides
                              • Moz Academy
                              • API Docs
                              About Moz
                              • About
                              • Team
                              • Careers
                              • Contact
                              Why Moz
                              • Case Studies
                              • Testimonials
                              Get Involved
                              • Become an Affiliate
                              • MozCon
                              • Webinars
                              • Practical Marketer Series
                              • MozPod
                              Connect with us

                              Contact the Help team

                              Join our newsletter
                              Moz logo
                              © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
                              • Accessibility
                              • Terms of Use
                              • Privacy