The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Intermediate & Advanced SEO
    4. Removing duplicate content

    Removing duplicate content

    Intermediate & Advanced SEO
    6 5 172
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • AMHC
      AMHC last edited by

      Due to URL changes and parameters on our ecommerce sites, we have a massive amount of duplicate pages indexed by google, sometimes up to 5 duplicate pages with different URLs.

      1. We've instituted canonical tags site wide.

      2. We are using the parameters function in Webmaster Tools.

      3. We are using 301 redirects on all of the obsolete URLs

      4. I have had many of the pages fetched so that Google can see and index the 301s and canonicals.

      5. I created HTML sitemaps with the duplicate URLs, and had Google fetch and index the sitemap so that the dupes would get crawled and deindexed.

      None of these seems to be terribly effective. Google is indexing pages with parameters in spite of the parameter (clicksource) being called out in GWT. Pages with obsolete URLs are indexed in spite of them having 301 redirects. Google also appears to be ignoring many of our canonical tags as well, despite the pages being identical.

      Any ideas on how to clean up the mess?

      1 Reply Last reply Reply Quote 0
      • PatrickDelehanty
        PatrickDelehanty last edited by

        Hi there

        Are you sure you are using all of the tools above properly? Not saying you're not but people make mistakes and it's just something to look into.

        When did you implement all of the changes? Was it recently or was it a long time ago?

        How is your organic traffic and rankings? Did you check if you have a manual action at all?

        Let me know - thanks!

        Linda-Vassily 1 Reply Last reply Reply Quote 1
        • WebElaine
          WebElaine last edited by

          It sounds like part of the problem may be the sitemaps you're sending. By including duplicates in a sitemap, you're basically telling Google that each version of the page is valid. I would remove them and resubmit a sitemap with only the canonical versions you want indexed and see if that helps.

          1 Reply Last reply Reply Quote 2
          • Linda-Vassily
            Linda-Vassily @PatrickDelehanty last edited by

            Yes, as Patrick said, surprisingly often something like this is a result of a simple oversight because we have been looking at the same code over and over...

            Do you have access to Screaming Frog? You could crawl your site and see whether redirects/canonicals are behaving as you expected.

            Have you taken a look at the html of one of the incorrectly indexed pages when it is loaded in your browser? Can you see the canonical? If you try going to a redirected page, does it redirect? [I know--way to obvious, but sometimes it is good to start at the beginning again when we can't root out an issue.]

            Another culprit in these cases can be internal links. Do you link internally using any of the undesirable URLs? That can send a message to Google that those URLs are still in play. Again, you can use Screaming Frog to find those strings.

            1 Reply Last reply Reply Quote 0
            • RuthBurrReedy
              RuthBurrReedy last edited by

              Are you basing this on a site: search? It's fairly common for URLs to appear in a site: search that otherwise will not appear for any actual searches. Are the undesirable versions of the URLs getting any search traffic?

              AMHC 1 Reply Last reply Reply Quote 0
              • AMHC
                AMHC @RuthBurrReedy last edited by

                Where this is  appearing the most is on cross domain canonicals. We have duplicate content across 2 websites, and we've canonicaled some pages from Site A to Site B, and some from Site B to Site A. In theory, pages that were canonicaled to the other domain should be deindexed. When I run a rankings report, I see pages for the wrong domain ranking, a month later. They are pages with parameters, or old URLs that we've changed. It's like a game of whack a mole. Every time we get a page deindexed, a duplicate with a different parameter takes its place. And this is in spite of calling out these parameters in GWT.

                What I imagine is happening is that we have several URLs for the same page indexed. When Google crawls our site, it is correctly canonicaling the page it crawls. In the rankings, however, Google is probably pulling a duplicate page out of its index, and ranking it without crawling it. If it was crawling it, Google would see the canonical tag, and not rank it. So we have an ongoing battle to get Google to crawl the page it just pulled out of its index to see the the canonical tag.

                The reason for all this is that when a page cross domain canonicals correctly, the rankings for the duplicate page on the other site goes up dramatically. As long as Google keeps ranking the wrong pages, we don't get the rankings bump on the other site.

                1 Reply Last reply Reply Quote 0
                • 1 / 1
                • First post
                  Last post
                • Duplicate Content That Isn't Duplicated
                  LoganRay
                  LoganRay
                  0
                  5
                  140

                • Duplicate Multi-site Content, Duplicate URLs
                  MonicaOConnor
                  MonicaOConnor
                  0
                  2
                  129

                • Duplicate content - how to diagnose duplicate content from another domain before publishing pages?
                  Chemometec
                  Chemometec
                  0
                  7
                  141

                • Duplicate content within sections of a page but not full page duplicate content
                  J_Sinclair
                  J_Sinclair
                  0
                  3
                  112

                • All Thin Content removed and duplicate content replaced. But still no success?
                  apogeecorp
                  apogeecorp
                  0
                  7
                  284

                • Copying my Facebook content to website considered duplicate content?
                  SEM-Freak
                  SEM-Freak
                  0
                  9
                  3.1k

                • How to Remove Joomla Canonical and Duplicate Page Content
                  Cyrus-Shepard
                  Cyrus-Shepard
                  0
                  7
                  2.8k

                • Which duplicate content should I remove?
                  Matthew_Edgar
                  Matthew_Edgar
                  0
                  2
                  312

                Get started with Moz Pro!

                Unlock the power of advanced SEO tools and data-driven insights.

                Start my free trial
                Products
                • Moz Pro
                • Moz Local
                • Moz API
                • Moz Data
                • STAT
                • Product Updates
                Moz Solutions
                • SMB Solutions
                • Agency Solutions
                • Enterprise Solutions
                • Digital Marketers
                Free SEO Tools
                • Domain Authority Checker
                • Link Explorer
                • Keyword Explorer
                • Competitive Research
                • Brand Authority Checker
                • Local Citation Checker
                • MozBar Extension
                • MozCast
                Resources
                • Blog
                • SEO Learning Center
                • Help Hub
                • Beginner's Guide to SEO
                • How-to Guides
                • Moz Academy
                • API Docs
                About Moz
                • About
                • Team
                • Careers
                • Contact
                Why Moz
                • Case Studies
                • Testimonials
                Get Involved
                • Become an Affiliate
                • MozCon
                • Webinars
                • Practical Marketer Series
                • MozPod
                Connect with us

                Contact the Help team

                Join our newsletter
                Moz logo
                © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
                • Accessibility
                • Terms of Use
                • Privacy