The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Moz Tools
    4. Roger keeps telling me my canonical pages are duplicates

    Roger keeps telling me my canonical pages are duplicates

    Moz Tools
    7 3 351
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • icecarats
      icecarats last edited by

      I've got a site that's brand spanking new that I'm trying to get the error count down to zero on, and I'm basically there except for this odd problem. Roger got into the site like a naughty puppy a bit too early, before I'd put the canonical tags in, so there were a couple thousand 'duplicate content' errors. I put canonicals in (programmatically, so they appear on every page) and waited a week and sure enough 99% of them went away.

      However, there's about 50 that are still lingering, and I'm not sure why they're being detected as such. It's an ecommerce site, and the duplicates are being detected on the product page, but why these 50? (there's hundreds of other products that aren't being detected). The URLs that are 'duplicates' look like this according to the crawl report:

      http://www.site.com/Product-1.aspx

      http://www.site.com/product-1.aspx

      And so on. Canonicals are in place, and have been for weeks, and as I said there's hundreds of other pages just like this not having this problem, so I'm finding it odd that these ones won't go away.

      All I can think of is that Roger is somehow caching stuff from previous crawls? According to the crawl report these duplicates were discovered '1 day ago' but that simply doesn't make sense. It's not a matter of messing up one or two pages on my part either; we made this site to be dynamically generated, and all of the SEO stuff (canonical, etc.) is applied to every single page regardless of what's on it.

      If anyone can give some insight I'd appreciate it!

      1 Reply Last reply Reply Quote 0
      • JoelDay
        JoelDay last edited by

        Hi Valery,

        I took a peek at your campaign and it looks like those few remaining duplicate pages are in fact different, but very minor differences. Basically there's pages for different sizes of things.

        While being different, they vary in such minute ways that Roger see's them as duplicates.

        I Hope that answers the question.

        Thanks,

        Joel.

        icecarats 1 Reply Last reply Reply Quote 0
        • icecarats
          icecarats @JoelDay last edited by

          Joel -

          Thanks a lot for looking into that. The pages are very similar, so I'm not surprised they're being duplicate triggered; but what does surprise me is that they are apparently being considered duplicate to a canonical version of themselves? When I click on the duplicate list I'm expecting to see:

          Product1.aspx

          Product1-Blue.aspx

          Product1-Red.aspx

          But instead I'm seeing:

          Product1.aspx

          product1.aspx

          product1.ASPX

          And so on. The first scenario to me implies that the 3 pages are duplicate to each other, whereas the second is saying that there's either a canonical problem or I literally have different-case versions of those files.

          1 Reply Last reply Reply Quote 0
          • JoelDay
            JoelDay last edited by

            Hey Valery,

            I see those on closer inspection. I know it looks weird, but that's accurate. Your server must be UNIX or Linux so they will actually treat case as a different word.

            For example: banana.com/pancakes.html would be treated differently than banana.com/PanCakes.html.

            So if you have any pages generated dynamically or otherwise that differ only in case, then they will be tagged as duplicate.

            In your CSV file you can see the duplicates being caused by case. I'd also be happy to help provide a few specific examples but would want to generate a ticket for you so we don't divulge any private information.

            Cheers,

            Joel.

            icecarats 1 Reply Last reply Reply Quote 1
            • icecarats
              icecarats @JoelDay last edited by

              I think I get your point; the canonical is pointing to where the juice should go, but the URLs are still functionally different things. I'm guessing some sort of URL rewrite is in order, and to standardize how I do in-text links on the site (with user-editable content this part could be a pain).

              1 Reply Last reply Reply Quote 0
              • ThompsonPaul
                ThompsonPaul last edited by

                Valery, you're definitely going to want to normalize your URLs to lowercase. This is a quirk of IIS that it actually respects case in URLs and will consider different case URLs as different pages.

                In addition to the search engine problems it creates, it's also a major problem for usabilty - yours and your users. For example, a user who is trying to type in a direct URL can get a 404 error depending on what case they use.

                More importantly, your Google Analytics will report on each of those version as separate pages, unless you write a normalizing filter into your GA profiles. Better to do that normalization for the actual site, not just your analytics

                While rel=canonical can resolve a number of issues, I've always found it vastly better to correct the actual problem at its root, rather than rely on canonicalization as a catch-all. Anecdotally, I've found correcting issues like this with rewrites seems to allow affected pages to rank better than when just corrected with canonicalization. WIsh I could find time to do an actual case-study on that 😞

                Managing rewrites on IIS servers will require a plugin like asapi-rewrite as IIS doesn't handle it natively.

                P.S. IIS will also allow and respect spaces in URLs. Users in Internet Explorer will see them as normal with spaces but browsers like Firefox will insert the html entity for a space (%20) into each necessary spot in the URL. This is again a mess for usability, so much better to force rewrite of all URLs to replace spaces with dashes when creating new pages. Many CMSs have plugins for this or you can also use sitewide rewrites to do it after the fact.

                1 Reply Last reply Reply Quote 1
                • icecarats
                  icecarats last edited by

                  ThompsonPaul -

                  Thanks for that info, it pretty much nails exactly what I had discovered independently. This is an IIS7/Win2k8R2 install so luckily the rewriting is a bit easier than in previous iterations. The whole platform is hand coded by us (after the 10th ecommerce site or so you can generally do them in your sleep) so I don't have to worry about CMS implementation and the like, and luckily we already knew that about the spaces so they simply aren't allowed in the filenames. I'm in the middle of making a regex right now that is going to down-case anything in an href="" or src="" tag that will hopefully handle everything on the site side user-created or not. Will consider what to do in regards to external links a bit down the road I think.

                  1 Reply Last reply Reply Quote 0
                  • 1 / 1
                  • First post
                    Last post
                  • Duplicate Content on a page that isn't duplicate?
                    Vanderlindemedia
                    Vanderlindemedia
                    0
                    6
                    52

                  • Duplicate Content: Marketing Page / Content Page
                    AllMedSeo
                    AllMedSeo
                    0
                    3
                    136

                  • Duplicate Page Content on pages that appear to be different?
                    MonicaOConnor
                    MonicaOConnor
                    0
                    5
                    111

                  • I have a canonical tag on several pages that MOZ still reports as duplicate content. What would cause this? Here is an example:
                    MoosaHemani
                    MoosaHemani
                    0
                    2
                    73

                  • Canonical URLs and Duplicate Page Content
                    Matt-Williamson
                    Matt-Williamson
                    0
                    2
                    302

                  • How Do I deal with duplicate page titles for pages on eCommerce site
                    wouldBseoKING
                    wouldBseoKING
                    0
                    3
                    415

                  • Duplicate page content showing up with proper use of canonical tag
                    Leighm
                    Leighm
                    0
                    7
                    665

                  • I have another Duplicate page content Question to ask.Why does my blog tags come up as duplicates when my page gets crawled,how do I fix it?
                    PCTechGuy2012
                    PCTechGuy2012
                    0
                    4
                    515

                  Get started with Moz Pro!

                  Unlock the power of advanced SEO tools and data-driven insights.

                  Start my free trial
                  Products
                  • Moz Pro
                  • Moz Local
                  • Moz API
                  • Moz Data
                  • STAT
                  • Product Updates
                  Moz Solutions
                  • SMB Solutions
                  • Agency Solutions
                  • Enterprise Solutions
                  • Digital Marketers
                  Free SEO Tools
                  • Domain Authority Checker
                  • Link Explorer
                  • Keyword Explorer
                  • Competitive Research
                  • Brand Authority Checker
                  • Local Citation Checker
                  • MozBar Extension
                  • MozCast
                  Resources
                  • Blog
                  • SEO Learning Center
                  • Help Hub
                  • Beginner's Guide to SEO
                  • How-to Guides
                  • Moz Academy
                  • API Docs
                  About Moz
                  • About
                  • Team
                  • Careers
                  • Contact
                  Why Moz
                  • Case Studies
                  • Testimonials
                  Get Involved
                  • Become an Affiliate
                  • MozCon
                  • Webinars
                  • Practical Marketer Series
                  • MozPod
                  Connect with us

                  Contact the Help team

                  Join our newsletter
                  Moz logo
                  © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
                  • Accessibility
                  • Terms of Use
                  • Privacy