The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Technical SEO Issues
    4. How critical is Duplicate content warnings?

    How critical is Duplicate content warnings?

    Technical SEO Issues
    7 4 687
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • Gamer07
      Gamer07 last edited by

      Hi,

      So I have created my first campaign here and I have to say the tools, user interface and the on-page optimization, everything is useful and I am happy with SEOMOZ.

      However, the crawl report returned thousands of errors and most of them are duplicate content warnings.

      As we use Drupal as our CMS, the duplicate content is caused by Drupal's pagination problems. Let's say there is a page called "/top5list" , the crawler decided /top5list?page=1" to be duplicate of "/top5list". There is no real solution for pagination problems in Drupal (as far as I know).

      I don't have any warnings in Google's webmaster tools regarding this and my sitemap I submitted to Google doesn't include those problematic deep pages. (that are detected as duplicate content by SEOMOZ crawler)

      So my question is, should I be worried about the thousands of error messages in crawler diagnostics?

      any ideas appreciated

      1 Reply Last reply Reply Quote 0
      • SWA_Adam
        SWA_Adam last edited by

        OK, this is just what I've done, and it might not work for everyone.

        As far as I can tell, the duplicate content warnings do not hurt my rankings, I don't think. When I first signed up for SEOMoz they really alarmed me. If they are hurting my rankings, it's not much - as we preform well in many competitive keywords for our industry, and our website traffic has been growing ~20% year over year for many years now.

        The fix for auto-generated duplicate content on our site (which I inherited as my responsibility when I started at my company) would be very expensive. It's something I plan on doing eventually along with some other overhauls, but right now it's not in the budget, because it would basically involve re-architecting how the site and databases function on the back end (ugh).

        So, in order to help mitigate any issues and help keep Google from indexing all the duplicate content that can be generated by our system, I use the "URL Parameters" setting in Google Webmaster Tools (under Site Configuration). I've set up a few parameters for Google to specifically NOT INDEX, to keep the duplicate content out of the search engine. I've also set some parameters to specifically reenforce content I want indexed (along with including the original content in sitemaps I've curated myself, rather than having auto-generated sitemaps potentially polluted with duplicate content).

        My thinking is that while Roger the SEOMoz bot is still finding this stuff and generating warnings, Googlebot is not.

        I don't work at an agency - I'm in-house and I've hard to learn everything by trial and error and often fly by the seat of my pants with this sort of thing. So my conclusion/solutions may be wrong or not work for you, but it seems to work for me.

        It's a band-aid fix at best, but it seems to be better than nothing!

        Hope this helps,

        -Adam

        1 Reply Last reply Reply Quote 1
        • Vahe.Arabian
          Vahe.Arabian last edited by

          For pagination problem's it would be better to use this cannonical method- http://googlewebmastercentral.blogspot.com.au/2012/03/video-about-pagination-with-relnext-and.html .

          Having dup content in the form paginated results will not penalise you, rather the page/link equity will be split between all these pages. This means you would need to spend more time and energy on the original page to outrank your competitors.

          To see these errors in Google Webmaster Tools you should go to the HTML sections area where it will review the sites meta data. I'm sure ull find the same issues there, instead of the sitemaps.

          So to improve the overall health of your website,  I would suggest that you do try and verify this issue.

          Hope this helps. Any issues, best to contact me directly.

          Regards,

          Vahe

          1 Reply Last reply Reply Quote 1
          • Gamer07
            Gamer07 last edited by

            Thanks Adam and Vahe. Your suggestions are definitely helpful.

            1 Reply Last reply Reply Quote 1
            • Dr-Pete
              Dr-Pete last edited by

              One clarification one Vahe's answer - if these continue (?page=2, ?page=3, etc.) then it's traditional pagination. You could use the GWT solution Adam mentioned, although, honestly, I find it's hit-or-miss. It is simpler than other solution. The "ideal" Google solution is very hard to implement (and I actually have issues with it). The other option is to META NOINDEX the variants, but that would take adjusting the template code dynamically.

              If it's just an issue of a bunch of "page=1" duplicates, and this isn't "true" pagination, then canonical tags are probably your best bet. There may be a Drupal plug-in or fix - unfortunately, I don't have much Drupal experience.

              The question is whether these pages are being indexed by Google, and how many of them there are. At large scale, these kinds of near-duplicates can dilute your index, harm rankings, and even contribute to Panda issues. At smaller scale, though, they might have no impact at all. So, it's not always clear cut, and you have to work the risk/cost calculation.

              You can run a command in Google like:

              site:example.com inurl:page=

              ...and try to get a sense of how much of this content is being indexed.

              The GWT approach won't hurt, and it's fine to try. I just find that Google doesn't honor it consistently.

              Gamer07 1 Reply Last reply Reply Quote 2
              • Gamer07
                Gamer07 @Dr-Pete last edited by

                Thanks for that command Dr. Meyers. Apparently, only 5 such pages are indexed. I suppose I shouldn't worry about this then?

                Dr-Pete 1 Reply Last reply Reply Quote 0
                • Dr-Pete
                  Dr-Pete @Gamer07 last edited by

                  Personally, I'd keep an eye on it. These things do have a way of expanding over time, so you may want to be proactive. At the moment, though, you probably don't have to lose sleep over it.

                  1 Reply Last reply Reply Quote 1
                  • 1 / 1
                  • First post
                    Last post
                  • Added 301 redirects, pages still earning duplicate content warning
                    alecfwilson
                    alecfwilson
                    0
                    5
                    121

                  • Duplicate content warning for a hierarchy structure?
                    westsaddle
                    westsaddle
                    0
                    5
                    169

                  • Content relaunch without content duplication
                    TomRayner
                    TomRayner
                    0
                    2
                    68

                  • Is this duplicate content when there is a link back to the original content?
                    JoLindahl91
                    JoLindahl91
                    2
                    6
                    133

                  • Duplicate Content Vs No Content
                    MoosaHemani
                    MoosaHemani
                    0
                    7
                    404

                  • How to avoid duplicate content penalty when our content is posted on other sites too ?
                    Personnel_Concept
                    Personnel_Concept
                    0
                    8
                    460

                  • Lots of duplicate content warnings
                    TakeshiYoung
                    TakeshiYoung
                    0
                    2
                    327

                  • Is 100% duplicate content always duplicate?
                    activitysuper
                    activitysuper
                    0
                    4
                    457

                  Get started with Moz Pro!

                  Unlock the power of advanced SEO tools and data-driven insights.

                  Start my free trial
                  Products
                  • Moz Pro
                  • Moz Local
                  • Moz API
                  • Moz Data
                  • STAT
                  • Product Updates
                  Moz Solutions
                  • SMB Solutions
                  • Agency Solutions
                  • Enterprise Solutions
                  • Digital Marketers
                  Free SEO Tools
                  • Domain Authority Checker
                  • Link Explorer
                  • Keyword Explorer
                  • Competitive Research
                  • Brand Authority Checker
                  • Local Citation Checker
                  • MozBar Extension
                  • MozCast
                  Resources
                  • Blog
                  • SEO Learning Center
                  • Help Hub
                  • Beginner's Guide to SEO
                  • How-to Guides
                  • Moz Academy
                  • API Docs
                  About Moz
                  • About
                  • Team
                  • Careers
                  • Contact
                  Why Moz
                  • Case Studies
                  • Testimonials
                  Get Involved
                  • Become an Affiliate
                  • MozCon
                  • Webinars
                  • Practical Marketer Series
                  • MozPod
                  Connect with us

                  Contact the Help team

                  Join our newsletter
                  Moz logo
                  © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
                  • Accessibility
                  • Terms of Use
                  • Privacy