The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. On-Page / Site Optimization
    4. Does this site have a duplicate content issue?

    Does this site have a duplicate content issue?

    On-Page / Site Optimization
    17 5 238
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • rja214
      rja214 last edited by

      Yes, the page http://www.customgia.com/fashion-beaded-jewelry/shop-for/beaded-bracelets/classic-bracelet/AADE17E9BB964352B7A3912294BB5DF8 is a unique product page with a unique description.  The designer page you referenced: http://www.customgia.com/design-your-own-jewelry/classic-bracelet/AADE17E9BB964352B7A3912294BB5DF8 is also unique because it loads the jewelry design in the product page.

      On every product page the "Design It!" button opens a flash-based designer page that let's the user edit that particular design.  Unfortunately, the Moz crawler (and I assume Google) considers these pages duplicates of http://www.customgia.com/design-your-own-jewelry/classic-bracelet/ (or one of the other five jewelry patterns).  The fact that every designer page loads a unique jewelry design does not seem to matter.  The best (and most costly) solution, I suppose would be to change all the flash code to html but that isn't happening anytime soon.

      The

      tag on all the designer pages is either "Classic Bracelet", "Classic Necklace", "Simple Necklace", "Simple Bracelet", "Pendant Necklace" or "Drop Earrings" (depending on which pattern was used to create the design).  Maybe changing the

      to something like:  Redesigning "Garnet Pearl Bracelet with Silver Hearts" would help tell Google these pages indeed differ from one another - but I think the content below the

      will still be considered duplicate.  If I use canonical tags on these pages, is there any point in creating dynamic

      tags if it doesn't improve the user experience?  Thanks again!

      MickEdwards 1 Reply Last reply Reply Quote 0
      • MickEdwards
        MickEdwards @rja214 last edited by

        Even if the 'design' part was not flash the textual content is pretty much identical.  There is no benefit for it to be indexed so canonical to the main directory URL would make sense.  Then add some good text to those main pages.

        Personally I would only use H1 for user experience rather than keywordy as they don't carry much weight.

        1 Reply Last reply Reply Quote 0
        • Dr-Pete
          Dr-Pete last edited by

          Each of these problems may have a unique solution, so it gets complicated. Regarding the "design your own" pages, I'm seeing over 5K of those URLs in the search index, and they do probably look very similar. Since these are not the core product pages, I'd strongly consider using META NOINDEX on them. I find that Robots.txt does not do a good job of blocking content that has already been indexed, in most cases. You can add the meta tag dynamically in your code, hopefully, so that just a few lines of code will serve all of these pages.

          While these pages aren't "true" duplicates, they look similar enough that, at the scale of your site, they really are diluting your ability to rank. In extreme cases, if you're also serving up product variations, paginated search results, etc., you could even run into Panda issues. Whether or not this is your core problem, from an SEO perspective, cleaning it up can't hurt, and may make it easier to find other problems.

          Everett 1 Reply Last reply Reply Quote 2
          • Everett
            Everett @Dr-Pete last edited by

            I agree with Dr. Pete here, though I think the easiest solution would be to simply block the entire /design-your-own-jewelry/* directory from being indexed using robots.txt and, to Dr. Pete's point, you'll want to remove that directory from the index in both Bing and Google webmaster tools, as discussed here:

            http://googlewebmastercentral.blogspot.com/2007/04/requesting-removal-of-content-from-our.html (see the section under "entire directory")

            Something I think about with regard to robots.txt block Vs meta robots block is crawl budget. Google has to access a page to see the meta noindex tag, while a single disallow statement in the robots.txt file can save Googlebot the hassle of visiting potentially thousands of unnecessary pages.

            If down the road you figure out a way to put custom content on those pages and want to try and rank for things like "Custom Garnet Pearl Bracelet" or "Design Your Own Beaded Bracelet" then I'd look into some of the other options discussed here. Until then I feel they would just be complicating something as simple as the need to remove very thin, mostly duplicate content from the index.

            Dr-Pete 1 Reply Last reply Reply Quote 1
            • Dr-Pete
              Dr-Pete @Everett last edited by

              I've had a lot of issues where, if pages were already indexed, Robots.txt did a poor job of removing them. Absolutely agree on the crawl budget issue and it's a whole lot easier to remove a folder in Robots.txt, but I've just had a bunch of odd problems with Robots.txt at large scale. If I actually had to do it on my own site, I'd probably:

              (1) META NOINDEX the pages

              (2) Monitor removal

              (3) Once removal was progressing well (80%+), then add to Robots.txt

              Everett 1 Reply Last reply Reply Quote 1
              • Everett
                Everett @Dr-Pete last edited by

                That sounds like a winning plan Dr. Pete, though I'd append 2.1 "Request removal of directory in Bing and Google webmaster tools".

                1 Reply Last reply Reply Quote 1
                • rja214
                  rja214 last edited by

                  Thank you for the excellent advice.  We put the META NOINDEX tags into place this morning.  The URL removal request is next but I have a slight change to what Everett outlined above.

                  The six designer pages that are accessible from the homepage: /design-your-own-jewelry/classic-necklace, /design-your-own-jewelry/simple-necklace, /design-your-own-jewelry/classic-bracelet, /design-your-own-jewelry/simple-bracelet, /design-your-own-jewelry/pendant-necklace, /design-your-own-jewelry/drop-earrings are not duplicates and the Moz crawl did not show them as duplicates.  All the other designer pages are considered duplicates of each other or duplicates of one of these six pages.  So we put INDEX, NOFOLLOW on these six pages to keep them indexed. I think the removal request should follow suit.

                  What's your opinion on placing a removal request for each of the following? -  /design-your-own-jewelry/classic-necklace/, /design-your-own-jewelry/simple-necklace/, /design-your-own-jewelry/classic-bracelet/, /design-your-own-jewelry/simple-bracelet/, /design-your-own-jewelry/pendant-necklace/, /design-your-own-jewelry/drop-earrings/.  Correct me if I'm wrong, but this should remove all the designer pages from the index except the six that are accessible from the homepage, giving those six pages a chance to rank.

                  1 Reply Last reply Reply Quote 0
                  • Dr-Pete
                    Dr-Pete last edited by

                    Oh, sorry - yeah, this is why these questions can be dangerous in the scope of Q&A. If some of the pages in that virtual folder are main nav pages/links, then it's definitely going to look weird to block them (it's a mixed signal, at best). I'm not sure I fully understand the site structure, but my gut reaction is to leave those indexed. The wild-cards should work - the other option would be to give them each their own shorter URLs and not put them in the "/design-your-own-jewelry" folder, but that can be a ton of work, depending on how your site is built (plus, you'd have to 301-redirect the old URLs, which opens up a whole new mess).

                    1 Reply Last reply Reply Quote 0
                    • rja214
                      rja214 last edited by

                      Okay, last question on this (I hope).  As far as I can tell, Google's URL removal tool does not support the use of wildcards.  And according to their removal requirements, I can't remove an entire directory unless that directory is already blocked in the robots.txt.  So before I submit the removal request for: http://www.customgia.com/design-your-own-jewelry/classic-necklace/, I have to add:  Disallow: /design-your-own-jewelry/classic-necklace/ to robots.txt.  Is this the right way to do this?  And again, thanks for helping me with this.

                      1 Reply Last reply Reply Quote 0
                      • Dr-Pete
                        Dr-Pete last edited by

                        This is where things get a bit dicey - I'm not 100% sure that won't remove the main page, too (and how Google handles the trailing "/"). You might need a "/*" wild-card in the Robots.txt. Frankly, I'd ease into it with just one directory. These things never seem to work quite the way in practice that we all say they should in theory.

                        1 Reply Last reply Reply Quote 0
                        • 1 / 1
                        • First post
                          Last post
                        • To avoid the duplicate content issue I have created new urls for that specific site I am posting to and redirecting that url to the original on my site. Is this the right way to do it?
                          0
                          1
                          24

                        • Duplicate content issue, across site domains (blogging)
                          HashtagHustler
                          HashtagHustler
                          0
                          6
                          146

                        • Duplicate content on events site
                          Martijn_Scheijbeler
                          Martijn_Scheijbeler
                          0
                          3
                          206

                        • Duplicate content issues?
                          HBPGroup
                          HBPGroup
                          0
                          4
                          96

                        • Duplicate content on partner site
                          Dr-Pete
                          Dr-Pete
                          0
                          7
                          749

                        • Duplicate content issue
                          irvingw
                          irvingw
                          0
                          2
                          121

                        • Issue: Duplicate Page Content
                          JonathanLeplang
                          JonathanLeplang
                          0
                          3
                          300

                        • How could I avoid the "Duplicate Page Content" issue on the search result pages of a webshop site?
                          seohive-222720
                          seohive-222720
                          0
                          3
                          545

                        Get started with Moz Pro!

                        Unlock the power of advanced SEO tools and data-driven insights.

                        Start my free trial
                        Products
                        • Moz Pro
                        • Moz Local
                        • Moz API
                        • Moz Data
                        • STAT
                        • Product Updates
                        Moz Solutions
                        • SMB Solutions
                        • Agency Solutions
                        • Enterprise Solutions
                        • Digital Marketers
                        Free SEO Tools
                        • Domain Authority Checker
                        • Link Explorer
                        • Keyword Explorer
                        • Competitive Research
                        • Brand Authority Checker
                        • Local Citation Checker
                        • MozBar Extension
                        • MozCast
                        Resources
                        • Blog
                        • SEO Learning Center
                        • Help Hub
                        • Beginner's Guide to SEO
                        • How-to Guides
                        • Moz Academy
                        • API Docs
                        About Moz
                        • About
                        • Team
                        • Careers
                        • Contact
                        Why Moz
                        • Case Studies
                        • Testimonials
                        Get Involved
                        • Become an Affiliate
                        • MozCon
                        • Webinars
                        • Practical Marketer Series
                        • MozPod
                        Connect with us

                        Contact the Help team

                        Join our newsletter
                        Moz logo
                        © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
                        • Accessibility
                        • Terms of Use
                        • Privacy