The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Intermediate & Advanced SEO
    4. Crawling/indexing of near duplicate product pages

    Crawling/indexing of near duplicate product pages

    Intermediate & Advanced SEO
    3 2 80
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • AMAGARD
      AMAGARD last edited by

      Hi,

      Hope someone can help me out here. This is the current situation:

      We sell stones/gravel/sand/pebbles etc. for gardens. I will take a type of pebbles and the corresponding pages/URL's to illustrate my question --> black beach pebbles.

      • We have a 'top' product page for black beach pebbles on which you can find different types of quantities (differing from 20kg untill 1600 kg).
      • There is not any search volume related to the different quantities
      • The 'top' page does not link to the pages for the different quantities
      • The content on the pages for the different quantities is not exactly the same (different price + slightly different content). But a lot of the content is the same.

      Current situation:
      - Most pages for the different quantities do not have internal links (about 95%)

      • But the sitemap does contain all of these pages.
      • Because the sitemap contains all these URL's, google frequently crawls them (I checked the logfiles) and has indexed them.

      Problems:

      • Google spends its time crawling irrelevant pages --> our entire website is not that big, so these quantity URL's kind of double the total number of URL's.
      • Having url's in the sitemap that do not have an internal link is a problem on its own
      • All these pages are indexed so all sorts of gravel/pebbles have near duplicates.

      My solution:

      • remove these URL's from the sitemap --> that will probably stop Google from regularly crawling these pages
      • Putting a canonical on the quantity pages pointing to the top-product page. --> that will hopefully remove the irrelevant (no search volume) near duplicates from the index

      My questions:

      • To be able to see the canonical, google will need to crawl these pages. Will google still do that after removing them from the sitemap?
      • Do you agree that these pages are near duplicates and that it is best to remove them from the index?
      • A few of these quantity pages do have intenral links (a few procent of them) because of a sale campaign. So there will be some (not much) internal links pointing to non-canonical pages. Would that be a problem?

      Thanks a lot in advance for your help!

      Best!

      1 Reply Last reply Reply Quote 1
      • Seenlyst
        Seenlyst last edited by

        Hello there,

        To answer your questions,

        1. Google will still crawl your pages even if it's not from the sitemap unless you specify disallow from your robots.txt

        2. If they are similar content with the main difference at "quantities" couldn't you consolidate them into one single page that lists all the quantities your company sell in and then 301 redirect the other pages to the consolidated one?

        3. It doesn't seem like going to be causing any problem nor hurting your SEO performance, but you could always change these link to the canonical link.

        Hope this helps,
        Joseph Yap

        AMAGARD 1 Reply Last reply Reply Quote 1
        • AMAGARD
          AMAGARD @Seenlyst last edited by

          Hi Joseph, thanks for your reply, really helpful! 301 is not really an option, because these quantity URL's are sometimes used for promotions and need to be reachable. Therefore I guess canonicals are the second best solution.

          We will implement the solution I described and see what will happen. Thanks again!

          1 Reply Last reply Reply Quote 0
          • 1 / 1
          • First post
            Last post
          • One Page Design / Single Product Page
            evolvingSEO
            evolvingSEO
            1
            3
            65

          • Best way to link to 1000 city landing pages from index page in a way that google follows/crawls these links (without building country pages)?
            lcourse
            lcourse
            0
            7
            54

          • How do we avoid duplicate/thin content on +150,000 product pages?
            EGOL
            EGOL
            0
            3
            277

          • My crawl can't find ANY product pages. The links to product pages aren't links, they're script. :(
            Joe.Robison
            Joe.Robison
            0
            8
            247

          • A/B Testing - Should I add product descriptions on my category landing pages as well as on product pages and if so . how to do this to avoid duplicate content
            PeteC12
            PeteC12
            0
            3
            162

          • Duplicate Content: Is a product feed/page rolled out across subdomains deemed duplicate content?
            danwebman
            danwebman
            0
            4
            146

          • How to remove "/magento/" and "/index.php/" showing in internal links and dup pages in GWT
            GarGar
            GarGar
            0
            6
            6.4k

          • Is there a way to stop my product pages with the "show all" catagory/attribute from duplicating content?
            cscoville
            cscoville
            0
            5
            362

          Get started with Moz Pro!

          Unlock the power of advanced SEO tools and data-driven insights.

          Start my free trial
          Products
          • Moz Pro
          • Moz Local
          • Moz API
          • Moz Data
          • STAT
          • Product Updates
          Moz Solutions
          • SMB Solutions
          • Agency Solutions
          • Enterprise Solutions
          • Digital Marketers
          Free SEO Tools
          • Domain Authority Checker
          • Link Explorer
          • Keyword Explorer
          • Competitive Research
          • Brand Authority Checker
          • Local Citation Checker
          • MozBar Extension
          • MozCast
          Resources
          • Blog
          • SEO Learning Center
          • Help Hub
          • Beginner's Guide to SEO
          • How-to Guides
          • Moz Academy
          • API Docs
          About Moz
          • About
          • Team
          • Careers
          • Contact
          Why Moz
          • Case Studies
          • Testimonials
          Get Involved
          • Become an Affiliate
          • MozCon
          • Webinars
          • Practical Marketer Series
          • MozPod
          Connect with us

          Contact the Help team

          Join our newsletter
          Moz logo
          © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
          • Accessibility
          • Terms of Use
          • Privacy