The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Search Engine Trends
    4. Large site with faceted navigation using rel=canonical, but Google still has issues

    Large site with faceted navigation using rel=canonical, but Google still has issues

    Search Engine Trends
    14 5 2.3k
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • PeteGregory
      PeteGregory @gmk1567 last edited by

      Yeah that is why I am not real excited about using robots.txt or even a no index in this instance.  They are not session ids, but more like:

      www.example.com/catgeoryname/a,

      www.example.com/catgeoryname/b

      www.example.com/catgeoryname/c

      etc

      which would show all products that start with those letters.  There are a lot of other filters too, such as color, size, etc, but the bottom line is I point all those back to just www.example.com/categoryname using rel canonical and am not understanding why it isn't working properly.

      AlanMosley 1 Reply Last reply Reply Quote 0
      • AlanMosley
        AlanMosley @PeteGregory last edited by

        OK, sorry I was thinking too many pages, not links.
        using no-index will not stop PR flowing, the search engine will still follow the links.

        1 Reply Last reply Reply Quote 1
        • PeteGregory
          PeteGregory @PeteGregory last edited by

          thx

          1 Reply Last reply Reply Quote 0
          • AlanMosley
            AlanMosley @PeteGregory last edited by

            Because rel canonical does nothing more than give credit to teh chosen page and aviod duplicat content. it does not tell the SE to stop indexing or redirect. as far as finding the links it has no affect

            1 Reply Last reply Reply Quote 0
            • sichristie
              sichristie last edited by

              Did you find a solution for this? I have exactly the same issue and have implemented the rel canonical in exactly the same way.

              The issue you are trying to address is improving crawl bandwidth/equity by not letting Google crawl these faceted pages.

              I am thinking of Ajax loading in these pages to the parent category page and/or adding nofollow to the links. But the pages have already been indexed, so I wonder if nofollow will have any effect.

              Have you had any progress? Any further ideas?

              PeteGregory 1 Reply Last reply Reply Quote 0
              • Dr-Pete
                Dr-Pete last edited by

                I'm working on a couple of cases now, and it is extremely tricky. Google often doesn't re-crawl/re-cache deeper pages for weeks or months, so getting the canonical to work can be a long process. Still, it is generally a very effective tag and can happen quickly.

                I agree with others that Robots.txt isn't a good bet. It also tends to work badly with pages that are already indexed. It's good for keeping things out of the index (especially whole folders, for example), but once 1000s of pages are indexed, Robots.txt often won't clean them up.

                Another option is META NOINDEX, but it depends on the nature of the facets.

                A couple of things to check:

                (1) Using site: with inurl:, monitor the faceted navigation pages in the Google index. Are the numbers gradually dropping? That's what you want to see - the GWT error may not update very often. Keep in mind that these numbers can be unreliable, so monitor them daily over a few weeks.

                (2) Are there are other URLs you're missing? On a large, e-commerce site, it's entirely possibly this wasn't the only problem.

                (3) Did you cut the crawl paths? A common problem is that people canonical, 301-redirect, or NOINDEX, but then nofollow or otherwise cut links to those duplicates. Sounds like a good idea, except that the canonical tag has to be crawled to work. I see this a lot, actually.

                sichristie 1 Reply Last reply Reply Quote 1
                • sichristie
                  sichristie @Dr-Pete last edited by

                  Thanks Dr Pete,

                  Yes I've used meta no-index on pages that are simply not useful in any way shape or form for Google to find.

                  I would be hesitant noindexing my filters in question, but it sounds promising that you are backing the canonical approach and there is a latency on reporting. Our PA and DA is extremely high and we get crawled daily, so curious about your measurement tip (inurl) which is a good one!

                  Many thanks.

                  Simon

                  1 Reply Last reply Reply Quote 0
                  • Dr-Pete
                    Dr-Pete last edited by

                    I should actually add that Google doesn't condone using rel-canonical back to the main search page or page 1. They allow canonical to a "View All" or a complex mix of rel-canonical and rel=prev/next. If you use rel-canonical on too many non-identical pages, they could ignore it (although I don't often find that to be true).

                    Vanessa Fox just did a write-up on Google's approach:

                    http://searchengineland.com/implementing-pagination-attributes-correctly-for-google-114970

                    I have to be honest, though - I'm not a fan of Google's approach. It's incredibly complicated, easy to screw up, doesn't seem to work in all cases, and doesn't work on Bing. This is a very complex issue and really depends on the site in question. Adam Audette did a good write-up:

                    http://searchengineland.com/five-step-strategy-for-solving-seo-pagination-problems-95494

                    sichristie 1 Reply Last reply Reply Quote 1
                    • PeteGregory
                      PeteGregory @sichristie last edited by

                      As for my situation it worked out quite nicely, I just wasn't patient enough.  After about 2 months the issue corrected itself for the most part and I was able to reduce about a million "waste" pages out of the index.  This is a very large site so losing a million pages in a handful of categories helped me gain in a whole lot of other areas and spread the crawler around to more places that were important for us.

                      I also spent some time doing some restructuring of internal linking from some of our more authoritative pages that I believe also assisted with this, but in my case rel="canonical" worked out pretty nicely.  Just took some time and patience.

                      1 Reply Last reply Reply Quote 0
                      • sichristie
                        sichristie @Dr-Pete last edited by

                        Yes that's a different situation. You're now talking about pagination, which quite rightly, canonicals to parent page is not to be used.

                        For faceted/filtered navigation it seems like canonical usage is indeed the right way to go about it, given Peter's experience just mentioned above, and the article you linked to that says, "...(in part because Google only indexes the content on the canonical page, so any content from the rest of the pages in the series would be ignored)."

                        1 Reply Last reply Reply Quote 0
                        • 1 / 1
                        • First post
                          Last post
                        • Site:www Issue - Homepage of the website is not showing in Google
                          Nigel_Carr
                          Nigel_Carr
                          0
                          2
                          51

                        • Rel canonical on other page instead of duplicate page. How Google responds?
                          vtmoz
                          vtmoz
                          0
                          6
                          90

                        • We recently transitioned a site to our server, but Google is still showing the old server's urls. Is there a way to stop Google from showing urls?
                          BlueprintMarketing
                          BlueprintMarketing
                          0
                          11
                          125

                        • Rel canonical
                          FFTCOUK
                          FFTCOUK
                          0
                          6
                          149

                        • Is it better to build a large site that covers many verticals or many sites dedicated to each vertical
                          TommyTan
                          TommyTan
                          0
                          4
                          160

                        • Should I use canonical tags on my site?
                          Dr-Pete
                          Dr-Pete
                          0
                          3
                          512

                        • Will google punish us for using formulaic keyword-rich content on different pages on our site?
                          Dan-Petrovic
                          Dan-Petrovic
                          0
                          5
                          926

                        • Does Google factor in site traffic, time on site, bounce rate, etc into it's ranking algorithm?
                          CPU
                          CPU
                          0
                          5
                          1.4k

                        Get started with Moz Pro!

                        Unlock the power of advanced SEO tools and data-driven insights.

                        Start my free trial
                        Products
                        • Moz Pro
                        • Moz Local
                        • Moz API
                        • Moz Data
                        • STAT
                        • Product Updates
                        Moz Solutions
                        • SMB Solutions
                        • Agency Solutions
                        • Enterprise Solutions
                        • Digital Marketers
                        Free SEO Tools
                        • Domain Authority Checker
                        • Link Explorer
                        • Keyword Explorer
                        • Competitive Research
                        • Brand Authority Checker
                        • Local Citation Checker
                        • MozBar Extension
                        • MozCast
                        Resources
                        • Blog
                        • SEO Learning Center
                        • Help Hub
                        • Beginner's Guide to SEO
                        • How-to Guides
                        • Moz Academy
                        • API Docs
                        About Moz
                        • About
                        • Team
                        • Careers
                        • Contact
                        Why Moz
                        • Case Studies
                        • Testimonials
                        Get Involved
                        • Become an Affiliate
                        • MozCon
                        • Webinars
                        • Practical Marketer Series
                        • MozPod
                        Connect with us

                        Contact the Help team

                        Join our newsletter
                        Moz logo
                        © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
                        • Accessibility
                        • Terms of Use
                        • Privacy