The Moz Q&A Forum

    Google Indexing Duplicate URLs : Ignoring Robots & Canonical Tags

    Intermediate & Advanced SEO
    • JBGlobalSEO

      Hi Moz Community,

      We have the following robots.txt rule, which should prevent URLs with tracking parameters from being indexed.

      Disallow: /*?

      We have noticed Google has started indexing pages that use tracking parameters. Example below:

      http://www.oakfurnitureland.co.uk/furniture/original-rustic-solid-oak-4-drawer-storage-coffee-table/1149.html

      http://www.oakfurnitureland.co.uk/furniture/original-rustic-solid-oak-4-drawer-storage-coffee-table/1149.html?ec=affee77a60fe4867
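To make the rule concrete, here is a minimal sketch of how a wildcard pattern such as `/*?` can be matched against the two example URLs. It assumes Google-style pattern handling, where `*` matches any run of characters and a trailing `$` anchors the end of the URL. The helper names `rule_to_regex` and `is_disallowed` are hypothetical, and the code models a single Disallow rule, not a complete robots.txt parser.

```python
import re
from urllib.parse import urlsplit


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate one robots.txt path pattern into a compiled regex.

    Assumption: '*' matches any run of characters and a trailing '$'
    anchors the match to the end of the URL.
    """
    anchored = rule.endswith("$")
    if anchored:
        rule = rule[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in rule.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(url: str, rule: str) -> bool:
    """Return True if the URL's path plus query string matches the rule."""
    parts = urlsplit(url)
    target = parts.path or "/"
    if parts.query:
        target += "?" + parts.query
    # re.match anchors at the start, mirroring robots.txt prefix matching.
    return rule_to_regex(rule).match(target) is not None


RULE = "/*?"
CLEAN = (
    "http://www.oakfurnitureland.co.uk/furniture/"
    "original-rustic-solid-oak-4-drawer-storage-coffee-table/1149.html"
)
TRACKED = CLEAN + "?ec=affee77a60fe4867"

print(is_disallowed(CLEAN, RULE))    # False: no query string, rule does not apply
print(is_disallowed(TRACKED, RULE))  # True: the "?" makes the rule match
```

Under these assumptions the rule does cover the tracked URL, which suggests the pattern itself is not the problem. Note that a match only tells a crawler not to fetch the URL; it does not by itself say anything about whether the URL can appear in the index.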

      These pages are identified as duplicate content yet have the correct canonical tags:

      https://www.google.co.uk/search?num=100&site=&source=hp&q=site%3Ahttp%3A%2F%2Fwww.oakfurnitureland.co.uk%2Ffurniture%2Foriginal-rustic-solid-oak-4-drawer-storage-coffee-table%2F1149.html&oq=site%3Ahttp%3A%2F%2Fwww.oakfurnitureland.co.uk%2Ffurniture%2Foriginal-rustic-solid-oak-4-drawer-storage-coffee-table%2F1149.html&gs_l=hp.3..0i10j0l9.4201.5461.0.5879.8.8.0.0.0.0.82.376.7.7.0....0...1c.1.58.hp..3.5.268.0.JTW91YEkjh4

      With various affiliate feeds available for our site, we effectively have duplicate versions of every page due to the tracking query that Google seems to be willing to index, ignoring both robots rules & canonical tags.
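As a rough illustration of why the tracked URLs count as duplicates, the sketch below strips the tracking parameter and shows that both example URLs collapse to the same address, which is the URL a canonical tag would be expected to point at. The `TRACKING_PARAMS` set and the `canonical_url` helper are assumptions made for this example; only the `ec` parameter is taken from the thread.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Assumption: "ec" is the only tracking parameter, as in the example URL.
TRACKING_PARAMS = {"ec"}


def canonical_url(url: str) -> str:
    """Return the URL with known tracking parameters removed."""
    parts = urlsplit(url)
    kept = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key not in TRACKING_PARAMS
    ]
    # Rebuild the URL without the fragment and without the tracking pairs.
    return urlunsplit((parts.scheme, parts.netloc, parts.path, urlencode(kept), ""))


CLEAN = (
    "http://www.oakfurnitureland.co.uk/furniture/"
    "original-rustic-solid-oak-4-drawer-storage-coffee-table/1149.html"
)
TRACKED = CLEAN + "?ec=affee77a60fe4867"

print(canonical_url(TRACKED) == CLEAN)  # True
```

If other affiliate feeds use different parameter names, they would need to be added to the set for this check to hold.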

      Can anyone shed any light onto the situation?

      • AlanBleiweiss

        Google's multi-layered multi-algorithm system has come a long way in being able to "figure it all out", yet at the same time, falls far short of always successfully "getting it right".

        Robots.txt files are no longer an absolute directive. They're now "just another signal", as are canonical tags, meta robots instructions, and Google's own URL Parameters system in Webmaster Tools.

        Because of this, it's critical to be consistent across all signals. If you've got the robots.txt file set to block pages, but also have inbound links from affiliates that aren't nofollowed, that's a prime example of where inbound link signals can override the robots.txt file's instruction.

        While they technically SHOULD not index them after discovering them off-site (because the destination says "index this other version"), that's part of their confused multilayered system.
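One way to think about signal consistency is as a simple per-URL check. The sketch below is only an illustration: the `UrlSignals` fields and the `find_conflicts` function are hypothetical names, and the two rules encode the situation described above plus one added assumption, namely that a crawler cannot read a canonical tag on a page it has been told not to fetch.

```python
from dataclasses import dataclass


@dataclass
class UrlSignals:
    """Signals observed for one URL (all fields are illustrative)."""
    blocked_by_robots: bool
    canonical_points_elsewhere: bool
    followed_inbound_links: bool


def find_conflicts(signals: UrlSignals) -> list[str]:
    """List the ways the signals for a URL work against each other."""
    issues = []
    if signals.blocked_by_robots and signals.followed_inbound_links:
        issues.append(
            "blocked in robots.txt but has followed inbound links; "
            "the URL may still be listed"
        )
    if signals.blocked_by_robots and signals.canonical_points_elsewhere:
        issues.append(
            "canonical tag sits on a page the crawler is told not to fetch"
        )
    return issues


# The affiliate URLs from this thread, as described: blocked, canonicalised,
# and linked from affiliate sites without nofollow.
affiliate_url = UrlSignals(
    blocked_by_robots=True,
    canonical_points_elsewhere=True,
    followed_inbound_links=True,
)
for issue in find_conflicts(affiliate_url):
    print(issue)
```

A real audit would collect these flags from crawl data; here they are set by hand to mirror the case under discussion.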

        I have a question though. From the limited information you've provided, this example is based on a URL parameter of ?ec=

        When I search Google using site:http://www.oakfurnitureland.co.uk/ inurl:ec

        I see only three such pages that are "fully" indexed. All the rest (over 1,000 additional URLs) are in the Google system, but every one of them has a meta description of "A description for this result is not available because of this site's robots.txt - learn more."

        What that means is they are NOT fully indexing those pages - there is no worry to be had about duplicate content for those. Google is simply tracking that those URLs exist.

        So - is that the only URL parameter you're worried about? If so, it's not a major problem on your site. Except for those few exceptions, Google is doing what you need them to do with those.
