The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Intermediate & Advanced SEO
    4. Google Indexing Duplicate URLs : Ignoring Robots & Canonical Tags

    Google Indexing Duplicate URLs : Ignoring Robots & Canonical Tags

    Intermediate & Advanced SEO
    2 2 321
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • JBGlobalSEO
      JBGlobalSEO last edited by

      Hi Moz Community,

      We have the following robots command that should prevent URLs with tracking parameters being indexed.

      Disallow: /*?

      We have noticed google has started indexing pages that are using tracking parameters. Example below.

      http://www.oakfurnitureland.co.uk/furniture/original-rustic-solid-oak-4-drawer-storage-coffee-table/1149.html

      http://www.oakfurnitureland.co.uk/furniture/original-rustic-solid-oak-4-drawer-storage-coffee-table/1149.html?ec=affee77a60fe4867

      These pages are identified as duplicate content yet have the correct canonical tags:

      https://www.google.co.uk/search?num=100&site=&source=hp&q=site%3Ahttp%3A%2F%2Fwww.oakfurnitureland.co.uk%2Ffurniture%2Foriginal-rustic-solid-oak-4-drawer-storage-coffee-table%2F1149.html&oq=site%3Ahttp%3A%2F%2Fwww.oakfurnitureland.co.uk%2Ffurniture%2Foriginal-rustic-solid-oak-4-drawer-storage-coffee-table%2F1149.html&gs_l=hp.3..0i10j0l9.4201.5461.0.5879.8.8.0.0.0.0.82.376.7.7.0....0...1c.1.58.hp..3.5.268.0.JTW91YEkjh4

      With various affiliate feeds available for our site, we effectively have duplicate versions of every page due to the tracking query that Google seems to be willing to index, ignoring both robots rules & canonical tags.

      Can anyone shed any light onto the situation?

      1 Reply Last reply Reply Quote 0
      • AlanBleiweiss
        AlanBleiweiss last edited by

        Google's multi-layered multi-algorithm system has come a long way in being able to "figure it all out", yet at the same time, falls far short of always successfully "getting it right".

        Robots.txt files are no longer an absolute directive.  They're now "just another signal", as are canonical tags, meta robots instructions, and their own Google Webmaster URL Parameters system.

        Because of this its critical to be consistent across all signals.  If you've got the robots.txt file set to not index pages, but also have inbound links from affiliates, that's a prime example of where inbound link signals can override the robots.txt file's instruction if they're not nofollowed links.

        While they technically SHOULD not index them after discovering them off-site (because the destination says "index this other version"), that's part of their confused multilayered system.

        I have a question though - from what limited information you've provided, this example is based on a url parameter of ?ec=

        When I search Google using site:http://www.oakfurnitureland.co.uk/ inurl:ec

        I see only three such pages indexed AND where those pages are "fully" indexed.  All the rest (over 1,000 additional URLs), are in the Google system, however every one of those others has a meta description of "A description for this result is not available because of this site's robots.txt - learn more."

        What that means is they are NOT fully indexing those pages - there is no worry to be had about duplicate content for those. Google is simply tracking that those URLs exist.

        So - is that the only URL parameter you're worried about? If so, it's not a major problem on your site. Except for those few exceptions, Google is doing what you need them to do with those.

        1 Reply Last reply Reply Quote 1
        • 1 / 1
        • First post
          Last post
        • Same URL-Structure & the same number of URLs indexed on two different websites - can it lead to a Google penalty?
          0
          1
          13

        • My url disappeared from Google but Search Console shows indexed. This url has been indexed for more than a year. Please help!
          jacobmartinnn
          jacobmartinnn
          0
          3
          74

        • Google Ignoring Canonical Tag for Hundreds of Sites
          ThomasHarvey
          ThomasHarvey
          1
          7
          1.0k

        • Case Sensitive URLs, Duplicate Content & Link Rel Canonical
          allianzireland
          allianzireland
          0
          4
          3.0k

        • Canonical URL & sitemap URL mismatch
          LynnPatchett
          LynnPatchett
          0
          2
          612

        • URL Parameter Being Improperly Crawled & Indexed by Google
          kevin_reyes
          kevin_reyes
          0
          3
          396

        • Will Canonical tag on parameter URLs remove those URL's from Index, and preserve link juice?
          StreamlineMetrics
          StreamlineMetrics
          0
          2
          113

        • Duplicate Title Tags & Duplication Meta Description after 301 Redirect
          avassa
          avassa
          0
          3
          2.4k

        Get started with Moz Pro!

        Unlock the power of advanced SEO tools and data-driven insights.

        Start my free trial
        Products
        • Moz Pro
        • Moz Local
        • Moz API
        • Moz Data
        • STAT
        • Product Updates
        Moz Solutions
        • SMB Solutions
        • Agency Solutions
        • Enterprise Solutions
        • Digital Marketers
        Free SEO Tools
        • Domain Authority Checker
        • Link Explorer
        • Keyword Explorer
        • Competitive Research
        • Brand Authority Checker
        • Local Citation Checker
        • MozBar Extension
        • MozCast
        Resources
        • Blog
        • SEO Learning Center
        • Help Hub
        • Beginner's Guide to SEO
        • How-to Guides
        • Moz Academy
        • API Docs
        About Moz
        • About
        • Team
        • Careers
        • Contact
        Why Moz
        • Case Studies
        • Testimonials
        Get Involved
        • Become an Affiliate
        • MozCon
        • Webinars
        • Practical Marketer Series
        • MozPod
        Connect with us

        Contact the Help team

        Join our newsletter
        Moz logo
        © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
        • Accessibility
        • Terms of Use
        • Privacy