The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Intermediate & Advanced SEO
    4. Robots.txt, Disallow & Indexed-Pages..

    Robots.txt, Disallow & Indexed-Pages..

    Intermediate & Advanced SEO
    5 4 341
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • thekiller99
      thekiller99 last edited by

      Hi guys,

      hope you're well.

      I have a problem with my new website. I have 3 pages with the same content:

      • http://example.examples.com/brand/brand1 (good page)
      • http://example.examples.com/brand/brand1?show=false
      • http://example.examples.com/brand/brand1?show=true

      The good page has rel=canonical & it is the only page should be appear in Search results but Google has indexed 3 pages...

      I don't know how should do now, but, i am thinking 2 posibilites:

      1. Remove filters (true, false) and leave only the good page and show 404 page for others pages.
      2. Update robots.txt with disallow for these parameters & remove those URL's manually

      Thank you so much!

      1 Reply Last reply Reply Quote 0
      • Yoav-Blustein
        Yoav-Blustein last edited by

        Hi!

        Not sure if i understood how you implemented the canonical element on your pages, but it sounds like you have only put the canonical code to what you call "good page"

        The scenario should be like this:
        1. You have 3 pages with similar/exact content.
        2. Obviously you want to index only one of them and in your case it is the one without the parameters ("good page")
        3. You need to go ahead and implement the canonical elements in the following way:

        • page-1:  http://example.examples.com/brand/brand1  (you do not have to, but if it makes it ieasier for you you can use self canonical.)
        • page-2:  http://example.examples.com/brand/brand1?show=false  (canonical to page-1)
        • page-3:  http://example.examples.com/brand/brand1?show=true (canonical page-1)

        PS. Google best practice suggests that you should never use robots.txt to de-index a page from the search results. In case you decide to remove certain pages completely from the search results, the best practice is to 404 them and use Google Search console to signal google that these pages are no longer available. But if you implement the canonical element as described above, you will have no problems.

        Best

        Yossi

        1 Reply Last reply Reply Quote 0
        • solvid
          solvid last edited by

          Hi,

          Did you actually implement canonical tags on duplicate pages, and do the point to the original piece?

          1 Reply Last reply Reply Quote 1
          • MattRoney
            MattRoney last edited by

            Hi thekiller99! Did this get worked out? We'd love an update. 🙂

            1 Reply Last reply Reply Quote 0
            • thekiller99
              thekiller99 last edited by

              Finally, i decided to do the next:

              1. Delete all pages from my site with filters (i have the option and it wasn't a problem)

              2. Delete URL using GWT individually

              It works!

              1 Reply Last reply Reply Quote 0
              • 1 / 1
              • First post
                Last post
              • Is robots met tag a more reliable than robots.txt at preventing indexing by Google?
                Bobbi_Tschumper
                Bobbi_Tschumper
                1
                7
                3.0k

              • How do we decide which pages to index/de-index? Help for a 250k page site
                julie-getonthemap
                julie-getonthemap
                0
                2
                63

              • Robots.txt Disallowed Pages and Still Indexed
                Igor.Go
                Igor.Go
                0
                3
                2.9k

              • Will disallowing URL's in the robots.txt file stop those URL's being indexed by Google
                Martijn_Scheijbeler
                Martijn_Scheijbeler
                0
                11
                1.6k

              • Will disallowing in robots.txt noindex a page?
                FranckNlemba
                FranckNlemba
                0
                6
                510

              • Disallow my store in robots.txt?
                AlanMosley
                AlanMosley
                0
                2
                308

              • Why are new pages not being indexed, and old pages (now in robots.txt) remain in the index?
                KeriMorgret
                KeriMorgret
                0
                3
                378

              • Should we block urls like this - domainname/shop/leather-chairs.html?brand=244&cat=16&dir=ascℴ=price&price=1 within the robots.txt?
                sferrino
                sferrino
                0
                2
                864

              Get started with Moz Pro!

              Unlock the power of advanced SEO tools and data-driven insights.

              Start my free trial
              Products
              • Moz Pro
              • Moz Local
              • Moz API
              • Moz Data
              • STAT
              • Product Updates
              Moz Solutions
              • SMB Solutions
              • Agency Solutions
              • Enterprise Solutions
              • Digital Marketers
              Free SEO Tools
              • Domain Authority Checker
              • Link Explorer
              • Keyword Explorer
              • Competitive Research
              • Brand Authority Checker
              • Local Citation Checker
              • MozBar Extension
              • MozCast
              Resources
              • Blog
              • SEO Learning Center
              • Help Hub
              • Beginner's Guide to SEO
              • How-to Guides
              • Moz Academy
              • API Docs
              About Moz
              • About
              • Team
              • Careers
              • Contact
              Why Moz
              • Case Studies
              • Testimonials
              Get Involved
              • Become an Affiliate
              • MozCon
              • Webinars
              • Practical Marketer Series
              • MozPod
              Connect with us

              Contact the Help team

              Join our newsletter
              Moz logo
              © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
              • Accessibility
              • Terms of Use
              • Privacy