The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Intermediate & Advanced SEO
    4. How to Disallow Tag Pages With Robot.txt

    How to Disallow Tag Pages With Robot.txt

    Intermediate & Advanced SEO
    6 3 4.0k
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • monster99
      monster99 last edited by

      Hi i have a site which i'm dealing with that has tag pages for instant -

      http://www.domain.com/news/?tag=choice

      How can i exclude these tag pages (about 20+ being crawled and indexed by the search engines with robot.txt

      Also sometimes they're created dynamically so i want something which automatically excludes tage pages from being crawled and indexed.

      Any suggestions?

      Cheers,

      Mark

      1 Reply Last reply Reply Quote 0
      • DeanAndrews
        DeanAndrews last edited by

        Hi Mark

        If your using Wordpress then I would recommend SEO Yoast to resolve the tag issue. If not then I suggest you amend the robots.txt file to resolve.

        Here is an example:

        Disallow: /?tag=
        Disallow: /
        ?subcats=
        Disallow: /*?features_hash=

        NOTE:

        Be very careful when blocking search engines. Test and test again!

        monster99 1 Reply Last reply Reply Quote 0
        • NakulGoyal
          NakulGoyal last edited by

          I agree. I would suggest adding the noindex on the pages and letting the bots crawl them. Blocking them would prevent future crawl of these pages, but I am guessing you would also want to remove the existing pages.

          Therefore add the noindex first, wait a few days and then add the disallow (Although technically if they are noindex, you don't really need the disallow).

          1 Reply Last reply Reply Quote 0
          • monster99
            monster99 @DeanAndrews last edited by

            Thanks, is there a way to test it out before actually implementing it with the site.

            The site is non-wordpress aswell.

            Cheers,

            Mark

            NakulGoyal monster99 2 Replies Last reply Reply Quote 0
            • NakulGoyal
              NakulGoyal @monster99 last edited by

              What CMS is it Mark ?

              1 Reply Last reply Reply Quote 0
              • monster99
                monster99 @monster99 last edited by

                Hi Nakul, its Drupal

                Mark

                1 Reply Last reply Reply Quote 0
                • 1 / 1
                • First post
                  Last post
                • Is robots met tag a more reliable than robots.txt at preventing indexing by Google?
                  Bobbi_Tschumper
                  Bobbi_Tschumper
                  1
                  7
                  3.0k

                • Robots.txt, Disallow & Indexed-Pages..
                  thekiller99
                  thekiller99
                  0
                  5
                  341

                • Should I use meta noindex and robots.txt disallow?
                  ntcma
                  ntcma
                  0
                  5
                  923

                • Robots.txt: Syntax URL to disallow
                  Anti-Alex
                  Anti-Alex
                  0
                  8
                  479

                • Do I need to disallow the dynamic pages in robots.txt?
                  esiow2013
                  esiow2013
                  0
                  11
                  1.0k

                • Disallow my store in robots.txt?
                  AlanMosley
                  AlanMosley
                  0
                  2
                  308

                • Why are new pages not being indexed, and old pages (now in robots.txt) remain in the index?
                  KeriMorgret
                  KeriMorgret
                  0
                  3
                  378

                • Robots.txt is blocking Wordpress Pages from Googlebot?
                  Desiree-CP
                  Desiree-CP
                  0
                  4
                  10.7k

                Get started with Moz Pro!

                Unlock the power of advanced SEO tools and data-driven insights.

                Start my free trial
                Products
                • Moz Pro
                • Moz Local
                • Moz API
                • Moz Data
                • STAT
                • Product Updates
                Moz Solutions
                • SMB Solutions
                • Agency Solutions
                • Enterprise Solutions
                • Digital Marketers
                Free SEO Tools
                • Domain Authority Checker
                • Link Explorer
                • Keyword Explorer
                • Competitive Research
                • Brand Authority Checker
                • Local Citation Checker
                • MozBar Extension
                • MozCast
                Resources
                • Blog
                • SEO Learning Center
                • Help Hub
                • Beginner's Guide to SEO
                • How-to Guides
                • Moz Academy
                • API Docs
                About Moz
                • About
                • Team
                • Careers
                • Contact
                Why Moz
                • Case Studies
                • Testimonials
                Get Involved
                • Become an Affiliate
                • MozCon
                • Webinars
                • Practical Marketer Series
                • MozPod
                Connect with us

                Contact the Help team

                Join our newsletter
                Moz logo
                © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
                • Accessibility
                • Terms of Use
                • Privacy