The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Technical SEO Issues
    4. No index tag robots.txt

    No index tag robots.txt

    Technical SEO Issues
    11 5 3.3k
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • WeAreDigital_BE
      WeAreDigital_BE last edited by

      Hi Mozzers,

      A client's website has a lot of internal directories defined as /node/*.

      I already added the rule 'Disallow: /node/*' to the robots.txt file to prevents bots from crawling these pages.

      However, the pages are already indexed and appear in the search results.

      In an article of Deepcrawl, they say you can simply add the rule 'Noindex: /node/*' to the robots.txt file, but other sources claim the only way is to add a noindex directive in the meta robots tag of every page.

      Can someone tell me which is the best way to prevent these pages from getting indexed? Small note: there are more than 100 pages.

      Thanks!
      Jens

      1 Reply Last reply Reply Quote 0
      • KinsellaTax
        KinsellaTax last edited by

        This post is deleted!
        1 Reply Last reply Reply Quote 0
        • Nigel_Carr
          Nigel_Carr last edited by

          Hi Jens

          You can't add a noindex in the Robots.txt file.

          Firstly you need to add a noindex tag to all of the pages in the /node/ directory.
          Then remove the nofollow directive in the Robots.txt

          You need to do this for Google to see the noindex tags!

          If you have a noindex tag and a nofollow then the directory is blocked so Google can't see the tags!

          Once all the pages have gone from search then add the nofollow back to the Robots.txt file so that Google doesn't waste crawl budget trying to index them.

          This will solve your problem.

          Regards

          Nigel

          davebuts 1 Reply Last reply Reply Quote 1
          • davebuts
            davebuts @Nigel_Carr last edited by

            Hi Nigel and Jens,

            Just to clarify - noindex is valid in robots.txt for Google but it's not recognized by Bing.

            Here's a case study by Stone Temple on using noindex in robots.txt: https://www.stonetemple.com/does-google-respect-robots-txt-noindex-and-should-you-use-it/

            From their case study, it was found to be pretty effective, but not 100%. It would be a good solution for large websites, but if you're only looking at 100+ pages I would do as Nigel said above and implement the meta robots noindex tags.

            Cheers,

            David

            Nigel_Carr 1 Reply Last reply Reply Quote 0
            • Nigel_Carr
              Nigel_Carr @davebuts last edited by

              Hi Jens/David

              You should not use a noindex in Robots.txt. You can put it on the page as a robots tag, but not in Robots.txt

              I have never ever seen it used in the Robots.txt - I have seen it mentioned a few times on some questionable sites and the odd mention many years ago but it's bad practice in my opinion.

              Read more about Robots.txt here: https://moz.com/learn/seo/robotstxt

              If you follow what I have said, that is the correct solution.

              Regards Nigel

              davebuts 1 Reply Last reply Reply Quote 1
              • davebuts
                davebuts @Nigel_Carr last edited by

                Hi Nigel,

                I agreed that what you said is the best solution in this case but noindex can definitely be done in robots.txt.

                I'm not sure of the questionable sites you've seen it mentioned on, but I'd consider Stone Temple and Deep Crawl to be reputable sources.

                That said, I always like to test things for myself!

                I tried robots.txt noindex on one of my own big sports news websites a little while ago because I didn't want to manually set thousands of old posts to noindex. The robots.txt noindex worked fine.

                Cheers,

                David

                Nigel_Carr 1 Reply Last reply Reply Quote 0
                • WeAreDigital_BE
                  WeAreDigital_BE last edited by

                  Thanks a lot for your answers guys!

                  1 Reply Last reply Reply Quote 1
                  • Nigel_Carr
                    Nigel_Carr @davebuts last edited by

                    Hi David

                    I'd rather listen to John Mueller - he has specifically said to not use it:

                    https://www.seroundtable.com/google-do-not-use-noindex-in-robots-txt-20873.html

                    I wouldn't be advising people to use it on that basis whether it has worked for you this time or not. It's not best practice.

                    That's all. (Sorry Jens!)

                    Regards

                    Nigel

                    R0bin_L0rd 1 Reply Last reply Reply Quote 0
                    • R0bin_L0rd
                      R0bin_L0rd @Nigel_Carr last edited by

                      For the sake of balance, probably worth mentioning that I'm with David in that I've seen a robots.txt noindex work. It has been relatively recently used by a large publisher when they had an article they had to take down but which Google was holding on to. That's irrelevant nuance in this situation but I think David deserves more credit than he got here.

                      In terms of this specific fix I agree with Nigel - remove the Disallow and add a noindex (prompt Google to crawl the pages, with a sitemap if they don't seem to be shifting). You can re-add the Disallow if you think it's necessary but once all of the appropriate pages have a noindex tag they should stay out of the index and if they are heavily linked to on the site disallowing them could result in a loss of link equity (it'll stop with the link to the disallowed pages). So if you think you can achieve this with just a noindex you might want to leave it at that.

                      WeAreDigital_BE 1 Reply Last reply Reply Quote 1
                      • WeAreDigital_BE
                        WeAreDigital_BE @R0bin_L0rd last edited by

                        Hi Guys,

                        In Drupal between the advanced tags (meta tags), there is an option:
                        ' Prevents search engines from indexing this page '

                        Do you happen to know whether these tags are seen as valid by Searchbots?

                        Thanks again guys!

                        Nigel_Carr 1 Reply Last reply Reply Quote 0
                        • Nigel_Carr
                          Nigel_Carr @WeAreDigital_BE last edited by

                          Hi Jens

                          I don't know Drupal but if it's like Wordpress it will add a noindex tag to the page.

                          Do it for one page then take a look at the code.

                          Go to the page: right click > View Source

                          Then go to the three dots top right in chrome and search noindex. It will look like this attached. (ignore the red line crossed out piece)

                          Best Regards Nigel

                          x6DFb9q.jpg

                          1 Reply Last reply Reply Quote 1
                          • 1 / 1
                          • First post
                            Last post
                          • Should you use robots.txt for pages within your site which do not have high quality content or are not contributing a great deal so when Google crawls your site the best performing content has a higher chance of being indexed?
                            Jacksons_Fencing
                            Jacksons_Fencing
                            0
                            5
                            44

                          • Google Indexing Development Site Despite Robots.txt Block
                            DeanAndrews
                            DeanAndrews
                            0
                            6
                            990

                          • Google is indexing blocked content in robots.txt
                            bjs2010
                            bjs2010
                            0
                            5
                            147

                          • Meta Robots Noindex and Robots.txt File
                            Devanur-Rafi
                            Devanur-Rafi
                            0
                            2
                            125

                          • Robots.txt to disallow /index.php/ path
                            Mikkehl
                            Mikkehl
                            0
                            9
                            7.1k

                          • Un-Indexing a Page without robots.txt or access to HEAD
                            Desiree-CP
                            Desiree-CP
                            0
                            5
                            412

                          • Site not being Indexed that fast anymore, Is something wrong with this Robots.txt
                            oznappies
                            oznappies
                            0
                            4
                            828

                          • Robots.txt and robots meta
                            TheEspresseo
                            TheEspresseo
                            0
                            5
                            1.1k

                          Get started with Moz Pro!

                          Unlock the power of advanced SEO tools and data-driven insights.

                          Start my free trial
                          Products
                          • Moz Pro
                          • Moz Local
                          • Moz API
                          • Moz Data
                          • STAT
                          • Product Updates
                          Moz Solutions
                          • SMB Solutions
                          • Agency Solutions
                          • Enterprise Solutions
                          • Digital Marketers
                          Free SEO Tools
                          • Domain Authority Checker
                          • Link Explorer
                          • Keyword Explorer
                          • Competitive Research
                          • Brand Authority Checker
                          • Local Citation Checker
                          • MozBar Extension
                          • MozCast
                          Resources
                          • Blog
                          • SEO Learning Center
                          • Help Hub
                          • Beginner's Guide to SEO
                          • How-to Guides
                          • Moz Academy
                          • API Docs
                          About Moz
                          • About
                          • Team
                          • Careers
                          • Contact
                          Why Moz
                          • Case Studies
                          • Testimonials
                          Get Involved
                          • Become an Affiliate
                          • MozCon
                          • Webinars
                          • Practical Marketer Series
                          • MozPod
                          Connect with us

                          Contact the Help team

                          Join our newsletter
                          Moz logo
                          © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
                          • Accessibility
                          • Terms of Use
                          • Privacy