The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Search Engine Trends
    4. Meta robots at every page rather than using robots.txt for blocking crawlers? How they'll get indexed if we block crawlers?

    Meta robots at every page rather than using robots.txt for blocking crawlers? How they'll get indexed if we block crawlers?

    Search Engine Trends
    3 3 253
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • vtmoz
      vtmoz last edited by

      Hi all,

      The suggestion to use meta robots tag rather than robots.txt file is to make sure the pages do not get indexed if their hyperlinks are available anywhere on the internet. I don't understand how the pages will be indexed if the entire site is blocked? Even though there are page links are available, will Google really index those pages? One of our site got blocked from robots file but internal links are available on internet for years which are not been indexed. So technically robots.txt file is quite enough right? Please clarify and guide me if I'm wrong.

      Thanks

      1 Reply Last reply Reply Quote 0
      • GastonRiera
        GastonRiera last edited by

        Hi there,

        TLDR; The solution to deindexing and never index again:

        1. Allow (with robots.txt) the web to be crawable
        2. Aplly meta robots tag: noindex,follow
        3. Wait somte weeks to be completely deindexed
        4. block the entire site/section with robots.txt

        Robots.txt and the robots meta tag can make the same effect, but to understand them must be analyzed separatedly.

        • Robots.txt, here you just tell bots where they can go BEFORE they crawl any of the website. This is just a signal, not a directive... Because robots can choose to ignore the what's in the file. Here you can block from the entire web, to an entire section or just specific pages. More info: Robots.txt official page and a really cool and complete guide to robots.txt

        • Robots meta tag, with it you have more signals to tell, the most used are: noindex, nofollow and follow, due to the usual issues about indexing. More info: Robots.txt offical page, Google developers, Meta Robots directive - Moz and a complete guide to meta robots tag - YOAST.

        Hope this is what you wanted.
        Best luck
        GR.

        ThompsonPaul 1 Reply Last reply Reply Quote 1
        • ThompsonPaul
          ThompsonPaul @GastonRiera last edited by

          I agree with Gaston's approach right up to step 4. If you add the no-indexed pages back into a block in the robots.txt file, you'll end up back where you started from. Because Google will still discover the no-indexed URLs elsewhere and the robots,txt block will stop them from discovering the no-index, and the URLs will likely start to get added to the index again.

          No-indexed URLs must not be blocked in robots.txt. Those two processes are mutually exclusive.

          1 Reply Last reply Reply Quote 1
          • 1 / 1
          • First post
            Last post
          • When block bots using robots.txt vs meta tag "no index"?
            gertseoleverage
            gertseoleverage
            0
            5
            79

          • Have you ever seen or experienced a page indexed which is actually from a website which is blocked by robots.txt?
            GastonRiera
            GastonRiera
            0
            2
            54

          • What is the appropriate Robot.txt to unblock if Google cannot get all the resources from my homepage?
            OlegKorneitchouk
            OlegKorneitchouk
            0
            2
            5.7k

          • Google indexing my website's Search Results pages. Should I block this?
            irvingw
            irvingw
            0
            4
            4.9k

          • Does google index non-public pages ie. members logged in page
            leschal
            leschal
            0
            3
            704

          • Has Google problems in indexing pages that use <base href=""> the last days?
            0
            1
            1.4k

          • Should I block non-informative pages from Google's index?
            UnderRugSwept
            UnderRugSwept
            1
            10
            795

          • Website moving up and down SERPs alongside others in 'blocks'.
            DanDeceuster
            DanDeceuster
            0
            2
            1.1k

          Get started with Moz Pro!

          Unlock the power of advanced SEO tools and data-driven insights.

          Start my free trial
          Products
          • Moz Pro
          • Moz Local
          • Moz API
          • Moz Data
          • STAT
          • Product Updates
          Moz Solutions
          • SMB Solutions
          • Agency Solutions
          • Enterprise Solutions
          • Digital Marketers
          Free SEO Tools
          • Domain Authority Checker
          • Link Explorer
          • Keyword Explorer
          • Competitive Research
          • Brand Authority Checker
          • Local Citation Checker
          • MozBar Extension
          • MozCast
          Resources
          • Blog
          • SEO Learning Center
          • Help Hub
          • Beginner's Guide to SEO
          • How-to Guides
          • Moz Academy
          • API Docs
          About Moz
          • About
          • Team
          • Careers
          • Contact
          Why Moz
          • Case Studies
          • Testimonials
          Get Involved
          • Become an Affiliate
          • MozCon
          • Webinars
          • Practical Marketer Series
          • MozPod
          Connect with us

          Contact the Help team

          Join our newsletter
          Moz logo
          © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
          • Accessibility
          • Terms of Use
          • Privacy