The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Search Engine Trends
    4. Have you ever seen or experienced a page indexed which is actually from a website which is blocked by robots.txt?

    Have you ever seen or experienced a page indexed which is actually from a website which is blocked by robots.txt?

    Search Engine Trends
    2 2 54
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • vtmoz
      vtmoz last edited by

      Hi all,

      We use robots file and meta robots tags for blocking website or website pages to block bots from crawling. Mostly robots.txt will be used for website and expect all the pages to not getting indexed. But there is a condition here that any page from website can be indexed by Google even the site is blocked from robots.txt; because crawler may find the page link somewhere on internet as stated here at last paragraph. I wonder if this really the case where some webpages have got indexed.

      And even we use meta tags at page level; do we need to block from robots.txt file? Can we use both techniques at a time?

      Thanks

      1 Reply Last reply Reply Quote 0
      • GastonRiera
        GastonRiera last edited by

        Hi vtmoz,

        The most mandatory way to prevent any page to be indexed is by using a meta robots tag with a _noindex _parameter.
        Then using robots.txt will help to optimize your server resources and is a way that prevent google to crawl any new page that do not have the meta robots tag.

        And yeah, its very common to have indexed pages even the robots.txt file blocks the entire website.

        If what you are looking for is to remove from index the pages, follow this steps:

        1. Allow the whole website to be crawable (or at least that specific pages/section) in the robots.txt
        2. add the robots meta tag with "noindex,follow" parametres
        3. wait several weeks, 6 to 8 weeks is a fairly good time. Or just do a followup on those pages
        4. when you got the results (all your desired pages to be de-indexed) re-block with robots.txt those pages
        5. DO NOT erase the meta robots tag.

        Hope it helps.
        Best luck.
        GR.

        1 Reply Last reply Reply Quote 1
        • 1 / 1
        • First post
          Last post
        • Indexed, though blocked by robots.txt: Need to bother?
          GastonRiera
          GastonRiera
          1
          2
          121

        • Duplicate website pages indexed: Ranking dropped. Does Google checks the duplicate domain association?
          vtmoz
          vtmoz
          0
          4
          127

        • Meta robots at every page rather than using robots.txt for blocking crawlers? How they'll get indexed if we block crawlers?
          ThompsonPaul
          ThompsonPaul
          0
          3
          253

        • Bing not indexing pages
          katemorris
          katemorris
          0
          4
          211

        • Website dropping from page 1google uk
          Johnny4B
          Johnny4B
          0
          8
          211

        • Google indexing my website's Search Results pages. Should I block this?
          irvingw
          irvingw
          0
          4
          4.9k

        • Should I block non-informative pages from Google's index?
          UnderRugSwept
          UnderRugSwept
          1
          10
          795

        • Why would my product pages no longer be indexed in Google?
          KeriMorgret
          KeriMorgret
          0
          9
          2.7k

        Get started with Moz Pro!

        Unlock the power of advanced SEO tools and data-driven insights.

        Start my free trial
        Products
        • Moz Pro
        • Moz Local
        • Moz API
        • Moz Data
        • STAT
        • Product Updates
        Moz Solutions
        • SMB Solutions
        • Agency Solutions
        • Enterprise Solutions
        • Digital Marketers
        Free SEO Tools
        • Domain Authority Checker
        • Link Explorer
        • Keyword Explorer
        • Competitive Research
        • Brand Authority Checker
        • Local Citation Checker
        • MozBar Extension
        • MozCast
        Resources
        • Blog
        • SEO Learning Center
        • Help Hub
        • Beginner's Guide to SEO
        • How-to Guides
        • Moz Academy
        • API Docs
        About Moz
        • About
        • Team
        • Careers
        • Contact
        Why Moz
        • Case Studies
        • Testimonials
        Get Involved
        • Become an Affiliate
        • MozCon
        • Webinars
        • Practical Marketer Series
        • MozPod
        Connect with us

        Contact the Help team

        Join our newsletter
        Moz logo
        © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
        • Accessibility
        • Terms of Use
        • Privacy