The Moz Q&A Forum

    SEO Best Practices regarding Robots.txt disallow

    Intermediate & Advanced SEO
    • jamiegriz

      I cannot find hard and fast direction about the following issue:

      It looks like the robots.txt file on my server has been set up to disallow "account" and "search" pages within my site (Disallow: /Account/ and Disallow: /?search=), so I am receiving warnings from Google Search Console that URLs are being blocked by robots.txt. Do you recommend unblocking these URLs?

      The warning says that over 18,000 URLs are blocked by robots.txt ("Sitemap contains urls which are blocked by robots.txt"). It seems to me that I wouldn't want that many URLs blocked.

      Thank you!!

      • TheKatzMeow

        Typically, you only want robots.txt to block access points that would allow hackers into your site like an admin page (e.g. www.examplesite.com/admin/). You definitely don't want it blocking your whole site. A developer or webmaster would be better at speaking to the specifics, but that's the quick, high-level answer.

        • mememax

          That could be completely normal. Google sends the warning because you're giving it conflicting directions: you are preventing it from crawling pages (via robots.txt) that you asked it to index (via the sitemap).

          Google does not know how important those pages are to you, so you are the one who needs to assess what to do next.

          Are those pages important to you? Do you want them in the index? If so, change your robots.txt rules; if not, remove them from the sitemap.

          About the previous answer: robots.txt is not used to block hackers. Quite the opposite: hackers can easily read robots.txt to find the pages you'd like to hide and then visit them, since they may be key pages (e.g. wp-admin). Let's not focus on that, though, as hackers have many other ways to find core pages and it's not the topic here. Robots.txt is normally used to avoid duplication issues and to keep Google from crawling low-value pages and wasting crawl budget.
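To see which sitemap entries trigger the "blocked by robots.txt" warning discussed above, a rough check can be scripted with Python's standard `urllib.robotparser`. This is only a sketch: the `User-agent: *` line, the `www.example.com` domain, the sample URLs, and the helper name `blocked_urls` are assumptions for illustration, and Python's parser is not Google's, so edge cases (for example rule precedence when `Allow` and `Disallow` overlap) may be evaluated differently.

```python
from urllib.robotparser import RobotFileParser

# The two Disallow rules quoted in the question.
# The "User-agent: *" line is an assumption; the original file was not shown.
ROBOTS_TXT = """\
User-agent: *
Disallow: /Account/
Disallow: /?search=
"""

# Hypothetical sitemap entries on a placeholder domain.
SITEMAP_URLS = [
    "https://www.example.com/",
    "https://www.example.com/products/widget",
    "https://www.example.com/Account/login",
    "https://www.example.com/?search=widget",
]


def blocked_urls(robots_txt, urls, user_agent="Googlebot"):
    """Return the URLs that the given robots.txt text disallows for user_agent."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return [url for url in urls if not parser.can_fetch(user_agent, url)]


if __name__ == "__main__":
    for url in blocked_urls(ROBOTS_TXT, SITEMAP_URLS):
        print("in sitemap but blocked:", url)
```

Each URL the script reports is one to either unblock in robots.txt or drop from the sitemap, depending on whether you want it indexed.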

          • jamiegriz
            jamiegriz, replying to @mememax

            Thank you for your response! I'm going to do a bit more research, but I think I will keep "account" disallowed and unblock "search". The search feature on my site pulls up quality content, so it seems like I would want that to be crawled. Does this sound logical to you? 🙂

            • mememax
              mememax, replying to @jamiegriz

              Mmm, it depends.

              It's really hard for me to answer without knowing your site, but I would say you're headed in the right direction. You want to give Google more ways to reach your quality content.

              Now, do any other pages bring bots to that content via normal user navigation, or is it all search driven?

              While Google can crawl pages that it discovers via internal or external links, it can't reproduce searches by typing in your nav bar, so I doubt those pages would be very valuable unless you link to them somehow. If you do, you may want to keep Google crawling them.

              Whether you want to index them is a different matter: being search results, they probably aggregate information already present on the site. For indexation purposes, you may want to keep them out of the index while still allowing the bot to run through them.
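One common way to get "crawlable but not indexed" is to leave the URL unblocked in robots.txt and serve a `noindex` directive with the page, either as a robots meta tag or as an `X-Robots-Tag` response header. Note that the directive can only be seen if the crawler is allowed to fetch the page. The sketch below shows one possible way to decide the header value; the function name `x_robots_tag`, the `search` parameter name (taken from the `/?search=` rule in the question), and the example URLs are assumptions, and wiring this into a real server depends on your stack.

```python
from urllib.parse import urlsplit, parse_qs


def x_robots_tag(url, search_param="search"):
    """Return an X-Robots-Tag value for internal search URLs, else None.

    "noindex, follow" asks crawlers to keep the page out of the index
    while still following the links it contains.
    """
    query = parse_qs(urlsplit(url).query, keep_blank_values=True)
    if search_param in query:
        return "noindex, follow"
    return None


if __name__ == "__main__":
    for url in (
        "https://www.example.com/?search=widget",   # hypothetical search URL
        "https://www.example.com/products/widget",  # hypothetical content URL
    ):
        print(url, "->", x_robots_tag(url))
```

A server or middleware layer would add the returned value as the `X-Robots-Tag` header on matching responses and send no such header otherwise.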

              Again, beware of crawl budget: you don't want Google wandering around millions of search results instead of your money pages, unless you're able to let it crawl only a subset of them.

              I hope this made sense 🙂


              © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.