The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Intermediate & Advanced SEO
    4. Is our robots.txt file correct?

    Is our robots.txt file correct?

    Intermediate & Advanced SEO
    5 4 175
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • BMPIRE
      BMPIRE last edited by

      Could you please review our robots.txt file and let me know if this is correct.

      www.faithology.com/robots.txt

      Thank you!

      1 Reply Last reply Reply Quote 0
      • StreamlineMetrics
        StreamlineMetrics last edited by

        There are some errors, but since I'm not sure what you are trying to accomplish, I recommend checking it with a tool first. Here is a great tool to check your robots.txt file and give you information on errors - http://tool.motoricerca.info/robots-checker.phtml

        If you still need assistance after running it through the tool, please reply and we can help you further.

        1 Reply Last reply Reply Quote 0
        • BMPIRE
          BMPIRE last edited by

          Thank you for the reply. We want to allow all crawling, except for rogerbot in the community folder.

          I have updated the robots.txt to the following, does this look right?:

          User-agent: *
          Disallow:
          
          User-agent: rogerbot
          Disallow: /community/
          
          User-agent: Mediapartners-Google
          Disallow:
          
          Sitemap: http://www.faithology.com/sitemap.xml
          
          view the robots here: http://www.faithology.com/robots.txt
          
          mememax 1 Reply Last reply Reply Quote 0
          • mememax
            mememax @BMPIRE last edited by

            Hi, it seems correct to me however try to use the robots.txt checker tool in GWTools. You may try to include a couple of your urls and see if google can crawl them.

            I find only redundant the follwing rule:

            User-agent: Mediapartners-Google.

            If you have already set up a disallow: rule for all bot excluding rogerbot which can't access the community folder why create a new rule stating the same for mediapartners?

            Again, why are you saying to all bots they can access the entire site, being that the default rule? Avoid those lines, include just the rogerbot and sitemaps rule and you're done.

            1 Reply Last reply Reply Quote 0
            • Igal_Zeifman
              Igal_Zeifman last edited by

              What's the end goal here?
              Are you actively trying  to block all bots?

              If so, I would still suggest "Disallow:/".
              The other syn-text may also work, but if Google suggests using a backslash, you should probably use it.

              1 Reply Last reply Reply Quote 0
              • 1 / 1
              • First post
                Last post
              • Robots.txt wildcards - the devs had a disagreement - which is correct?
                McTaggart
                McTaggart
                0
                8
                97

              • Our parent company has included their sitemap links in our robots.txt file - will that have an impact on the way our site is crawled?
                GlobeRunner
                GlobeRunner
                0
                2
                197

              • Robots.txt: how to exclude sub-directories correctly?
                MickEdwards
                MickEdwards
                1
                10
                48.0k

              • What should I block with a robots.txt file?
                Travis-W
                Travis-W
                1
                3
                298

              • Effect duration of robots.txt file.
                KeriMorgret
                KeriMorgret
                0
                4
                255

              • Could you use a robots.txt file to disalow a duplicate content page from being crawled?
                KyleChamp
                KyleChamp
                0
                11
                1.3k

              • Using 2 wildcards in the robots.txt file
                lonniea
                lonniea
                0
                2
                605

              • Negative impact on crawling after upload robots.txt file on HTTPS pages
                ShaMenz
                ShaMenz
                0
                2
                892

              Get started with Moz Pro!

              Unlock the power of advanced SEO tools and data-driven insights.

              Start my free trial
              Products
              • Moz Pro
              • Moz Local
              • Moz API
              • Moz Data
              • STAT
              • Product Updates
              Moz Solutions
              • SMB Solutions
              • Agency Solutions
              • Enterprise Solutions
              • Digital Marketers
              Free SEO Tools
              • Domain Authority Checker
              • Link Explorer
              • Keyword Explorer
              • Competitive Research
              • Brand Authority Checker
              • Local Citation Checker
              • MozBar Extension
              • MozCast
              Resources
              • Blog
              • SEO Learning Center
              • Help Hub
              • Beginner's Guide to SEO
              • How-to Guides
              • Moz Academy
              • API Docs
              About Moz
              • About
              • Team
              • Careers
              • Contact
              Why Moz
              • Case Studies
              • Testimonials
              Get Involved
              • Become an Affiliate
              • MozCon
              • Webinars
              • Practical Marketer Series
              • MozPod
              Connect with us

              Contact the Help team

              Join our newsletter
              Moz logo
              © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
              • Accessibility
              • Terms of Use
              • Privacy