The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Technical SEO Issues
    4. Is robots.txt a must-have for 150 page well-structured site?

    Is robots.txt a must-have for 150 page well-structured site?

    Technical SEO Issues
    5 3 988
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • scanlin
      scanlin last edited by

      By looking in my logs I see dozens of 404 errors each day from different bots trying to load robots.txt. I have a small site (150 pages) with clean navigation that allows the bots to index the whole site (which they are doing). There are no secret areas I don't want the bots to find (the secret areas are behind a Login so the bots won't see them).

      I have used rel=nofollow for internal links that point to my Login page.

      Is there any reason to include a generic robots.txt file that contains "user-agent: *"?

      I have a minor reason: to stop getting 404 errors and clean up my error logs so I can find other issues that may exist. But I'm wondering if not having a robots.txt file is the same as some default blank file (or 1-line file giving all bots all access)?

      1 Reply Last reply Reply Quote 0
      • kpaulin
        kpaulin last edited by

        Hi Mike...

        I am sure that you are always going to get a range of opinions to this kind of question.

        I think that for your site the answer may be simply that having a robots.txt file is more of a “belt and braces” safe harbour-type thing – the same goes for say whether you should have a keywords meta tag – many say these pieces of code can be of marginal value but, when you are competing head to head for a #1 listing (ie 35%+ of the clicks) then you should use every option and weapon possible ...furthermore, if your site is likely to grow significantly or eventually have content/files that you may want excluded, it’s just a “tidy” thing to have had indexed over time.

        Also, don’t forget that best practice robots.txt file taxonomy is to also include directions to your xml sitemap/s.

        Here is an example from one of our sites...

        User-agent: *
        Disallow: /design_examples.xml
        Disallow: /case_studies.xml

        User-agent: Googlebot-Image
        Disallow: /

        Sitemap: http://www.sitetopleveldomain.com/sitemap.xml

        In this example there are two root files specifically excluded from all bots and this site has also specifically excluded the Google Images bot as they were getting a lot of traffic from image searches and then subsequently seeing the same copyright images turn up on a hundred junk sites – this doesn’t stop image scraping but certainly reduces the ease of finding them.

        In relation to the  “or 1-line file giving all bots all access” part of your question...

        Some bots (most notably Google) now support an additional field called "Allow:"

        As the name suggests, "Allow:" lets you specifically indicate what files/folders CAN be crawled, excluding all others. However, this field is currently not part of the "robots.txt" protocol and so not universally supported, so my suggestion would be to test it for your site for a week, as it might confuse some less intelligent crawlers.

        So, in summary, my recommendation is to keep a simple robots.txt file, test if the Allow: field works for you and also ensure you have that guide to your xml sitemap – although wearing a belt and braces might not be a good look, at least your pants are unlikely to fall down 😉

        scanlin 1 Reply Last reply Reply Quote 1
        • scanlin
          scanlin @kpaulin last edited by

          Thanks, Keith. Makes sense.

          So how important is an xml sitemap for a 150 page site with clean navigation? As near as I can tell (from the site: command) my whole site is already being indexed by Google. Does a sitemap buy me anything? What happens if my sitemap is partial (ie if I forget to add new pages to it, but I do link to the new pages from my other indexed pages, then will the new pages get indexed)? I'm a little worried about sitemap maintenance as I add new blog entries and so on...

          KeriMorgret scanlin 2 Replies Last reply Reply Quote 0
          • KeriMorgret
            KeriMorgret @scanlin last edited by

            The phrase "blog entries" makes me ask are you on a CMS like Wordpress, or are the blog entries pages you are creating from scratch?

            If you're on WP or a CMS, you'll want a robots.txt so that your admin, plugin, and other directories aren't indexed. On the plus side, WP (and other CMSs) have plugins that will generate a sitemap.xml file you as you add pages.

            Google will find pages if you don't have a site map, or forget to add them. The sitemap is a way to let Google know what is out there, but it a) isn't required for Google to index a page and b) won't force Google to index a page.

            1 Reply Last reply Reply Quote 0
            • scanlin
              scanlin @scanlin last edited by

              Thanks, Keri. No, it's a hand-built blog. No CMS.

              I think the googlebot is doing a good job of indexing my site. The site is small and when I search for my content I do find it in google. I was pretty sure that google worked the way you describe. So it sounds like sitemaps are an optional hint, and perhaps not needed for relatively small sites (couple hundred pages of well linked content). Thanks.

              1 Reply Last reply Reply Quote 0
              • 1 / 1
              • First post
                Last post
              • One robots.txt file for multiple sites?
                RyanPurkey
                RyanPurkey
                0
                4
                1.1k

              • Robots txt. in page with 301 redirect
                LauraSultan
                LauraSultan
                0
                6
                3.1k

              • Do I need to block my cart page in robots.txt?
                Hutch42
                Hutch42
                0
                3
                2.4k

              • Should I block Map pages with robots.txt?
                imaginex
                imaginex
                0
                3
                73

              • How to use robots.txt to block areas on page?
                LauraHT
                LauraHT
                0
                8
                225

              • Block or remove pages using a robots.txt
                OlegKorneitchouk
                OlegKorneitchouk
                0
                2
                422

              • Mobile site: robots.txt best practices
                AlanMosley
                AlanMosley
                0
                2
                542

              • Robots.txt and robots meta
                TheEspresseo
                TheEspresseo
                0
                5
                1.1k

              Get started with Moz Pro!

              Unlock the power of advanced SEO tools and data-driven insights.

              Start my free trial
              Products
              • Moz Pro
              • Moz Local
              • Moz API
              • Moz Data
              • STAT
              • Product Updates
              Moz Solutions
              • SMB Solutions
              • Agency Solutions
              • Enterprise Solutions
              • Digital Marketers
              Free SEO Tools
              • Domain Authority Checker
              • Link Explorer
              • Keyword Explorer
              • Competitive Research
              • Brand Authority Checker
              • Local Citation Checker
              • MozBar Extension
              • MozCast
              Resources
              • Blog
              • SEO Learning Center
              • Help Hub
              • Beginner's Guide to SEO
              • How-to Guides
              • Moz Academy
              • API Docs
              About Moz
              • About
              • Team
              • Careers
              • Contact
              Why Moz
              • Case Studies
              • Testimonials
              Get Involved
              • Become an Affiliate
              • MozCon
              • Webinars
              • Practical Marketer Series
              • MozPod
              Connect with us

              Contact the Help team

              Join our newsletter
              Moz logo
              © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
              • Accessibility
              • Terms of Use
              • Privacy