The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Technical SEO Issues
    4. Does this robots.txt file look right?

    Does this robots.txt file look right?

    Technical SEO Issues
    8 3 188
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • Furious-D
      Furious-D last edited by

      This post is deleted!
      1 Reply Last reply Reply Quote 0
      • Martijn_Scheijbeler
        Martijn_Scheijbeler last edited by

        Hi Sean,

        Like you already said, I wouldn't recommend your current robots.txt as it would indeed block all files ending with .html. So I would go with your own robots.txt file with only the User Agent.

        Good luck!

        Furious-D 1 Reply Last reply Reply Quote 1
        • Furious-D
          Furious-D @Martijn_Scheijbeler last edited by

          Thank you, Martijn!

          1 Reply Last reply Reply Quote 0
          • ThompsonPaul
            ThompsonPaul last edited by

            Actually, while I agree this is not a worthwhile robots.txt file, it's not blocking all html files. In order to do that, it would need to be using the " ***** " wildcard, not the slash.

            I suspect this was a miguided attempt to use the robots.txt to deal with the .html version of your homepage. (Which needs to be done using redirects in the .htaccess file instead.)

            You do have an issue with your home page though.

            www.rfsystemlab.us has been 301-redirected to www.rfsystemlab.us/home.html. This is the opposite of what you want, unless you have some very unusual requirements that aren't visible from the site. You always want the raw domain www.rfsystemlab.us to resolve as your home page. The www.rfsystemlab.us/home.html should actually be redirected back to the primary domain URL, instead of vice versa as you have it now. (You also need the non-www version rfsystemlab.us to 301-redirect to the www.rfsystemlab.us as well.)

            It looks like this was a recent change as your primary URL is still indexed, but I'd get that corrected asap before you start messing up your home page rankings.

            Paul

            P.S. The most universally accepted robots file that doesn't block anything is

            User-agent: *
            Disallow:

            By having nothing after the disallow directive, it essentially means allow everything to be indexed. Google and Bing also recognize the Allow directive, even though it's not standard, so you could also use:

            User-agent: *
            Allow: /

            But both these do the same thing and the first is more standards compliant, so that's my preference. I also like to add in the path to my xml sitemap as standard as well, so the final file would be:

            User-agent: *
            Disallow:

            Sitemap: http://www.yoursite.com/path/to/sitemap.xml

            The advantage to having a default sitemap is it will help keep your error logs cleaner, since a call to a non-existent robots.txt will show as a constant error in your error logs, making it harder to spot the real errors.

            Furious-D ThompsonPaul 3 Replies Last reply Reply Quote 1
            • Furious-D
              Furious-D @ThompsonPaul last edited by

              Paul,

              Thank you for your answer.  I agree with everything you are saying.  I am not sure if we can setup the redirects properly.  The site is built on the SiteKreator.com platform.

              I will take your advice on the proper use or the robots.txt fil and also include a link to my xml sitemap.

              1 Reply Last reply Reply Quote 0
              • Furious-D
                Furious-D @ThompsonPaul last edited by

                Another question here regarding the raw domain:

                How crucial is it to have the bare domain version of the site redirected to the www version?

                ThompsonPaul 1 Reply Last reply Reply Quote 0
                • ThompsonPaul
                  ThompsonPaul @Furious-D last edited by

                  The problem with having both the www version and the non-www version resolving is that the search engines consider those to be two separate sites, F-D. Which means they can be considered duplicate content and competing against each other. And it means links to one don't count as links to the other.

                  So, absolute best practice is to pick one or the other as your primary, and redirect the secondary to it. (And any platform that calls it'self SEO-friendly should have that built in, or at least manually configurable)

                  That said, if there's no way you can get this done on the SiteKreator platform, there are a few things you can do to try to mitigate the problem: (in descending order of importance/ease)

                  1. Decide for yourself which is the primary version (in your case it should be the www version) and ONLY ever use that address when referencing your site. E.g. make sure any links you create use the full www version, whenever you type out your site address, use the full version etc.

                  2. Verify the two sites in Google Webmaster Tools - one for the www address and one for the non-www version (you have to create them as separate sites because  as mentioned, the SEs consider the two URLs to be separate sites) Once verified, use the Configuration -> Settings page to set the Preferred domain to the primary you have chosen in both sites.

                  3. If there's any way to insert canonical tags into the header of your pages, set   for the home page.

                  4. Occasionally check the incoming links to the non-www pages and if they're high-link-juice-value, see if you can get the linking site to update their link with the full URL.

                  Fortunately, at the moment you only have a couple dozen incoming links from a couple of root domains coming to the non-primary URLs so it's not a massive yet. The problem will build if others who you can't control link to you using the non-primary URL. Which is why best practice is to redirect to take care of even those instances.

                  Do the steps as listed above and you'll mitigate the problem as much as possible within the limitations of your platform.

                  Hope that all makes sense?

                  Paul

                  1 Reply Last reply Reply Quote 0
                  • ThompsonPaul
                    ThompsonPaul @ThompsonPaul last edited by

                    Thanks for the backup, Cyrus! Totally agree w/ you on the preference for using blank Disallow - as I mentioned about it being more standards-complaint.

                    Paul

                    1 Reply Last reply Reply Quote 0
                    • 1 / 1
                    • First post
                      Last post
                    • Have I constructed my robots.txt file correctly for sitemap autodiscovery?
                      Bedsite
                      Bedsite
                      0
                      4
                      231

                    • Meta Robots Noindex and Robots.txt File
                      Devanur-Rafi
                      Devanur-Rafi
                      0
                      2
                      125

                    • Robots.txt file
                      Asher
                      Asher
                      0
                      3
                      261

                    • Do i have my robots.txt file set up properly
                      ClaireH-184886
                      ClaireH-184886
                      1
                      4
                      319

                    • Is my robots.txt file working?
                      Resultify
                      Resultify
                      0
                      5
                      1.1k

                    • Does Bing ignore robots txt files?
                      Nightwing
                      Nightwing
                      0
                      3
                      2.8k

                    • Use of Robots.txt file on a job site
                      jennita
                      jennita
                      0
                      5
                      850

                    • Robots.txt and robots meta
                      TheEspresseo
                      TheEspresseo
                      0
                      5
                      1.1k

                    Get started with Moz Pro!

                    Unlock the power of advanced SEO tools and data-driven insights.

                    Start my free trial
                    Products
                    • Moz Pro
                    • Moz Local
                    • Moz API
                    • Moz Data
                    • STAT
                    • Product Updates
                    Moz Solutions
                    • SMB Solutions
                    • Agency Solutions
                    • Enterprise Solutions
                    • Digital Marketers
                    Free SEO Tools
                    • Domain Authority Checker
                    • Link Explorer
                    • Keyword Explorer
                    • Competitive Research
                    • Brand Authority Checker
                    • Local Citation Checker
                    • MozBar Extension
                    • MozCast
                    Resources
                    • Blog
                    • SEO Learning Center
                    • Help Hub
                    • Beginner's Guide to SEO
                    • How-to Guides
                    • Moz Academy
                    • API Docs
                    About Moz
                    • About
                    • Team
                    • Careers
                    • Contact
                    Why Moz
                    • Case Studies
                    • Testimonials
                    Get Involved
                    • Become an Affiliate
                    • MozCon
                    • Webinars
                    • Practical Marketer Series
                    • MozPod
                    Connect with us

                    Contact the Help team

                    Join our newsletter
                    Moz logo
                    © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
                    • Accessibility
                    • Terms of Use
                    • Privacy