The Moz Q&A Forum

    Robots.txt gone wild

    Intermediate & Advanced SEO
    • wearehappymedia

      Hi guys, a site we manage, http://hhhhappy.com, received an alert through Webmaster Tools yesterday saying that it can't be crawled. No changes were made to the site.

      I don't know a huge amount about robots.txt configuration, except that Yoast by default sets it to block crawling of the wp-admin folder and nothing else. I checked this against all our other sites and the settings are the same. Yet 12 hours after the issue appeared, Happy is still not being crawled and its metadata is not showing in search results. Any ideas what may have triggered this?

      • wearehappymedia

        Our host has just offered this response which does not get me any closer:

        Hi Radi,

        It looks like your site has its own robots.txt file, which is not blocking any user agents. The only thing it's doing is blocking bots from indexing your admin area:

        <code>User-agent: *
        Disallow: /wp-admin/</code> 
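
One way to double-check the host's reading of these two rules is to feed them to a robots.txt parser. The following is a minimal sketch using Python's standard `urllib.robotparser`; the helper name `can_crawl` and the sample paths are assumptions made for illustration, and Google's own parser may differ in edge cases.

```python
from urllib.robotparser import RobotFileParser

# The two rules quoted by the host, copied verbatim.
ROBOTS_TXT = """\
User-agent: *
Disallow: /wp-admin/
"""


def can_crawl(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if robots_txt lets user_agent fetch url (hypothetical helper)."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # Sample paths are illustrative only.
    for path in ("/", "/wp-admin/", "/wp-admin/options.php"):
        url = "http://hhhhappy.com" + path
        print(path, can_crawl(ROBOTS_TXT, "Googlebot", url))
```

If the homepage comes back as allowed and only the admin paths are blocked, the file itself is unlikely to be what triggered the alert.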
        

        This is a standard robots.txt file, and you shouldn't be having any issues with Google indexing your site from a hosting standpoint. To test this, I curled the site as Googlebot and received a 200 OK response:

        <code>curl -A "Googlebot/2.1" -IL http://hhhhappy.com
        HTTP/1.1 200 OK
        Date: Sat, 05 Mar 2016 22:17:26 GMT
        Content-Type: text/html; charset=UTF-8
        Connection: keep-alive
        Set-Cookie: __cfduid=d3177a1baa04623fb2573870f1d4b4bac1457216246; expires=Sun, 05-Mar-17 22:17:26 GMT; path=/; domain=.hhhhappy.com; HttpOnly
        X-Cacheable: bot
        Cache-Control: max-age=10800, must-revalidate
        X-Cache: HIT: 17
        X-Cache-Group: bot
        X-Pingback: http://hhhhappy.com/xmlrpc.php
        Link: <http://hhhhappy.com/>; rel=shortlink
        Expires: Thu, 19 Nov 1981 08:52:00 GMT
        X-Type: default
        X-Pass-Why:
        Set-Cookie: X-Mapping-fjhppofk=2C42B261F74DA203D392B5EC5BF07833; path=/
        Server: cloudflare-nginx
        CF-RAY: 27f0f02445920f09-IAD</code> 
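
A header dump like the one above can also be scanned for the two header-level signals that commonly stop indexing: an error status and an `X-Robots-Tag` header. This is a rough sketch, not a full HTTP parser; the function name `crawl_blockers` is hypothetical, and it assumes a single response block (with `-L`, curl may print one block per redirect hop).

```python
import re


def crawl_blockers(raw_response: str) -> list[str]:
    """List header-level reasons a crawler might skip a page (hypothetical helper)."""
    lines = [ln.strip() for ln in raw_response.strip().splitlines() if ln.strip()]
    status_line, header_lines = lines[0], lines[1:]
    problems = []

    # Status line looks like "HTTP/1.1 200 OK"; the second token is the code.
    status = int(status_line.split()[1])
    if status >= 400:
        problems.append(f"status {status}")

    for line in header_lines:
        name, _, value = line.partition(":")
        if name.strip().lower() != "x-robots-tag":
            continue
        # Split on commas and colons so "googlebot: noindex" is also caught.
        directives = {d.strip().lower() for d in re.split(r"[,:]", value)}
        if directives & {"noindex", "none"}:
            problems.append(f"X-Robots-Tag: {value.strip()}")
    return problems
```

An empty list only means the headers look clean; it says nothing about robots meta tags in the page body.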
        

        I didn't see any plugins on your site that looked like they would overwrite robots.txt, but I urge you to take another look at them, and then dive into your site's settings for the meta value that Googlebot would pick up. Everything on our end seems to be giving the green light.
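
The "meta value" mentioned above is presumably the robots meta tag in the page's HTML. A minimal sketch of checking for it with Python's standard `html.parser` follows; the class and function names are made up for illustration, and the HTML would need to be fetched separately.

```python
from html.parser import HTMLParser


class RobotsMetaFinder(HTMLParser):
    """Collect directives from <meta name="robots"> and <meta name="googlebot"> tags."""

    def __init__(self):
        super().__init__()
        self.directives = []

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = {key: (value or "") for key, value in attrs}
        if attr.get("name", "").lower() in ("robots", "googlebot"):
            content = attr.get("content", "")
            self.directives += [d.strip().lower() for d in content.split(",") if d.strip()]


def meta_robots(html: str) -> list[str]:
    """Return every robots directive found in the page's meta tags (hypothetical helper)."""
    finder = RobotsMetaFinder()
    finder.feed(html)
    return finder.directives
```

A `noindex` in the result would be worth tracing back to the site's settings, for example the WordPress "Discourage search engines from indexing this site" option under Settings > Reading.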

        Please let us know if you have any other questions or issues in the meantime.

        Cheers,

        • MattAntonino

          Are you getting the message in Search Console that there were errors crawling your page?

          This typically means that your host was temporarily down when Google landed on your page. These types of things happen all the time and are no big deal.

          Your homepage cache shows a crawl date of today so I'm assuming things are working properly ... if you really want to find out, try doing a "Fetch" of your site in Search Console.

          Crawl > Fetch as Google > Fetch (big red button)

          You should get a status of "Complete." If you get anything else there should be an error message with it. If so, paste that here.

          I have checked the site headers, cache, crawlability with Screaming Frog, and everything is fine. This seems like one of those temporary messages but if the problem persists definitely let us know!

          • Martijn_Scheijbeler

            Have you checked whether the site had any downtime recently? If Google isn't able to reach your robots.txt file, it will temporarily stop crawling your site.
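
To make that concrete, here is a rough sketch of how a crawler might react to the HTTP status it gets when requesting robots.txt. The mapping reflects my understanding of Google's published guidance (server errors and 429 pause crawling, other 4xx responses are treated as if no robots.txt exists); the function name and return labels are assumptions for illustration.

```python
def robots_fetch_policy(status: int) -> str:
    """Map the HTTP status of a robots.txt request to a crawl decision (illustrative only)."""
    if 200 <= status < 300:
        return "parse"            # file was served: apply its rules
    if status == 429 or status >= 500:
        return "pause"            # no definite answer: hold off crawling for now
    if 400 <= status < 500:
        return "allow_all"        # treated as if there is no robots.txt
    return "follow_redirect"      # 3xx: follow the redirect, then re-evaluate


if __name__ == "__main__":
    for code in (200, 301, 404, 429, 503):
        print(code, robots_fetch_policy(code))
```

Under this model, a short outage that returns 5xx for robots.txt would be enough to produce a "can't be crawled" alert even though the file's contents never changed.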

            • MattRoney

              Hi Radi!

              Have Matt and/or Martijn answered your question? If so, please mark one or both of their responses "Good Answer." 🙂

              Otherwise, what's still tripping you up?

              © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.