The Moz Q&A Forum

    Removing robots.txt on WordPress site problem

    Technical SEO Issues
    • Wallander
      Wallander last edited by

      Hey Guys,

      Thanks for your replies. The domain is http://containerforsale.co.uk. My host told me to look in the public_html folder for the robots.txt file and just delete it, but I can't see it in there.

      My host said he found a tester site and it doesn't report any issues:

      http://www.searchenginepromotionhelp.com/m/robots-text-tester/robots-checker.php

      This is the display I get from http://containerforsale.co.uk/robots.txt

      User-agent *

      Disallow: /wp-admin/
      Disallow: /wp-includes/

      Sitemap: http://containerforsale.co.uk/sitemap.xml.gz

      • Copstead
        Copstead @Wallander last edited by

        Doesn't appear to be blocked, so maybe it has something to do with your /wp-includes/ directory.

        Change the robots.txt file to this:

        User-agent: *

        Disallow:

        Sitemap: http://containerforsale.co.uk/sitemap.xml.gz
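To see what the two rule sets discussed above would do, here is a minimal sketch using Python's standard-library `urllib.robotparser`. It is only an approximation: Google's own parser may behave differently, both rule sets are written here with the `User-agent: *` colon form, and the `/wp-includes/js/example.js` URL is a hypothetical path chosen for illustration.

```python
from urllib.robotparser import RobotFileParser


def build_parser(robots_txt):
    """Parse robots.txt text without fetching anything over the network."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


# The rules the site was serving (with the colon after User-agent).
ORIGINAL_RULES = """\
User-agent: *
Disallow: /wp-admin/
Disallow: /wp-includes/
Sitemap: http://containerforsale.co.uk/sitemap.xml.gz
"""

# The suggested replacement: an empty Disallow blocks nothing.
SUGGESTED_RULES = """\
User-agent: *
Disallow:
Sitemap: http://containerforsale.co.uk/sitemap.xml.gz
"""

HOME_URL = "http://containerforsale.co.uk/"
# Hypothetical URL under /wp-includes/, used only for illustration.
INCLUDE_URL = "http://containerforsale.co.uk/wp-includes/js/example.js"

original = build_parser(ORIGINAL_RULES)
suggested = build_parser(SUGGESTED_RULES)

for label, parser in (("original", original), ("suggested", suggested)):
    print(
        label,
        "| home allowed:", parser.can_fetch("Googlebot", HOME_URL),
        "| include allowed:", parser.can_fetch("Googlebot", INCLUDE_URL),
    )
```

Under these assumptions the homepage is crawlable in both cases; only URLs under the disallowed directories differ between the two rule sets.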

        • Wallander
          Wallander @Copstead last edited by

          It's weird: the front-page robots warning in Google Webmaster Tools has disappeared now, but I still get the warnings in the sitemap submission area. My host suggests I just wait a bit longer for Google to update because he said the same as you, that there doesn't seem to be any robots.txt file.

          • Copstead
            Copstead @Copstead last edited by

            Well there is a robots.txt file.  You can view it here: http://containerforsale.co.uk/robots.txt

            What warnings are you getting in your sitemap submission area?  It appears to look alright: http://containerforsale.co.uk/sitemap.xml  But I tried to validate it and got a 504 Gateway Time-out error. http://www.xml-sitemaps.com/index.php?op=validate-xml-sitemap&go=1&sitemapurl=http%3A%2F%2Fcontainerforsale.co.uk%2Fsitemap.xml&submit=Validate

            • Wallander
              Wallander @Copstead last edited by

              Thanks for the heads up.

              The warning just says "7 URLs blocked by robots.txt". I have seen this issue posted on the WordPress boards by others, but with no real insight into solutions.

              Perhaps I should try your suggestion and change the robots.txt file to this:

              User-agent: *

              Disallow:

              Sitemap: http://containerforsale.co.uk/sitemap.xml.gz

              • Copstead
                Copstead @Copstead last edited by

                Yes, the URLs being blocked are include files from your WordPress installation.

                • Wallander
                  Wallander @Copstead last edited by

                  OK, thanks Brent. I changed it to:

                  User-agent: *

                  Disallow:

                  Sitemap: http://containerforsale.co.uk/sitemap.xml.gz

                  Guess I will just have to wait for Google to refresh now...

                  • evolvingSEO
                    evolvingSEO last edited by

                    Thanks guys for all the responses and helping!

                    Three things to try:

                    1. Fix robots.txt

                    Sofia - I just checked your robots.txt now and it reads:

                    User-agent: *
                    
                    Disallow: Sitemap: http://containerforsale.co.uk/sitemap.xml.gz
                    
                    • The Sitemap directive is on the same line as Disallow. I'd check on that and make sure it's on a separate line.
                    • Also, you don't need the .gz on the sitemap file; just sitemap.xml will do.

                    2. Re-submit Sitemap

                    • Resubmit your sitemap to Webmaster Tools and make sure it's valid.

                    3. Submit URL to Webmaster Tools (last resort only)

                    This is a last resort only; you shouldn't have to do this for the homepage if everything else is correct.

                    • Go to Fetch as Googlebot -> run the fetch -> then submit the URL.
                    • Do this for the homepage.
                    • See the article on the Google blog for reference.
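The first check in this list (one directive per line) can be sketched as a small lint function in Python. This is an illustrative helper, not part of any SEO tool: the name `find_fused_lines` and the short list of directive names are assumptions made for the example.

```python
import re

# Directive names checked here; an illustrative subset, not a full specification.
DIRECTIVE = re.compile(
    r"\b(user-agent|disallow|allow|sitemap|crawl-delay)\s*:", re.IGNORECASE
)


def find_fused_lines(robots_txt):
    """Return (line_number, line) for lines that hold more than one directive."""
    fused = []
    for number, line in enumerate(robots_txt.splitlines(), start=1):
        content = line.split("#", 1)[0]  # ignore trailing comments
        if len(DIRECTIVE.findall(content)) > 1:
            fused.append((number, line.strip()))
    return fused


# The file as it read when checked: Sitemap fused onto the Disallow line.
BROKEN = """\
User-agent: *
Disallow: Sitemap: http://containerforsale.co.uk/sitemap.xml.gz
"""

# The same directives, one per line.
FIXED = """\
User-agent: *
Disallow:
Sitemap: http://containerforsale.co.uk/sitemap.xml.gz
"""

print(find_fused_lines(BROKEN))
print(find_fused_lines(FIXED))
```

A fused line matters because a parser that reads `Disallow:` first may treat the rest of the line, including `Sitemap: ...`, as the path to disallow rather than as a separate directive.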

                    Let us know if you're all set, thanks!

                    -Dan

                    • Wallander
                      Wallander @evolvingSEO last edited by

                      Indeed, thanks everyone - it's really appreciated!

                      I have updated the robots.txt as indicated and resubmitted the sitemap, but it looks like Google still has problems with my site, since the robots error warning is still there after processing is done.

                      Quick question - I am using a plugin called Google XML Sitemaps which has the following tick box option.

                      'Add sitemap URL to the virtual robots.txt file.
                      The virtual robots.txt generated by WordPress is used. A real robots.txt file must NOT exist in the blog directory!'

                      Should this box be ticked or un-ticked, please? FYI, I currently don't have the box ticked.

                      • evolvingSEO
                        evolvingSEO @Wallander last edited by

                        Sofia

                        You are using Yoast SEO plugin for WordPress, so use the XML sitemap within Yoast. You don't need a separate plugin for the XML sitemap. And yes, within Yoast turn the sitemap on.

                        Hope that helps!

                        -Dan

                        • Wallander
                          Wallander @Wallander last edited by

                          Hi Dan,

                          I followed the above advice and switched to the Yoast generated sitemap but after testing on http://www.xml-sitemaps.com/validate-xml-sitemap.html I got the following result - no idea what it means but it looks nasty...

                          Schema validating with XSV 3.1-1 of 2007/12/11 16:20:05
                          Schema validator crashed

                          The maintainers of XSV will be notified, you don't need to
                          send mail about this unless you have extra information to provide.
                          If there are Schema errors reported below, try correcting
                          them and re-running the validation.

                          Target: http://containerforsale.co.uk
                             (Real name: http://containerforsale.co.uk
                              Server: Apache/2.2.22 (Unix) mod_ssl/2.2.22 OpenSSL/0.9.8e-fips-rhel5 mod_bwlimited/1.4)

                          The target was not assessed

                          Low-level XML well-formedness and/or validity processing output
                          Warning: Undefined entity raquo
                           in unnamed entity at line 16 char 83 of http://containerforsale.co.uk
                          Warning: Undefined entity nbsp
                           in unnamed entity at line 160 char 10 of http://containerforsale.co.uk
                          Error: Expected ; after entity name, but got =
                           in unnamed entity at line 274 char 631 of http://containerforsale.co.u

                          • evolvingSEO
                            evolvingSEO @Wallander last edited by

                            Hi Sofia

                            I just ran the same validator on your sitemap and it went through fine - see screenshot

                            What I meant was that you should just be sure Google Webmaster Tools accepts the sitemap as valid. If it does, there's no need to run it through a third-party validator. Apologies if I didn't state it clearly!

                            Let me know, but from what I can see it looks good!

                            -Dan

                            EDIT - Looking more closely, it looks like you ran the homepage through the validator. You would actually enter the sitemap address itself in the validator: http://containerforsale.co.uk/sitemap.xml
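The difference between validating the homepage and validating the sitemap can be reproduced with Python's standard-library XML parser. This is a minimal sketch: both sample documents below are hypothetical stand-ins (not the site's real content), and `sitemap_urls` is a name made up for the example. An HTML page that uses entities such as `&raquo;` is not well-formed XML, which is the kind of failure the validator output above reports.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = "{http://www.sitemaps.org/schemas/sitemap/0.9}"


def sitemap_urls(xml_text):
    """Return the <loc> values of a well-formed sitemap; raise otherwise."""
    root = ET.fromstring(xml_text)  # raises ET.ParseError if not well-formed
    if root.tag not in (SITEMAP_NS + "urlset", SITEMAP_NS + "sitemapindex"):
        raise ValueError(f"not a sitemap: root element is {root.tag}")
    return [loc.text.strip() for loc in root.iter(SITEMAP_NS + "loc")]


# Hypothetical one-entry sitemap, for illustration only.
SAMPLE_SITEMAP = (
    '<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">'
    "<url><loc>http://containerforsale.co.uk/</loc></url>"
    "</urlset>"
)

# Hypothetical HTML fragment using an HTML-only entity, like a homepage would.
SAMPLE_HOMEPAGE = "<html><head><title>Home &raquo; Containers</title></head></html>"

print(sitemap_urls(SAMPLE_SITEMAP))

try:
    sitemap_urls(SAMPLE_HOMEPAGE)
except ET.ParseError as error:
    print("homepage is not well-formed XML:", error)
```

This only checks well-formedness and the root element; it does not validate against the full sitemap schema the way a dedicated validator or Webmaster Tools would.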

                            • Wallander
                              Wallander @Wallander last edited by

                              Dan,

                              Can't thank you enough! The sitemap request is still pending in Google (maybe I sent too many requests), but it's time to sit back and wait for the good news, hopefully. Thanks again.

                              • Wallander
                                Wallander last edited by

                                Quick update: by amending the robots.txt file and switching the sitemap plugin over to Yoast, I finally got the sitemap to index without robots.txt warnings, although the home page of the site was not indexed, 'oh dear'. 5 out of the 7 pages in the sitemap were indexed by Google, so it's a start, but there is some more investigating to be done on my side.

                                • evolvingSEO
                                  evolvingSEO @Wallander last edited by

                                  Hi Sofia

                                  I just checked and see your homepage indexed in google.co.uk with a cache date of April 26th. You should be all set!

                                  -Dan

                                  • CMcCrone
                                    CMcCrone last edited by

                                    I can help you out as this issue DROVE ME NUTS.

                                    1. I didn't have a robots.txt (yet)

                                    2. I had Yoast installed

                                    3. I'm pretty sure it created a robots.txt even though it doesn't exist in my root (.com/here)

                                    4. My Google webmaster tools shows this

                                    User-agent:
                                    Disallow: /wp-admin/
                                    Disallow: /wp-includes/
                                    Disallow: /cgi-bin
                                    Disallow: /wp-admin
                                    Disallow: /wp-includes
                                    Disallow: /wp-content/plugins
                                    Disallow: /plugins
                                    Disallow: /wp-content/cache
                                    Disallow: /wp-content/themes
                                    Disallow: /trackback
                                    Disallow: /feed
                                    Disallow: /comments
                                    Disallow: /category//*
                                    Disallow: /trackback
                                    Disallow: /feed
                                    Disallow: /comments
                                    Disallow: /?
                                    Disallow: /?
                                    Allow: /wp-content/uploads
                                    Allow: /assets

                                    Create a robots.txt

                                    1. Log in to WordPress
                                    2. Click SEO in your side toolbar (Yoast WordPress Plugin settings)
                                    3. Go to Edit Files under SEO (in the side toolbar)

                                    And now you have the option to edit your Robots.txt file.

                                    • nextlevelweb
                                      nextlevelweb last edited by

                                      Hi,

                                      I edited the robots.txt file for my website http://debtfreefrombankruptcy.com yesterday to allow search engines to crawl my site. However, Google isn't recognizing the new file and is still saying that my sitemap is blocked from search. Here is a link to the file itself:

                                      http://www.debtfreefrombankruptcy.com/robots.txt

                                      The Blocked URLs tester said that the file allows Google to crawl the site, but in actuality it still isn't recognizing the new file. Any advice would be appreciated. Thanks!
