The Moz Q&A Forum

    How to allow bots to crawl all but WP-content

    Technical SEO Issues
    • Tom3_15
      Tom3_15 last edited by

      Hello,

      I would like my website to remain crawlable to bots, but to block my wp content and media. Does the following robots.txt work? I worry that the * user agent may conflict with the others.

      User-agent: *
      Disallow: /wp-admin/
      Disallow: /wp-includes/
      Disallow: /wp-content/

      User-agent: GoogleBot
      Allow: /

      User-agent: GoogleBot-Mobile
      Allow: /

      User-agent: GoogleBot-Image
      Allow: /

      User-agent: Bingbot
      Allow: /

      User-agent: Slurp
      Allow: /
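On the worry about the * group conflicting with the named groups: as far as the Robots Exclusion Protocol (RFC 9309) goes, a crawler follows only the group that best matches its name and falls back to * only when no named group matches. A minimal sketch with Python's standard-library urllib.robotparser illustrates this. It is Python's parser, not any search engine's, so treat the result as indicative only; example.com, the file path and SomeOtherBot are made-up placeholders, and the file is abbreviated to two of the named groups.

```python
from urllib.robotparser import RobotFileParser

# The robots.txt from the question, abbreviated to two of the named groups.
ROBOTS_TXT = """\
User-agent: *
Disallow: /wp-admin/
Disallow: /wp-includes/
Disallow: /wp-content/

User-agent: GoogleBot
Allow: /

User-agent: Bingbot
Allow: /
"""


def build_parser(robots_txt: str) -> RobotFileParser:
    """Parse robots.txt text directly, without fetching anything."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# Hypothetical URL inside the folder the * group disallows.
url = "https://example.com/wp-content/uploads/report.txt"

# A bot with its own group is judged by that group alone.
googlebot_allowed = parser.can_fetch("Googlebot", url)
# A bot without its own group falls back to the * group.
other_allowed = parser.can_fetch("SomeOtherBot", url)

print(googlebot_allowed, other_allowed)  # expected: True False
```

If real crawlers behave the same way, the named bots in the file above would be free to fetch /wp-content/, which is the opposite of the goal.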

      • GastonRiera
        GastonRiera last edited by

        Hi Tom!

        That robots.txt config is pretty redundant.
        To achieve what you want, try this:

        User-agent: * 
        Disallow: /wp-admin/ 
        Disallow: /wp-includes/
        Disallow: /wp-content/
        Allow: *.js
        Allow: *.css

        Just 3 things to note here:
        1- That User-agent: * group and its disallows block every bot from crawling what's in those folders.
        2- When blocking /wp-content/ you are also blocking the /themes/ folder, which contains the .js and .css files. Blocking those files prevents Googlebot from rendering the page correctly, so it sees something different from what a normal user would see.
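        3- Those Allow: / groups don't do what you want: a bot that has its own group (Googlebot, Bingbot, Slurp) follows only that group and ignores the * group, so the disallows would not apply to it.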
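One caveat worth testing before relying on the short Allow patterns: under the longest-match rule described in RFC 9309 and in Google's robots.txt documentation, the rule with the longest matching path pattern wins, and Allow wins only on a tie. That would let Disallow: /wp-content/ beat a much shorter pattern such as Allow: *.css. The sketch below is a simplified model of that rule for a single user-agent group, not Google's actual parser; the function names and the theme path are made up for illustration.

```python
import re


def pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern ('*' wildcard, optional trailing '$')."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(rules, path: str) -> bool:
    """Evaluate one user-agent group.

    rules is a list of (directive, pattern) pairs. The longest matching
    pattern wins; on equal length Allow wins; no match means allowed.
    """
    best_len, allowed = -1, True
    for directive, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            is_allow = directive.lower() == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len, allowed = len(pattern), is_allow
    return allowed


DISALLOWS = [
    ("Disallow", "/wp-admin/"),
    ("Disallow", "/wp-includes/"),
    ("Disallow", "/wp-content/"),
]
# Short patterns, as in the config above.
SHORT_RULES = DISALLOWS + [("Allow", "*.js"), ("Allow", "*.css")]
# Longer patterns (an assumption, not from the thread) that outrank the disallow.
LONG_RULES = DISALLOWS + [("Allow", "/wp-content/*.js"), ("Allow", "/wp-content/*.css")]

CSS_PATH = "/wp-content/themes/mytheme/style.css"  # hypothetical theme file

print(is_allowed(SHORT_RULES, CSS_PATH))     # False: the 12-character disallow is longer
print(is_allowed(LONG_RULES, CSS_PATH))      # True: the 17-character allow is longer
print(is_allowed(SHORT_RULES, "/name.css"))  # True: not under /wp-content/ at all
```

If this model matches how a given crawler behaves, a bare file name such as name.css would pass a tester simply because it is not under /wp-content/, so it is safer to test with a full path to a real theme file.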

        To test that configuration, you can use the robots.txt Tester in Search Console, just under the Crawl menu.

        Remember that by default Google assumes you are not blocking anything.
        More info here: The web robots.txt page

        Hope it helps.
        Best luck.
        GR

        • Tom3_15
          Tom3_15 @GastonRiera last edited by

          Thank you for the response. I'm still a little uncertain: does the version you wrote allow the bots to crawl the CSS and JS as well?

          Best

          • GastonRiera
            GastonRiera @Tom3_15 last edited by

            Yes, it does.

            As I said earlier, copy and paste that code into the robots.txt Tester in any of your Search Console properties and try it with some name.css or testing.js, just for testing.
            Check the image I've attached.

            Hope it helps.
            Best luck
            GR

            [Attached image]

            • Tom3_15
              Tom3_15 @GastonRiera last edited by

              Awesome. Thanks, Gaston!

              • Tom3_15
                Tom3_15 @GastonRiera last edited by

                Hi Gaston,

                I just wanted to follow up with one last question, if possible. Would this still allow my images and PDFs to be crawled and indexed?

                Thanks!

                • GastonRiera
                  GastonRiera @Tom3_15 last edited by

                  Hi Tom,

                  Yes, this config will allow images to be crawled. (original answer, later corrected)

                  No, this config will block images from being crawled, as long as your WordPress uses the default folder for images: /wp-content/uploads/year/month/image-name.png

                  An easy way to find out where your images are stored: go to a page on your site that shows an image, right-click it and choose Copy link address. That link will show you the folder structure.
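The same check can be scripted once you have copied an image address. A small sketch, assuming the default WordPress uploads prefix mentioned in this thread; the function name and the example URLs are hypothetical.

```python
from urllib.parse import urlparse

# Default WordPress uploads prefix, as described in the thread.
UPLOADS_PREFIX = "/wp-content/uploads/"


def is_under_uploads(image_url: str, prefix: str = UPLOADS_PREFIX) -> bool:
    """Return True if the URL's path starts with the uploads prefix."""
    return urlparse(image_url).path.startswith(prefix)


# Hypothetical addresses copied from a page.
print(is_under_uploads("https://example.com/wp-content/uploads/2017/05/photo.png"))
print(is_under_uploads("https://example.com/images/photo.png"))
```

Any image for which this returns True sits inside the folder that Disallow: /wp-content/ covers.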

                  Hope it helps.
                  Best luck.
                  GR

                  • Tom3_15
                    Tom3_15 @GastonRiera last edited by

                    Gaston,

                    Thanks for the fast reply! My images folder does follow that format, which is what worries me, as we are blocking the wp-content folder.

                    Thanks!

                    • GastonRiera
                      GastonRiera @Tom3_15 last edited by

                      Oh god, my mistake!
                      I'm deeply sorry: yes, this configuration will block images that follow that folder structure!

                      I'll correct myself.
                      Thanks for pointing it out!

                      • Tom3_15
                        Tom3_15 @GastonRiera last edited by

                        Thanks, Gaston. I should have been more clear about what I am looking to do. I currently am having an indexation issue. Somehow, pages are being automatically generated by WordPress.

                        These pages are often .txt files of information or code from plugins, all beginning with /wp-content/uploads/ in their URL. I have been manually removing them from the index and would like to now have them be uncrawlable.

                        Best

                        • Tom3_15
                          Tom3_15 @Tom3_15 last edited by

                          Can I do so with:

                          Allow: *.jpg

                          Allow: *.png

                          • GastonRiera
                            GastonRiera @Tom3_15 last edited by

                            Yep, with that you are allowing every file ending with those extensions.

                            • Tom3_15
                              Tom3_15 @GastonRiera last edited by

                              Thank you for the help, Gaston!
