The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. On-Page / Site Optimization
    4. How do you block development servers with robots.txt?

    How do you block development servers with robots.txt?

    On-Page / Site Optimization
    7 6 5.2k
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • DisMedia
      DisMedia last edited by

      When we create client websites the urls are client.oursite.com.  Google is indexing theses sites and attaching to our domain.  How can we stop it with robots.txt?  I've heard you need to have the robots file on both the main site and the dev sites... A code sample would be groovy.  Thanks, TR

      1 Reply Last reply Reply Quote 0
      • SteveOllington
        SteveOllington last edited by

        Just use a directory instead of a sub-domain and then block that directory... that's the easiest way.

        DisMedia 1 Reply Last reply Reply Quote 0
        • DisMedia
          DisMedia @SteveOllington last edited by

          Unfortunately I don't have that option.

          1 Reply Last reply Reply Quote 0
          • abtain
            abtain last edited by

            I put a .htaccess style password on the development site. If you make a robots.txt to block the site, make sure you don't accidentally put that on the production site.

            1 Reply Last reply Reply Quote 1
            • roimediaworks
              roimediaworks last edited by

              Like Daniel said you can use robots.txt to block spiders, but this won't guarantee exclusion of URLs showing up in search results. You could use x-robots-tag in the server headers. Generate a 403 every time user-agent hits the sub domain.

              1 Reply Last reply Reply Quote 1
              • KeriMorgret
                KeriMorgret last edited by

                On the root of the development subdomain, use the following robots.txt content to block all robots.

                User-agent: *
                Disallow: /

                Next, verify the subdomain in Google Webmaster Tools as its own site, and request that that site be removed from the index.

                For added protection:

                • Make the robots.txt on the live site read only, so when you copy the dev site over you don't accidentally copy over the robots.txt saying to exclude everything
                • Set up a code monitor on the robots.txt for both the dev site and the live site that checks the content of those files and alerts you if there are changes. I use https://polepositionweb.com/roi/codemonitor/index.php.
                1 Reply Last reply Reply Quote 2
                • JustinTaylor88
                  JustinTaylor88 last edited by

                  Added X robots tag into our headers on our development sites.

                  Just a note - if you use apache and have mod_pagespeed installed , it wall conflict and pagespeed will remove the X robots tag.

                  Begin Bad Bot Blocking

                  BrowserMatchNoCase Googlebot bad_bot
                  BrowserMatchNoCase bingbot bad_bot
                  BrowserMatchNoCase OmniExplorer_Bot/6.11.1 bad_bot
                  BrowserMatchNoCase omniexplorer_bot bad_bot
                  BrowserMatchNoCase Baiduspider bad_bot
                  BrowserMatchNoCase Baiduspider/2.0 bad_bot
                  BrowserMatchNoCase yandex bad_bot
                  BrowserMatchNoCase yandeximages bad_bot
                  BrowserMatchNoCase Spinn3r bad_bot
                  BrowserMatchNoCase sogou bad_bot
                  BrowserMatchNoCase Sogouwebspider/3.0 bad_bot
                  BrowserMatchNoCase Sogouwebspider/4.0 bad_bot
                  BrowserMatchNoCase sosospider+ bad_bot
                  BrowserMatchNoCase jikespider bad_bot
                  BrowserMatchNoCase ia_archiver bad_bot
                  BrowserMatchNoCase PaperLiBot bad_bot
                  BrowserMatchNoCase ahrefsbot bad_bot
                  BrowserMatchNoCase ahrefsbot/1.0 bad_bot
                  BrowserMatchNoCase SiteBot/0.1 bad_bot
                  BrowserMatchNoCase DNS-Digger/1.0 bad_bot
                  BrowserMatchNoCase DNS-Digger-Explorer/1.0 bad_bot
                  BrowserMatchNoCase boardreader bad_bot
                  BrowserMatchNoCase radian6 bad_bot
                  BrowserMatchNoCase R6_FeedFetcher bad_bot
                  BrowserMatchNoCase R6_CommentReader bad_bot
                  BrowserMatchNoCase ScoutJet bad_bot
                  BrowserMatchNoCase ezooms bad_bot
                  BrowserMatchNoCase CC-rget/5.818 bad_bot
                  BrowserMatchNoCase libwww-perl/5.813 bad_bot
                  BrowserMatchNoCase magpie-crawler 1.1 bad_bot
                  BrowserMatchNoCase jakarta bad_bot
                  BrowserMatchNoCase discobot/1.0 bad_bot
                  BrowserMatchNoCase MJ12bot bad_bot
                  BrowserMatchNoCase MJ12bot/v1.2.0 bad_bot
                  BrowserMatchNoCase MJ12bot/v1.2.5 bad_bot
                  BrowserMatchNoCase SemrushBot/0.9 bad_bot
                  BrowserMatchNoCase MLBot bad_bot
                  BrowserMatchNoCase butterfly bad_bot
                  BrowserMatchNoCase SeznamBot/3.0 bad_bot
                  BrowserMatchNoCase HuaweiSymantecSpider bad_bot
                  BrowserMatchNoCase Exabot/2.0 bad_bot
                  BrowserMatchNoCase netseer/0.1 bad_bot
                  BrowserMatchNoCase NetSeer crawler/2.0 bad_bot
                  BrowserMatchNoCase NetSeer/Nutch-0.9 bad_bot
                  BrowserMatchNoCase psbot/0.1 bad_bot
                  BrowserMatchNoCase Moreoverbot/x.00 bad_bot
                  BrowserMatchNoCase moreoverbot/5.0 bad_bot
                  BrowserMatchNoCase Jakarta Commons-HttpClient/3.0 bad_bot
                  BrowserMatchNoCase SocialSpider-Finder/0.2 bad_bot
                  BrowserMatchNoCase MaxPointCrawler/Nutch-1.1 bad_bot
                  BrowserMatchNoCase willow bad_bot
                  Order Deny,Allow
                  Deny from env=bad_bot

                  End Bad Bot Blocking

                  Header set X-Robots-Tag "noindex, nofollow"

                  Begin Bad Bot Blocking

                  BrowserMatchNoCase Googlebot bad_bot
                  BrowserMatchNoCase bingbot bad_bot
                  BrowserMatchNoCase OmniExplorer_Bot/6.11.1 bad_bot
                  BrowserMatchNoCase omniexplorer_bot bad_bot
                  BrowserMatchNoCase Baiduspider bad_bot
                  BrowserMatchNoCase Baiduspider/2.0 bad_bot
                  BrowserMatchNoCase yandex bad_bot
                  BrowserMatchNoCase yandeximages bad_bot
                  BrowserMatchNoCase Spinn3r bad_bot
                  BrowserMatchNoCase sogou bad_bot
                  BrowserMatchNoCase Sogouwebspider/3.0 bad_bot
                  BrowserMatchNoCase Sogouwebspider/4.0 bad_bot
                  BrowserMatchNoCase sosospider+ bad_bot
                  BrowserMatchNoCase jikespider bad_bot
                  BrowserMatchNoCase ia_archiver bad_bot
                  BrowserMatchNoCase PaperLiBot bad_bot
                  BrowserMatchNoCase ahrefsbot bad_bot
                  BrowserMatchNoCase ahrefsbot/1.0 bad_bot
                  BrowserMatchNoCase SiteBot/0.1 bad_bot
                  BrowserMatchNoCase DNS-Digger/1.0 bad_bot
                  BrowserMatchNoCase DNS-Digger-Explorer/1.0 bad_bot
                  BrowserMatchNoCase boardreader bad_bot
                  BrowserMatchNoCase radian6 bad_bot
                  BrowserMatchNoCase R6_FeedFetcher bad_bot
                  BrowserMatchNoCase R6_CommentReader bad_bot
                  BrowserMatchNoCase ScoutJet bad_bot
                  BrowserMatchNoCase ezooms bad_bot
                  BrowserMatchNoCase CC-rget/5.818 bad_bot
                  BrowserMatchNoCase libwww-perl/5.813 bad_bot
                  BrowserMatchNoCase magpie-crawler 1.1 bad_bot
                  BrowserMatchNoCase jakarta bad_bot
                  BrowserMatchNoCase discobot/1.0 bad_bot
                  BrowserMatchNoCase MJ12bot bad_bot
                  BrowserMatchNoCase MJ12bot/v1.2.0 bad_bot
                  BrowserMatchNoCase MJ12bot/v1.2.5 bad_bot
                  BrowserMatchNoCase SemrushBot/0.9 bad_bot
                  BrowserMatchNoCase MLBot bad_bot
                  BrowserMatchNoCase butterfly bad_bot
                  BrowserMatchNoCase SeznamBot/3.0 bad_bot
                  BrowserMatchNoCase HuaweiSymantecSpider bad_bot
                  BrowserMatchNoCase Exabot/2.0 bad_bot
                  BrowserMatchNoCase netseer/0.1 bad_bot
                  BrowserMatchNoCase NetSeer crawler/2.0 bad_bot
                  BrowserMatchNoCase NetSeer/Nutch-0.9 bad_bot
                  BrowserMatchNoCase psbot/0.1 bad_bot
                  BrowserMatchNoCase Moreoverbot/x.00 bad_bot
                  BrowserMatchNoCase moreoverbot/5.0 bad_bot
                  BrowserMatchNoCase Jakarta Commons-HttpClient/3.0 bad_bot
                  BrowserMatchNoCase SocialSpider-Finder/0.2 bad_bot
                  BrowserMatchNoCase MaxPointCrawler/Nutch-1.1 bad_bot
                  BrowserMatchNoCase willow bad_bot
                  Order Deny,Allow
                  Deny from env=bad_bot

                  End Bad Bot Blocking

                  Header set X-Robots-Tag "noindex, nofollow"

                  1 Reply Last reply Reply Quote 0
                  • 1 / 1
                  • First post
                    Last post
                  • Robot.txt file issue on wordpress site.
                    AlanMosley
                    AlanMosley
                    0
                    8
                    229

                  • Question about robots.txt
                    spencerhjustice
                    spencerhjustice
                    0
                    3
                    125

                  • Description tag not showing in the SERPs because page is blocked by Robots, but the page isn't blocked. Any help?
                    DavidFaltz
                    DavidFaltz
                    0
                    9
                    183

                  • Login webpage blocked by robots
                    juanmiguelcr
                    juanmiguelcr
                    0
                    5
                    112

                  • Site Maps / Robots.txt etc
                    LockCity
                    LockCity
                    0
                    3
                    120

                  • Can we listed URL on Website sitemap page which are blocked by Robots.txt
                    irvingw
                    irvingw
                    0
                    7
                    482

                  • Robots.txt: excluding URL
                    john4math
                    john4math
                    0
                    2
                    821

                  • Photogallery and Robots.txt
                    Rapturecamps
                    Rapturecamps
                    0
                    5
                    831

                  Get started with Moz Pro!

                  Unlock the power of advanced SEO tools and data-driven insights.

                  Start my free trial
                  Products
                  • Moz Pro
                  • Moz Local
                  • Moz API
                  • Moz Data
                  • STAT
                  • Product Updates
                  Moz Solutions
                  • SMB Solutions
                  • Agency Solutions
                  • Enterprise Solutions
                  • Digital Marketers
                  Free SEO Tools
                  • Domain Authority Checker
                  • Link Explorer
                  • Keyword Explorer
                  • Competitive Research
                  • Brand Authority Checker
                  • Local Citation Checker
                  • MozBar Extension
                  • MozCast
                  Resources
                  • Blog
                  • SEO Learning Center
                  • Help Hub
                  • Beginner's Guide to SEO
                  • How-to Guides
                  • Moz Academy
                  • API Docs
                  About Moz
                  • About
                  • Team
                  • Careers
                  • Contact
                  Why Moz
                  • Case Studies
                  • Testimonials
                  Get Involved
                  • Become an Affiliate
                  • MozCon
                  • Webinars
                  • Practical Marketer Series
                  • MozPod
                  Connect with us

                  Contact the Help team

                  Join our newsletter
                  Moz logo
                  © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
                  • Accessibility
                  • Terms of Use
                  • Privacy