The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. On-Page / Site Optimization
    4. Does Google respect User-agent rules in robots.txt?

    Does Google respect User-agent rules in robots.txt?

    On-Page / Site Optimization
    3 3 942
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • lzhao
      lzhao last edited by

      We want to use an inline linking tool (LinkSmart) to cross link between a few key content types on our online news site.

      LinkSmart uses a bot to establish the linking.

      The issue:  There are millions of pages on our site that we don't want LinkSmart to spider and process for cross linking.

      LinkSmart suggested setting a noindex tag on the pages we don't want them to process, and that we target the rule to their specific user agent.

      I have concerns.  We don't want to inadvertently block search engine access to those millions of pages.  I've seen googlebot ignore nofollow rules set at the page level.  Does it ever arbitrarily obey rules that it's been directed to ignore?

      Can you quantify the level of risk in setting user-agent-specific nofollow tags on pages we want search engines to crawl, but that we want LinkSmart to ignore?

      1 Reply Last reply Reply Quote 0
      • JamesNorquay
        JamesNorquay last edited by

        Hi,

        I would advise to block the directories which the files sit in in robots.txt, over adding no index tags to specific pages.

        Yet then this would also leave these pages to not be indexed by Google, other search engines and also this Link Smart software you are referring to.

        The thing is if you add a no index tag or if you add a robots .txt block to pages it will also block all search engines too.

        So yes their is some risk involved, you have to do things carefully around this area.

        Kind Regards,

        James.

        1 Reply Last reply Reply Quote 0
        • RyanKent
          RyanKent last edited by

          Does Google respect User-agent rules in robots.txt?

          Yes

          I've seen googlebot ignore nofollow rules set at the page level.

          Google honors the nofollow rules set at the page level. The issue is there may be other links on your site or elsewhere on the web that Google will find and follow those links.

          Robots.txt is the absolute last means to use for blocking pages. You should not block a page with robots.txt unless you have exhausted all other options. A more appropriate method of keeping a page out of the index is the noindex tag. If you use the tag appropriately, Google will honor the tag.

          1 Reply Last reply Reply Quote 1
          • 1 / 1
          • First post
            Last post
          • Correct robots.txt for WordPress
            jasongmcmahon
            jasongmcmahon
            1
            5
            180

          • Robots.txt Question for E-Commerce Sites
            Joe.Robison
            Joe.Robison
            0
            2
            450

          • Robot.txt file issue on wordpress site.
            AlanMosley
            AlanMosley
            0
            8
            229

          • Question about robots.txt
            spencerhjustice
            spencerhjustice
            0
            3
            125

          • When You Add a Robots.txt file to a website to block certain URLs, do they disappear from Google's index?
            Saijo.George
            Saijo.George
            0
            3
            205

          • Wordpress categories tags and robots.txt
            sfmatthews
            sfmatthews
            0
            4
            714

          • Robots.txt: excluding URL
            john4math
            john4math
            0
            2
            821

          • How do you block development servers with robots.txt?
            JustinTaylor88
            JustinTaylor88
            0
            7
            5.2k

          Get started with Moz Pro!

          Unlock the power of advanced SEO tools and data-driven insights.

          Start my free trial
          Products
          • Moz Pro
          • Moz Local
          • Moz API
          • Moz Data
          • STAT
          • Product Updates
          Moz Solutions
          • SMB Solutions
          • Agency Solutions
          • Enterprise Solutions
          • Digital Marketers
          Free SEO Tools
          • Domain Authority Checker
          • Link Explorer
          • Keyword Explorer
          • Competitive Research
          • Brand Authority Checker
          • Local Citation Checker
          • MozBar Extension
          • MozCast
          Resources
          • Blog
          • SEO Learning Center
          • Help Hub
          • Beginner's Guide to SEO
          • How-to Guides
          • Moz Academy
          • API Docs
          About Moz
          • About
          • Team
          • Careers
          • Contact
          Why Moz
          • Case Studies
          • Testimonials
          Get Involved
          • Become an Affiliate
          • MozCon
          • Webinars
          • Practical Marketer Series
          • MozPod
          Connect with us

          Contact the Help team

          Join our newsletter
          Moz logo
          © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
          • Accessibility
          • Terms of Use
          • Privacy