The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Getting Started
    4. Prevent Rodger Bot for crwaling pagination

    Prevent Rodger Bot for crwaling pagination

    Getting Started
    3 3 88
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • twpnglobal
      twpnglobal last edited by

      Hello,

      I have a site that has around 300k static pages but each one of these has pagination on it.

      I would like to stop Rodger Bot from crawling the paginated pages and maybe even Google.

      The paginated pages are results that change daily so there is no need to index them.

      What's the best way to prevent them from being crawled?

      The pages are dynamic so I don't know the URLs.

      I have seen people mention add no follow to the pagination links would this do it? or is there a better way?

      Many thanks

      Steve

      1 Reply Last reply Reply Quote 0
      • GastonRiera
        GastonRiera last edited by

        Hi,

        Lets separate topics here:

        • Prevent crawling is by robots.txt and won't index pages that are already indexed.
        • Prevent indexing and de-index pages already indexed, is done by robots tag with a noindex parameter.
          Here, an article from google about that: Block search indexing with 'noindex' - Google Search Console Help

        That said, another action you might take is adding a nofollow in pagination links. Nofollow only tells Google: "I don't want that page to considered as important." It will probably reduce its chances to rank high but won't prevent from crawling nor indexing.
        Another way, yet a little more expensive in development, is adding a specific parameter in the URL when you know is pagination. Then you can block that in robots.txt. Again, this won't remove what's been already indexed.

        Hope it helps.
        Best luck.
        Gaston

        1 Reply Last reply Reply Quote 0
        • effectdigital
          effectdigital last edited by

          Robots.TXT Rules

          If you have architecture like:

          site.com/blog/post/page/1

          Then use:

          User-agent: rogerbot

          Disallow: /page/

          If you have architecture like:

          site.com/blog/post?p=1

          Then use:

          User-agent: rogerbot

          Disallow: /*?p=

          If you have architecture like:

          site.com/blog/post?page=1

          Then use:

          User-agent: rogerbot

          Disallow: /*?page=


          That should pretty much stop Rogerbot from crawling paginated content. It would certainly stop Googlebot, but I don't quite know if Rogerbot respects the "*" wildcard like Googlebot does. Give it a try, see what happens

          Don't worry, in the robots.txt file only "*" is respected as a wildcard, so you won't have any problems with "?" and there won't be any need for an escape character

          1 Reply Last reply Reply Quote 1
          • 1 / 1
          • First post
            Last post
          • How to increase website Moz DA/PA?
            elonmmusk
            elonmmusk
            0
            6
            41

          • Mozbar Chrome Extension 404 Error
            BartonInteractive
            BartonInteractive
            0
            6
            77

          • DA error in my website
            Bdgbye
            Bdgbye
            0
            4
            43

          • Moz DA Issue
            BartonInteractive
            BartonInteractive
            0
            3
            33

          • Finding less competitive keywords
            Oinsiie78
            Oinsiie78
            0
            2
            36

          • Da Decreased
            Staunton_Rook
            Staunton_Rook
            0
            2
            34

          • When setting up a new campaign for a client should I add keywords
            dave.kudera
            dave.kudera
            0
            5
            43

          • Standard Syntax in robots.txt doesn't prevent Moz bot from crawling
            btreloar
            btreloar
            0
            6
            127

          Get started with Moz Pro!

          Unlock the power of advanced SEO tools and data-driven insights.

          Start my free trial
          Products
          • Moz Pro
          • Moz Local
          • Moz API
          • Moz Data
          • STAT
          • Product Updates
          Moz Solutions
          • SMB Solutions
          • Agency Solutions
          • Enterprise Solutions
          • Digital Marketers
          Free SEO Tools
          • Domain Authority Checker
          • Link Explorer
          • Keyword Explorer
          • Competitive Research
          • Brand Authority Checker
          • Local Citation Checker
          • MozBar Extension
          • MozCast
          Resources
          • Blog
          • SEO Learning Center
          • Help Hub
          • Beginner's Guide to SEO
          • How-to Guides
          • Moz Academy
          • API Docs
          About Moz
          • About
          • Team
          • Careers
          • Contact
          Why Moz
          • Case Studies
          • Testimonials
          Get Involved
          • Become an Affiliate
          • MozCon
          • Webinars
          • Practical Marketer Series
          • MozPod
          Connect with us

          Contact the Help team

          Join our newsletter
          Moz logo
          © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
          • Accessibility
          • Terms of Use
          • Privacy