The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Technical Support
    4. How to block Rogerbot From Crawling UTM URLs

    How to block Rogerbot From Crawling UTM URLs

    Technical Support
    7 4 1.0k
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • Firestarter-SEO
      Firestarter-SEO last edited by

      I am trying to block roger from crawling some UTM urls we have created, but having no luck. My robots.txt file looks like:

      User-agent: rogerbot
      Disallow: /?utm_source*
      
      This does not seem to be working. Any ideas? 
      
      1 Reply Last reply Reply Quote 0
      • LoganRay
        LoganRay last edited by

        Skyler,

        You're close, give this a shot:

        Disallow: /*?utm_

        This will be inclusive of all UTM tags regardless of what comes before the tag or what element you have first.

        Firestarter-SEO 1 Reply Last reply Reply Quote 2
        • tawnycase
          tawnycase last edited by

          Hi there! Tawny from the Customer Support team here!

          You should be able to add a disallow directive for that parameter and any others to block our crawler from accessing them. It would look something like this:

          User-agent: Rogerbot
          Disallow: ?utm

          etc., until you have blocked all of the parameters that may be causing these duplicate content errors. It looks like the _source* might be what's giving our tools some trouble. It looks like Logan Ray has made an excellent suggestion - give that formatting a try and see if it helps! 🙂

          You can also use the wild card user-agent * in order to block all crawlers from those pages, if you prefer. Here is a great resource about the robots.txt file that might be helpful: https://moz.com/learn/seo/robotstxt We always recommend checking your robots.txt file with a handy Robots Checker Tool once you make changes to avoid any nasty surprises. 🙂

          Jenny1 1 Reply Last reply Reply Quote 0
          • Firestarter-SEO
            Firestarter-SEO @LoganRay last edited by

            What is the difference between Disallow: /*?utm_ and Disallow: /?utm_  ?

            tawnycase 1 Reply Last reply Reply Quote 0
            • tawnycase
              tawnycase @Firestarter-SEO last edited by

              The only difference there is the * wildchar. The string with that character will limit the crawler from accessing any URL with that string of characters in it. 🙂

              1 Reply Last reply Reply Quote 0
              • Jenny1
                Jenny1 @tawnycase last edited by

                FYI - I tried this and it did not work. Rogerbot is still picking up URL's we don't need. It's making my crawl report a mess!

                tawnycase 1 Reply Last reply Reply Quote 0
                • tawnycase
                  tawnycase @Jenny1 last edited by

                  Shoot! There may be something else going on. Give us a shout at help@moz.com and we'll see if we can figure it out!

                  1 Reply Last reply Reply Quote 0
                  • 1 / 1
                  • First post
                    Last post
                  • Crawl still in process for 3 days. Not sure why the site isn't being crawled
                    dpsoftware
                    dpsoftware
                    0
                    3
                    52

                  • Crawl error robots.txt
                    Mandiram
                    Mandiram
                    0
                    10
                    688

                  • Rogerbot not crawling our site
                    KristinaKeyser
                    KristinaKeyser
                    0
                    3
                    91

                  • No crawl data anymore
                    KBC
                    KBC
                    0
                    3
                    103

                  • What is the difference between the "Crawl Issues" report and the "Crawl Test" report?
                    Silkstream
                    Silkstream
                    0
                    2
                    271

                  • Number of pages crawled = 1; Why?
                    jameskais
                    jameskais
                    0
                    5
                    143

                  • Latest Crawl Stats Not Showing
                    DavidLee
                    DavidLee
                    0
                    2
                    110

                  • Duplicate Content Report: Duplicate URLs being crawled with "++" at the end
                    ChiarynMiranda
                    ChiarynMiranda
                    0
                    6
                    253

                  Get started with Moz Pro!

                  Unlock the power of advanced SEO tools and data-driven insights.

                  Start my free trial
                  Products
                  • Moz Pro
                  • Moz Local
                  • Moz API
                  • Moz Data
                  • STAT
                  • Product Updates
                  Moz Solutions
                  • SMB Solutions
                  • Agency Solutions
                  • Enterprise Solutions
                  • Digital Marketers
                  Free SEO Tools
                  • Domain Authority Checker
                  • Link Explorer
                  • Keyword Explorer
                  • Competitive Research
                  • Brand Authority Checker
                  • Local Citation Checker
                  • MozBar Extension
                  • MozCast
                  Resources
                  • Blog
                  • SEO Learning Center
                  • Help Hub
                  • Beginner's Guide to SEO
                  • How-to Guides
                  • Moz Academy
                  • API Docs
                  About Moz
                  • About
                  • Team
                  • Careers
                  • Contact
                  Why Moz
                  • Case Studies
                  • Testimonials
                  Get Involved
                  • Become an Affiliate
                  • MozCon
                  • Webinars
                  • Practical Marketer Series
                  • MozPod
                  Connect with us

                  Contact the Help team

                  Join our newsletter
                  Moz logo
                  © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
                  • Accessibility
                  • Terms of Use
                  • Privacy