The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Technical SEO Issues
    4. "Extremely high number of URLs" warning for robots.txt blocked pages

    "Extremely high number of URLs" warning for robots.txt blocked pages

    Technical SEO Issues
    8 3 380
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • EhrenReilly
      EhrenReilly last edited by

      I have a section of my site that is exclusively for tracking redirects for paid ads.  All URLs under this path do a 302 redirect through our ad tracking system:

      http://www.mysite.com/trackingredirect/blue-widgets?ad_id=1234567 --302--> http://www.mysite.com/blue-widgets

      This path of the site is blocked by our robots.txt, and none of the pages show up for a site: search.

      User-agent: *

      Disallow: /trackingredirect

      However, I keep receiving messages in Google Webmaster Tools about an "extremely high number of URLs", and the URLs listed are in my redirect directory, which is ostensibly not indexed.

      If not by robots.txt, how can I keep Googlebot from wasting crawl time on these millions of /trackingredirect/ links?

      1 Reply Last reply Reply Quote 0
      • FedeEinhorn
        FedeEinhorn last edited by

        There's nothing you need to do. If you don't want those pages to be indexed leaving the robots.txt as it is is fine.

        You can mark that in your Webmaster Tools as fixed and Google won't notify you again.

        EhrenReilly 1 Reply Last reply Reply Quote -1
        • EhrenReilly
          EhrenReilly @FedeEinhorn last edited by

          Federico, my concern is how do I get Google to spend spending so much crawl time on those pages.  I don't want Google to waste time crawling pages that are blocked in my robots.txt.

          1 Reply Last reply Reply Quote 0
          • KristinaKledzik
            KristinaKledzik last edited by

            Hi Ehren,

            Google has said that they send those warnings before they actually crawl your site (why they would bother you with a warning so quickly, I don't know), so I wouldn't worry about this if the warning is the only sign you're getting that Google might be crawling disallowed pages.

            What is your Google Webmaster Tools account saying? If Google isn't reporting to you that it's spending too long crawling your site, and the correct number of pages are indexed, you should be fine.

            Let me know if this is a bigger problem!

            Kristina

            EhrenReilly 1 Reply Last reply Reply Quote 1
            • EhrenReilly
              EhrenReilly @KristinaKledzik last edited by

              This is what my other research has suggested, as well.  Google is "discovering" millions of URLs that go into a queue to get crawled, and they're reporting the extremely high number of URLs in Webmaster Tools before they actually attempt to crawl, and see that all these URLs are blocked by robots.txt.

              KristinaKledzik 1 Reply Last reply Reply Quote 0
              • KristinaKledzik
                KristinaKledzik @EhrenReilly last edited by

                And everything looks okay in your GWT?

                EhrenReilly 1 Reply Last reply Reply Quote 0
                • EhrenReilly
                  EhrenReilly @KristinaKledzik last edited by

                  Yes, Google does not appear to be crawling or indexing any of the pages in question, and GWT doesn't note any issues with crawl budget.

                  KristinaKledzik 1 Reply Last reply Reply Quote 0
                  • KristinaKledzik
                    KristinaKledzik @EhrenReilly last edited by

                    Awesome, good to know things are all okay!

                    1 Reply Last reply Reply Quote 0
                    • 1 / 1
                    • First post
                      Last post
                    • Extreme high number of pages found on webshop
                      PaddyM556
                      PaddyM556
                      0
                      3
                      39

                    • Google Webmaster Tools is saying "Sitemap contains urls which are blocked by robots.txt" after Https move...
                      vetofunk
                      vetofunk
                      0
                      5
                      11.2k

                    • Why is robots.txt blocking URL's in sitemap?
                      PurpleGriffon
                      PurpleGriffon
                      0
                      3
                      346

                    • Is it good to remove page extensions like ".php" or ".htm" in the end of URL as SEO prospects?
                      BlueprintMarketing
                      BlueprintMarketing
                      0
                      8
                      12.9k

                    • Block or remove pages using a robots.txt
                      OlegKorneitchouk
                      OlegKorneitchouk
                      0
                      2
                      422

                    • Same URL in "Duplicate Content" and "Blocked by robots.txt"?
                      alsvik
                      alsvik
                      0
                      3
                      502

                    • Warnings for blocked by blocked by meta-robots/meta robots Nofollow...how to resolve?
                      Cyrus-Shepard
                      Cyrus-Shepard
                      0
                      3
                      415

                    • What to do about "blocked by meta-robots"?
                      AlanBleiweiss
                      AlanBleiweiss
                      0
                      3
                      955

                    Get started with Moz Pro!

                    Unlock the power of advanced SEO tools and data-driven insights.

                    Start my free trial
                    Products
                    • Moz Pro
                    • Moz Local
                    • Moz API
                    • Moz Data
                    • STAT
                    • Product Updates
                    Moz Solutions
                    • SMB Solutions
                    • Agency Solutions
                    • Enterprise Solutions
                    • Digital Marketers
                    Free SEO Tools
                    • Domain Authority Checker
                    • Link Explorer
                    • Keyword Explorer
                    • Competitive Research
                    • Brand Authority Checker
                    • Local Citation Checker
                    • MozBar Extension
                    • MozCast
                    Resources
                    • Blog
                    • SEO Learning Center
                    • Help Hub
                    • Beginner's Guide to SEO
                    • How-to Guides
                    • Moz Academy
                    • API Docs
                    About Moz
                    • About
                    • Team
                    • Careers
                    • Contact
                    Why Moz
                    • Case Studies
                    • Testimonials
                    Get Involved
                    • Become an Affiliate
                    • MozCon
                    • Webinars
                    • Practical Marketer Series
                    • MozPod
                    Connect with us

                    Contact the Help team

                    Join our newsletter
                    Moz logo
                    © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
                    • Accessibility
                    • Terms of Use
                    • Privacy