The Moz Q&A Forum

    Large robots.txt file

    Intermediate & Advanced SEO
    • ThomasHarvey

      We're looking at potentially creating a robots.txt with 1450 lines in it. This would remove 100k+ old pages from the crawl (I know the ideal would be to delete or noindex them, but unfortunately that isn't viable).

      My concern is that a robots.txt this large will either stop being followed by crawlers or will slow our crawl rate down.

      Does anybody have any experience with a robots.txt of that size?

      • ThomasHarvey

        Answered my own question:

        https://developers.google.com/webmasters/control-crawl-index/docs/robots_txt?csw=1#file-format

        "A maximum file size may be enforced per crawler. Content which is after the maximum file size may be ignored. Google currently enforces a size limit of 500kb."
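For anyone wanting to sanity-check their own file against that limit, here is a rough sketch in Python. It builds a robots.txt with 1450 Disallow lines and compares its byte size to the quoted limit. The paths are made up for illustration, the helper names (`build_robots_txt`, `size_report`) are hypothetical, and "500kb" is assumed to mean 500 × 1024 bytes; other crawlers may enforce different limits.

```python
# Assumption: "500kb" from the quoted Google docs is read as 500 * 1024 bytes.
GOOGLE_LIMIT_BYTES = 500 * 1024


def build_robots_txt(paths, user_agent="*"):
    """Return robots.txt text with one Disallow line per path."""
    lines = [f"User-agent: {user_agent}"]
    lines += [f"Disallow: {path}" for path in paths]
    return "\n".join(lines) + "\n"


def size_report(robots_txt, limit=GOOGLE_LIMIT_BYTES):
    """Measure the file as bytes on the wire (UTF-8) and compare to the limit."""
    size = len(robots_txt.encode("utf-8"))
    return {
        "bytes": size,
        "limit": limit,
        "within_limit": size <= limit,
        "fraction_used": size / limit,
    }


# Hypothetical old-page paths, only to get a realistic line count and length.
paths = [f"/old-section/archive-page-{i:04d}/" for i in range(1450)]
robots_txt = build_robots_txt(paths)
report = size_report(robots_txt)

print(f"{report['bytes']} bytes of {report['limit']} allowed "
      f"({report['fraction_used']:.1%} used), "
      f"within limit: {report['within_limit']}")
```

With paths of this length, 1450 lines come to only a small fraction of the limit, so line count alone is unlikely to be the problem. To check a real file, read it from disk and pass its contents to `size_report` instead of the generated text.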
