The Moz Q&A Forum

    What To Do About Yahoo Slurp Bot Bogging My Site Down?

    Intermediate & Advanced SEO
    • RobbieFoglia

      Hello,

      Our IT department has informed me that they have seen extremely heavy traffic from the Yahoo Slurp bot in recent days. They are claiming this bot has single-handedly caused one of our servers to crash.

      I am a bit skeptical of this, as I have not found these particular legitimate search engine bots to be aggressive resource hogs, especially for an enterprise-level web server.

      I have asked to examine the server logs myself, but so far without success. IT wants to block this particular bot, but I am apprehensive about doing so, as I don't want it to hurt our site's visibility in Yahoo News or other Yahoo properties.

      Does anyone else have experience with this bot being an overzealous resource drag, and if so, what is the best course of action to satisfy all parties?

      • john4math

        You should be able to control the rate at which the bot accesses your pages by adding a crawl delay to your robots.txt file. Robots.txt and crawl delay are discussed here: http://en.wikipedia.org/wiki/Robots_exclusion_standard, and the Slurp bot here: https://help.yahoo.com/kb/SLN22600.html.

        It should look like this in your robots.txt file:

        User-agent: Slurp
        Crawl-delay: 30

        The crawl delay is the number of seconds the bot should wait between page requests (ask your IT guys what's appropriate for you). I stuck 30 in there, meaning the Slurp bot would only be able to access up to 2 pages a minute.
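If you want to sanity-check the file before deploying it, here is a minimal sketch using Python's standard urllib.robotparser. It only shows how that one parser reads the rule and what request rate the delay works out to; it says nothing about how Slurp itself interprets the file. The helper name max_requests_per_minute is made up for this example.

```python
from urllib.robotparser import RobotFileParser

# The rule from the post, with both directives in one record (no blank line between them).
ROBOTS_TXT = """\
User-agent: Slurp
Crawl-delay: 30
"""


def max_requests_per_minute(robots_txt, user_agent):
    """Return the request-rate ceiling implied by Crawl-delay, or None if no delay applies.

    Hypothetical helper for illustration only.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    delay = parser.crawl_delay(user_agent)  # seconds between requests, or None
    if not delay:
        return None
    return 60 / delay


if __name__ == "__main__":
    print(max_requests_per_minute(ROBOTS_TXT, "Slurp"))      # rate for Slurp
    print(max_requests_per_minute(ROBOTS_TXT, "Googlebot"))  # no rule applies to this agent
```

One thing this parser is strict about: a blank line between the User-agent line and the Crawl-delay line ends the record, so the delay is not picked up. Real crawlers may be more lenient, but keeping the two lines adjacent avoids the question.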

        • N1ghteyes

          Examining the server logs yourself probably won't help your understanding of the issue unless you know specifically what you're looking for. On the Yahoo note, I have found Slurp to be really bad in the past, but no legitimate bot should be able to bring down a properly configured web server, especially an 'enterprise-level' one.

          I would check your .htaccess and Apache settings (or web.config if you are on Windows) for bad redirects before considering banning the bot. Also check the website code itself: if a bot hits a massive, badly optimised database query, for example, that alone could bring the server down.

          Ask IT exactly what the bot did that caused the server to go down; they should at least be able to tell you that. If they can't, they need to run load tests against the website itself to try to reproduce the scenario and debug the issue, if indeed there is one.
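If IT does hand over the logs, a rough first check is simply how many requests per minute carry a Slurp user agent. Below is a minimal sketch that assumes Apache-style "combined" access logs; the regex, the function name requests_per_minute and the sample lines are all made up for illustration, so adjust them to whatever format your server actually writes.

```python
import re
from collections import Counter

# Assumes the "combined" log format: timestamp in [...], user agent as the last quoted field.
LINE_RE = re.compile(
    r'\[(?P<ts>[^\]]+)\] "[^"]*" \d{3} \S+ "[^"]*" "(?P<ua>[^"]*)"'
)

SLURP_UA = "Mozilla/5.0 (compatible; Yahoo! Slurp; http://help.yahoo.com/help/us/ysearch/slurp)"

# Invented sample lines, not real traffic.
SAMPLE_LOG = [
    f'203.0.113.5 - - [10/Oct/2014:13:55:36 +0000] "GET /a HTTP/1.1" 200 512 "-" "{SLURP_UA}"',
    f'203.0.113.5 - - [10/Oct/2014:13:55:50 +0000] "GET /b HTTP/1.1" 200 734 "-" "{SLURP_UA}"',
    '198.51.100.7 - - [10/Oct/2014:13:55:40 +0000] "GET /c HTTP/1.1" 200 128 "-" "Mozilla/5.0 (Windows NT 6.1)"',
    f'203.0.113.5 - - [10/Oct/2014:13:56:02 +0000] "GET /d HTTP/1.1" 200 901 "-" "{SLURP_UA}"',
]


def requests_per_minute(lines, agent_substring="Slurp"):
    """Count requests per minute whose user agent contains agent_substring."""
    counts = Counter()
    for line in lines:
        match = LINE_RE.search(line)
        if match is None:
            continue  # line is not in the assumed format
        if agent_substring.lower() not in match.group("ua").lower():
            continue
        # "10/Oct/2014:13:55:36 +0000" -> "10/Oct/2014:13:55"
        counts[match.group("ts")[:17]] += 1
    return counts


if __name__ == "__main__":
    for minute, hits in sorted(requests_per_minute(SAMPLE_LOG).items()):
        print(minute, hits)
```

A user-agent string is trivial to fake, so a high count here only shows that something calling itself Slurp is busy, not that the traffic really comes from Yahoo.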

          TL;DR: Normally bad config or bad code/queries are to blame for this kind of thing. I'd review that before blocking a bot that crawls hundreds of thousands of other sites without issue.
