The Moz Q&A Forum

    Baidu Spider Spam

    Search Engine Trends
    • MangoMM

      Baidu Spider has hit my UK site every 5 minutes, every day, for the past 2 years.

      It doesn't care whether a domain exists or not.

      I know this because in /etc/httpd/logs/error_log I see hits every 5 minutes from Baidu Spider trying to access a domain that no longer exists but still points to my server.

      Given that I have absolutely no trade with China, and given that the only spam comments I get on my WordPress blog originate from China, do you think it's a good idea to either add a China country block to my .htaccess or block Baidu Spider outright?

      Baidu is consuming bandwidth and clogging my error logs!

      Why is it that Google, Bing, Yahoo and the rest can all crawl my site nicely, but Baidu just abuses it?

      • GPainter

        Have you tried blocking it in robots.txt?

        #Baiduspider
        User-agent: Baiduspider
        Disallow: /
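One way to sanity-check rules like these offline is Python's standard-library urllib.robotparser. The following is only a minimal sketch and not part of the original reply: the helper name is_blocked is hypothetical, and the check says nothing about whether a given crawler actually honors the file. It only shows how a compliant parser reads the rules.

```python
from urllib.robotparser import RobotFileParser

# The rules suggested above, inlined so the check needs no network access.
ROBOTS_TXT = """\
# Baiduspider
User-agent: Baiduspider
Disallow: /
"""


def is_blocked(robots_txt: str, user_agent: str, url: str = "/") -> bool:
    """Return True if a compliant parser would refuse `url` for `user_agent`.

    `is_blocked` is a hypothetical helper name used for illustration.
    Pass the crawler's product token (e.g. "Baiduspider"), not the full
    "Mozilla/5.0 (compatible; ...)" header value.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return not parser.can_fetch(user_agent, url)


for agent in ("Baiduspider", "Baiduspider-image", "Googlebot"):
    state = "blocked" if is_blocked(ROBOTS_TXT, agent) else "allowed"
    print(f"{agent}: {state}")
```

If the tokens you care about come back blocked here but the hits continue, the rules themselves are probably fine and the problem lies elsewhere, for example the robots.txt not being served for the hostname being requested, or a client that ignores it.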

        • MangoMM (in reply to GPainter)

          User-agent: Baiduspider
          User-agent: baiduspider
          User-agent: Baiduspider+
          Disallow: /

          Baidu spider is blocked, but it doesn't seem to care!

          • BlueprintMarketing

            Make sure you're not running an odd plug-in that may be causing a caching issue. I know it sounds strange, but I've heard of this before, and it was caused by an all-in-one event calendar plug-in.

            If it's not something like that, I definitely agree with what Chris said. Good call on that, Chris.

            However, if the domain no longer exists, you will have to implement the robots.txt on whatever your server is currently running.

            If you want a free tool that will let you create a solid block, here's one below, although Chris has already done a great job of creating one.

            http://www.internetmarketingninjas.com/seo-tools/robots-txt-generator/

            Sincerely,

            Thomas

            • GPainter (in reply to MangoMM)

              It should respect robots.txt, so this may be someone pretending to be Baidu. I would try .htaccess if you have no interest in traffic from China.
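Whether a hit really comes from Baidu can be checked with a forward-confirmed reverse DNS lookup. The sketch below is an illustration, not something from this thread. It assumes that genuine Baiduspider hosts reverse-resolve to names under baidu.com or baidu.jp, which should be confirmed against Baidu's own documentation. The function names are hypothetical, and the resolvers are passed in as parameters so the logic can be exercised without touching the network.

```python
import socket

# Assumption: genuine Baiduspider hosts reverse-resolve to names under these
# domains. Verify this against Baidu's current documentation before use.
BAIDU_SUFFIXES = (".baidu.com", ".baidu.jp")


def hostname_is_baidu(hostname: str) -> bool:
    """True if the hostname sits under one of the assumed Baidu domains."""
    return hostname.lower().rstrip(".").endswith(BAIDU_SUFFIXES)


def is_genuine_baiduspider(
    ip: str,
    reverse_lookup=socket.gethostbyaddr,
    forward_lookup=socket.gethostbyname_ex,
) -> bool:
    """Forward-confirmed reverse DNS check for an IPv4 address.

    1. Reverse-resolve the IP to a hostname.
    2. Require the hostname to be under an assumed Baidu domain.
    3. Forward-resolve that hostname and require the original IP in the result.
    Any lookup failure is treated as "not verified".
    """
    try:
        hostname = reverse_lookup(ip)[0]
        if not hostname_is_baidu(hostname):
            return False
        return ip in forward_lookup(hostname)[2]
    except OSError:  # socket.herror and socket.gaierror are subclasses
        return False
```

Calling is_genuine_baiduspider() with a client IP taken from the log performs two live DNS lookups. An IP that fails the check while sending a Baiduspider user agent is most likely an impostor, and in that case robots.txt will not help and a server-level block is the remaining option.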

              • BlueprintMarketing (in reply to MangoMM)

                The complete block is here:

                Baidu (CN)
                Info: http://www.baidu.com/search/spider.htm

                Required robots.txt code:

                User-agent: Baiduspider
                User-agent: Baiduspider-video
                User-agent: Baiduspider-image
                Disallow: /

                http://searchenginewatch.com/article/2067357/Bye-bye-Crawler-Blocking-the-Parasites

                http://forums.oscommerce.com/topic/382923-baiduspider-using-multiple-user-agents-how-to-stop-them/

                • BlueprintMarketing

                  I just remembered another tool that you can easily add to your site to block the bots, simply by telling it not to trust this hostname or IP:

                  https://www.cloudflare.com/

                  In fact, with Cloudflare you can block anything looking for that old domain.

                  It is a free service and a very good DNS. I would use it if you must.

                  Sincerely,

                  Thomas

                  • MangoMM (in reply to BlueprintMarketing)

                    Hi, I've tried Cloudflare before.

                    The problem is that I am using SSL for some of my pages, so Cloudflare doesn't play nice unless I pay them.

                    Also, I am using the Amazon CDN. Does that work with Cloudflare, or is it a bit OTT?

                    I will take a look at your links, and thanks!
