The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Technical SEO Issues
    4. Some bots excluded from crawling client's domain

    Some bots excluded from crawling client's domain

    Technical SEO Issues
    2 2 31
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • SimpleSearch
      SimpleSearch last edited by

      Hi all!

      My client is in healthcare in the US and for HIPAA reasons, blocks traffic from most international sources.

      a. I don't think this is good for SEO

      b. The site won't allow Moz bot or Screaming Frog bot to crawl it.  It's so frustrating.

      We can't figure out what mechanism they are utilizing to execute this.  Any help as we start down the rabbit hole to remedy is much appreciated.

      thank you!

      1 Reply Last reply Reply Quote 0
      • effectdigital
        effectdigital last edited by

        The main reason it's not good is that Google crawl from different data-centers around the world. So one day they may think the site is up, then the next they may think the site is gone and down

        Typically you use a user-agent lance to pierce these kinds of setups. Screaming Frog for example, you can pre-select from a variety of user-agents (including 'googlebot' and Chrome) but you can also author or write your own user-agent

        Write a long one that looks like an encryption key. Tell your client the user agent you have defined, let them create and exemption for it within their spam-defense system. Insert the user-agent (which no one else has or uses) into Screaming Frog, use it to allow the crawler to pierce the defense grid

        Typically you would want to exempt 'Googlebot' (as a user agent) from these defense systems, but it comes with a risk. Anyone with basic scripting knowledge or who knows how to install Chrome extensions, can alter the user-agent of their script (or web browser, it's under the user's control) with ease and it is widely known that many sites make an exception for 'Googlebot' - thus it becomes a common vulnerability. For example, lots of publishers create URLs which Google can access and index, yet if you are a bog standard user they ask you to turn off ad-blockers or pay a fee

        Download the Chrome User-Agent extension, set your user-agent to "googlebot" and sail right through. Not ideal from a defense perspective

        For this reason I have often wished (and I am really hoping someone from Google might be reading) that in Search Console, you could tell Google a custom user-agent string and give it to them. You could then exempt that, safe in the knowledge that no one else knows it, and Google would use your own custom string to identify themselves when accessing your site and content. Then everyone could be safe, indexable and happy

        We're not there yet

        1 Reply Last reply Reply Quote 1
        • 1 / 1
        • First post
          Last post
        • 'domain:example.com/' is this line with a '/' at the end of the domain valid in a disavow report file ?
          LabeliumUSA
          LabeliumUSA
          0
          3
          42

        • Pro's & contra's: http vs https
          max.favilli
          max.favilli
          0
          9
          377

        • Sitemap issue? 404's & 500's are regenerating?
          jeff-rackaid.com
          jeff-rackaid.com
          0
          5
          263

        • New domain's Sitemap.xml file loaded to old domain - how does this effect SEO?
          KevinBudzynski
          KevinBudzynski
          0
          5
          179

        • Website's stability and it's affect on SEO
          AlanMosley
          AlanMosley
          0
          2
          1.0k

        • How to improve my site's Domain Authority?
          SeanLade
          SeanLade
          0
          6
          1.8k

        • Crawl Tool Producing Random URL's
          kchandler
          kchandler
          0
          4
          692

        • Access To Client's Google Webmaster Tools
          94501
          94501
          0
          7
          995

        Get started with Moz Pro!

        Unlock the power of advanced SEO tools and data-driven insights.

        Start my free trial
        Products
        • Moz Pro
        • Moz Local
        • Moz API
        • Moz Data
        • STAT
        • Product Updates
        Moz Solutions
        • SMB Solutions
        • Agency Solutions
        • Enterprise Solutions
        • Digital Marketers
        Free SEO Tools
        • Domain Authority Checker
        • Link Explorer
        • Keyword Explorer
        • Competitive Research
        • Brand Authority Checker
        • Local Citation Checker
        • MozBar Extension
        • MozCast
        Resources
        • Blog
        • SEO Learning Center
        • Help Hub
        • Beginner's Guide to SEO
        • How-to Guides
        • Moz Academy
        • API Docs
        About Moz
        • About
        • Team
        • Careers
        • Contact
        Why Moz
        • Case Studies
        • Testimonials
        Get Involved
        • Become an Affiliate
        • MozCon
        • Webinars
        • Practical Marketer Series
        • MozPod
        Connect with us

        Contact the Help team

        Join our newsletter
        Moz logo
        © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
        • Accessibility
        • Terms of Use
        • Privacy