The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Moz Pro
    4. Why is my domain not being crawled anymore?

    Why is my domain not being crawled anymore?

    Moz Pro
    6 3 280
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • tcolling
      tcolling last edited by

      I just noticed that right around 12/1/2012, SEOMoz stopped crawling all but two pages out of the 400 or so on my website at www.TrustworthyCare.com .  I speculate that this is probably due to some dumb mistake I made at that time, but I can't for the life of me figure out what that mistake was.  Before that, the weekly crawls included all 400 or so pages.

      I wonder whether it's something that changed in our .htaccess file.  Here's how that file looks now; can anyone see what is wrong there, or perhaps offer other suggestions if it doesn't look like anything is wrong in it?

      Thanks!

      • Tim

      PS - I'm a small business owner, not an SEO or software engineer.

      PPS - I found and read this page, but I've pretty much tried the things described there (I think): https://seomoz.zendesk.com/entries/409821-why-isn-t-my-site-being-crawled-you-re-not-crawling-all-my-pages

      =================================

      RewriteCond %{HTTP_HOST} ^aservantsheartcare.com$ [OR]RewriteCond %{HTTP_HOST} ^www.aservantsheartcare.com$RewriteRule ^/?$ "http://trustworthycare.com/" [R=301,L] RewriteCond %{HTTP_HOST} ^aservantsheartcaremanagement.com$ [OR]RewriteCond %{HTTP_HOST} ^www.aservantsheartcaremanagement.com$RewriteRule ^/?$ "http://trustworthycare.com/" [R=301,L] RewriteCond %{HTTP_HOST} ^aservantsheartgeriatriccare.com$ [OR]RewriteCond %{HTTP_HOST} ^www.aservantsheartgeriatriccare.com$RewriteRule ^/?$ "http://trustworthycare.com/" [R=301,L] RewriteCond %{HTTP_HOST} ^aservantsheartgeriatriccaremanagement.com$ [OR]RewriteCond %{HTTP_HOST} ^www.aservantsheartgeriatriccaremanagement.com$RewriteRule ^/?$ "http://trustworthycare.com/" [R=301,L] RewriteCond %{HTTP_HOST} ^aservantshearthomecare.com$ [OR]RewriteCond %{HTTP_HOST} ^www.aservantshearthomecare.com$RewriteRule ^/?$ "http://trustworthycare.com/" [R=301,L] RewriteCond %{HTTP_HOST} ^aservantsheartseniorcare.com$ [OR]RewriteCond %{HTTP_HOST} ^www.aservantsheartseniorcare.com$RewriteRule ^/?$ "http://trustworthycare.com/" [R=301,L] RewriteCond %{HTTP_HOST} ^aservantsheartservices.com$ [OR]RewriteCond %{HTTP_HOST} ^www.aservantsheartservices.com$RewriteRule ^/?$ "http://trustworthycare.com/" [R=301,L] RewriteCond %{HTTP_HOST} ^careforparents.com$ [OR]RewriteCond %{HTTP_HOST} ^www.careforparents.com$RewriteRule ^/?$ "http://trustworthycare.com/" [R=301,L] RewriteCond %{HTTP_HOST} ^eldercareradio.com$ [OR]RewriteCond %{HTTP_HOST} ^www.eldercareradio.com$RewriteRule ^/?$ "http://trustworthycare.com/" [R=301,L] RewriteCond %{HTTP_HOST} ^helpforyourparents.com$ [OR]RewriteCond %{HTTP_HOST} ^www.helpforyourparents.com$RewriteRule ^/?$ "http://trustworthycare.com/" [R=301,L] RewriteCond %{HTTP_HOST} ^privatedutyseniorcare.com$ [OR]RewriteCond %{HTTP_HOST} ^www.privatedutyseniorcare.com$RewriteRule ^/?$ "http://trustworthycare.com/" [R=301,L] RewriteCond %{HTTP_HOST} ^sandiegocaremanagement.com$ [OR]RewriteCond %{HTTP_HOST} ^www.sandiegocaremanagement.com$RewriteRule ^/?$ "http://trustworthycare.com/" [R=301,L] RewriteCond %{HTTP_HOST} ^sandiegocaremanager.com$ [OR]RewriteCond %{HTTP_HOST} ^www.sandiegocaremanager.com$RewriteRule ^/?$ "http://trustworthycare.com/" [R=301,L] RewriteCond %{HTTP_HOST} ^sandiegogeriatriccaremanagement.com$ [OR]RewriteCond %{HTTP_HOST} ^www.sandiegogeriatriccaremanagement.com$RewriteRule ^/?$ "http://trustworthycare.com/" [R=301,L] RewriteCond %{HTTP_HOST} ^sandiegogeriatriccaremanager.com$ [OR]RewriteCond %{HTTP_HOST} ^www.sandiegogeriatriccaremanager.com$RewriteRule ^/?$ "http://trustworthycare.com/" [R=301,L] RewriteCond %{HTTP_HOST} ^servantsheartcare.com$ [OR]RewriteCond %{HTTP_HOST} ^www.servantsheartcare.com$RewriteRule ^/?$ "http://trustworthycare.com/" [R=301,L] RewriteCond %{HTTP_HOST} ^servantshearthomecare.com$ [OR]RewriteCond %{HTTP_HOST} ^www.servantshearthomecare.com$RewriteRule ^/?$ "http://trustworthycare.com/" [R=301,L] RewriteCond %{HTTP_HOST} ^servantsheartseniorcare.com$ [OR]RewriteCond %{HTTP_HOST} ^www.servantsheartseniorcare.com$RewriteRule ^/?$ "http://trustworthycare.com/" [R=301,L] RewriteCond %{HTTP_HOST} ^tlccare.com$ [OR]RewriteCond %{HTTP_HOST} ^www.tlccare.com$RewriteRule ^/?$ "http://trustworthycare.com/" [R=301,L] RewriteCond %{HTTP_HOST} ^tlcseniorcenter.com$ [OR]RewriteCond %{HTTP_HOST} ^www.tlcseniorcenter.com$RewriteRule ^/?$ "http://trustworthycare.com/" [R=301,L] RewriteCond %{HTTP_HOST} ^tlcseniorhomecare.com$ [OR]RewriteCond %{HTTP_HOST} ^www.tlcseniorhomecare.com$RewriteRule ^/?$ "http://trustworthycare.com/" [R=301,L] RewriteCond %{HTTP_HOST} ^tlcseniorservices.com$ [OR]RewriteCond %{HTTP_HOST} ^www.tlcseniorservices.com$RewriteRule ^/?$ "http://trustworthycare.com/" [R=301,L] #php_value upload_max_filesize 8MRewriteCond %{HTTP_HOST} ^trustworthycare.com$RewriteRule ^(.)$ "http://www.trustworthycare.com/$1" [R=301,L] RewriteCond %{HTTP_REFERER} !^$RewriteCond %{HTTP_REFERER} !^http://blog.trustworthycare.com/.*$      [NC]RewriteCond %{HTTP_REFERER} !^http://blog.trustworthycare.com$      [NC]RewriteCond %{HTTP_REFERER} !^http://test.trustworthycare.com/.*$      [NC]RewriteCond %{HTTP_REFERER} !^http://test.trustworthycare.com$      [NC]RewriteCond %{HTTP_REFERER} !^http://trustworthycare.com/.*$      [NC]RewriteCond %{HTTP_REFERER} !^http://trustworthycare.com$      [NC]RewriteCond %{HTTP_REFERER} !^http://www.blog.trustworthycare.com/.*$      [NC]RewriteCond %{HTTP_REFERER} !^http://www.blog.trustworthycare.com$      [NC]RewriteCond %{HTTP_REFERER} !^http://www.test.trustworthycare.com/.*$      [NC]RewriteCond %{HTTP_REFERER} !^http://www.test.trustworthycare.com$      [NC]RewriteCond %{HTTP_REFERER} !^http://www.trustworthycare.com/.*$      [NC]RewriteCond %{HTTP_REFERER} !^http://www.trustworthycare.com$      [NC]RewriteCond %{HTTP_REFERER} !^http://www.trustworthycare.com/images/files_for_service_inquiries/.*$      [NC]RewriteCond %{HTTP_REFERER} !^http://www.trustworthycare.com/images/files_for_service_inquiries$      [NC]RewriteCond %{HTTP_REFERER} !^http://sandbox.trustworthycare.com/.*$      [NC]RewriteCond %{HTTP_REFERER} !^http://sandbox.trustworthycare.com$      [NC]RewriteRule ..(jpg|jpeg|gif|png|bmp)$ - [F,NC] RewriteCond %{HTTP_HOST} ^ashsc.com$ [OR]RewriteCond %{HTTP_HOST} ^www.ashsc.com$RewriteRule ^/?$ "http://trustworthycare.com/" [R=301,L] # BEGIN W3TC Browser Cache         BrowserMatch ^Mozilla/4 gzip-only-text/html        BrowserMatch ^Mozilla/4.0[678] no-gzip        BrowserMatch \bMSIE !no-gzip !gzip-only-text/html        BrowserMatch \bMSI[E] !no-gzip !gzip-only-text/html         Header append Vary User-Agent env=!dont-vary         AddOutputFilterByType DEFLATE text/css application/x-javascript text/x-component text/html text/richtext image/svg+xml text/plain text/xsd text/xsl text/xml image/x-icon <filesmatch ".(css|js|htc|css|js|htc)$"=""></filesmatch>    FileETag None          Header set X-Powered-By "W3 Total Cache/0.9.2.5" <filesmatch ".(html|htm|rtf|rtx|svg|svgz|txt|xsd|xsl|xml|html|htm|rtf|rtx|svg|svgz|txt|xsd|xsl|xml)$"=""></filesmatch>    FileETag None          Header set X-Powered-By "W3 Total Cache/0.9.2.5" <filesmatch ".(asf|asx|wax|wmv|wmx|avi|bmp|class|divx|doc|docx|eot|exe|gif|gz|gzip|ico|jpg|jpeg|jpe|mdb|mid|midi|mov|qt|mp3|m4a|mp4|m4v|mpeg|mpg|mpe|mpp|otf|odb|odc|odf|odg|odp|ods|odt|ogg|pdf|png|pot|pps|ppt|pptx|ra|ram|svg|svgz|swf|tar|tif|tiff|ttf|ttc|wav|wma|wri|xla|xls|xlsx|xlt|xlw|zip|asf|asx|wax|wmv|wmx|avi|bmp|class|divx|doc|docx|eot|exe|gif|gz|gzip|ico|jpg|jpeg|jpe|mdb|mid|midi|mov|qt|mp3|m4a|mp4|m4v|mpeg|mpg|mpe|mpp|otf|odb|odc|odf|odg|odp|ods|odt|ogg|pdf|png|pot|pps|ppt|pptx|ra|ram|svg|svgz|swf|tar|tif|tiff|ttf|ttc|wav|wma|wri|xla|xls|xlsx|xlt|xlw|zip)$"=""></filesmatch>    FileETag None          Header set X-Powered-By "W3 Total Cache/0.9.2.5" # END W3TC Browser Cache# BEGIN W3TC Page Cache core    RewriteEngine On    RewriteBase /    RewriteRule ^(./)?w3tc_rewrite_test$ $1?w3tc_rewrite_test=1 [L]    RewriteCond %{HTTP:Accept-Encoding} gzip    RewriteRule . - [E=W3TC_ENC:gzip]    RewriteCond %{REQUEST_METHOD} !=POST    RewriteCond %{QUERY_STRING} =""    RewriteCond %{HTTP_HOST} =www.trustworthycare.com    RewriteCond %{REQUEST_URI} /$ [OR]    RewriteCond %{REQUEST_URI} (sitemap(index)?.xml(.gz)?|[a-z0-9-]+-sitemap([0-9]+)?.xml(.gz)?) [NC]    RewriteCond %{REQUEST_URI} !(/wp-admin/|/xmlrpc.php|/wp-(app|cron|login|register|mail).php|/feed/|wp-.*.php|index.php) [NC,OR]    RewriteCond %{REQUEST_URI} (wp-comments-popup.php|wp-links-opml.php|wp-locations.php) [NC]    RewriteCond %{HTTP_COOKIE} !(comment_author|wp-postpass|wordpress[a-f0-9]+|wordpress_logged_in) [NC]    RewriteCond %{HTTP_USER_AGENT} !(W3\ Total\ Cache/0.9.2.5) [NC]    RewriteCond "%{DOCUMENT_ROOT}/sitectrl/wp-content/w3tc/pgcache/%{REQUEST_URI}/_index%{ENV:W3TC_UA}%{ENV:W3TC_REF}%{ENV:W3TC_SSL}.html%{ENV:W3TC_ENC}" -f    RewriteRule .* "/sitectrl/wp-content/w3tc/pgcache/%{REQUEST_URI}/_index%{ENV:W3TC_UA}%{ENV:W3TC_REF}%{ENV:W3TC_SSL}.html%{ENV:W3TC_ENC}" [L]# END W3TC Page Cache core# BEGIN WordPressRewriteEngine OnRewriteBase /RewriteRule ^index.php$ - [L]RewriteCond %{REQUEST_FILENAME} !-fRewriteCond %{REQUEST_FILENAME} !-dRewriteRule . /index.php [L] # END WordPressRewriteCond %{HTTP_HOST} ^privatedutycare.com$ [OR]RewriteCond %{HTTP_HOST} ^www.privatedutycare.com$RewriteRule ^/?$ "http://www.ageassistance.com" [R=301,L]

      =================================

      1 Reply Last reply Reply Quote 0
      • TheeDigital
        TheeDigital last edited by

        Have you taken a look at your Google Webmaster Tools?  Among the various things it will give you a list of all the crawl errors that you have on your website.  That might be of help.

        tcolling 1 Reply Last reply Reply Quote 1
        • tcolling
          tcolling @TheeDigital last edited by

          Yes, I have looked at WMT.

          I used it to test my robots.txt file, here are the results:

          For the domain http://www.trustworthycare.com/ it reports:

          Allowed; Detected as a directory; specific files may have different restrictions

          It reports that it can fetch the homepage successfully.

          It reports the following under crawl errors:

          Access denied: 1

          Not found: 76 (none of these are ones that would be likely to block the entire domain from being crawled)

          I investigated the Access denied and it turned out to be this file:

          trustworthycare.com/public_html/sitectrl/wp-app.php

          Would that be causing this?  It doesn't seem likely.

          1 Reply Last reply Reply Quote 0
          • ChiarynMiranda
            ChiarynMiranda last edited by

            Hey Tim,

            Thanks for writing in! I took a look at your crawl diagnostics and it looks like one of the notices we are returning is that there is a nofollow directive in the source code of http://www.trustworthycare.com/. I checked the source code on http://www.trustworthycare.com/ and I am definitely seeing that directive in there, as well (http://screencast.com/t/FGF29en2). This directive means that we are not allowed to follow through any links on that page, so we crawl trustworthycare.com which redirects to www.trustworthycare.com and then we can't go any further into the site due to that directive.

            I hope this clears things up. Please let me know if you have any further questions.

            -Chiaryn

            tcolling 1 Reply Last reply Reply Quote 1
            • tcolling
              tcolling @ChiarynMiranda last edited by

              THANK YOU!

              That was the information that I needed.  I didn't know to look for that "robots" meta information.  With that, it was pretty quick work to find where I must have accidentally turned on a setting on my home page's settings to cause that to happen.

              • Tim
              ChiarynMiranda 1 Reply Last reply Reply Quote 0
              • ChiarynMiranda
                ChiarynMiranda @tcolling last edited by

                So glad I could help figure out the issue! Let us know if you need any other assistance.

                -Chiaryn

                1 Reply Last reply Reply Quote 0
                • 1 / 1
                • First post
                  Last post
                • Why my site not crawl?
                  jahanidawodi
                  jahanidawodi
                  0
                  5
                  68

                • Domain Authority
                  brunoseo
                  brunoseo
                  0
                  4
                  403

                • Aren't domain.com/page and domain.com/page/ the same thing?
                  G2W
                  G2W
                  0
                  8
                  171

                • Can I change the Competitive Domain Analysis Domains?
                  ChrisMcIlroy
                  ChrisMcIlroy
                  0
                  3
                  254

                • Crawl Diagnostics: Next crawl date is in the past
                  Abe_Schmidt
                  Abe_Schmidt
                  0
                  4
                  291

                • Why are my sub-domains ranking higher than my Root Domains?
                  ShaMenz
                  ShaMenz
                  0
                  7
                  1.4k

                • Moz crawling
                  coremediadesign
                  coremediadesign
                  0
                  6
                  631

                • TLD vs Sub Domain in Regards to Domain Authority
                  Hannah_Smith
                  Hannah_Smith
                  0
                  4
                  1.9k

                Get started with Moz Pro!

                Unlock the power of advanced SEO tools and data-driven insights.

                Start my free trial
                Products
                • Moz Pro
                • Moz Local
                • Moz API
                • Moz Data
                • STAT
                • Product Updates
                Moz Solutions
                • SMB Solutions
                • Agency Solutions
                • Enterprise Solutions
                • Digital Marketers
                Free SEO Tools
                • Domain Authority Checker
                • Link Explorer
                • Keyword Explorer
                • Competitive Research
                • Brand Authority Checker
                • Local Citation Checker
                • MozBar Extension
                • MozCast
                Resources
                • Blog
                • SEO Learning Center
                • Help Hub
                • Beginner's Guide to SEO
                • How-to Guides
                • Moz Academy
                • API Docs
                About Moz
                • About
                • Team
                • Careers
                • Contact
                Why Moz
                • Case Studies
                • Testimonials
                Get Involved
                • Become an Affiliate
                • MozCon
                • Webinars
                • Practical Marketer Series
                • MozPod
                Connect with us

                Contact the Help team

                Join our newsletter
                Moz logo
                © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
                • Accessibility
                • Terms of Use
                • Privacy