The Moz Q&A Forum


    Google can't access/crawl my site!

    Intermediate & Advanced SEO
    • granitgash
      granitgash @Andy.Drinkwater last edited by

      In mid-March the website changed its CMS, but I don't think that could be the reason, because until this week everything was working perfectly. I don't think the site has been compromised either. I still suspect that the firewall could be blocking bots from crawling the site, but the server administrator couldn't find any evidence of this.

      • Andy.Drinkwater
        Andy.Drinkwater @granitgash last edited by

        It doesn't look like a firewall, as I can crawl it with Screaming Frog. However, the server logs will be able to answer that one for you.

        Without looking in depth, I'm not seeing anything that stands out to me - do you think that there have been changes to the server that could cause issues? What firewall is the server running? Also, if there were errors in crawling the site, you would see a warning about this.

        -Andy
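For anyone who wants to follow up on the server-log suggestion, here is a minimal sketch that counts HTTP status codes for requests whose User-Agent mentions Googlebot. It assumes the Apache/Nginx "combined" log format, and the sample lines are made up for illustration (they are not from the site in this thread). One caveat: requests that a proxy or firewall in front of the server rejects may never reach the origin logs, so a sudden absence of Googlebot entries can itself be a hint.

```python
import re
from collections import Counter

# Assumption: the server writes the Apache/Nginx "combined" log format.
# Adjust the pattern if the real logs look different.
LOG_LINE = re.compile(
    r'^(?P<ip>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<request>[^"]*)" (?P<status>\d{3}) \S+ '
    r'"(?P<referer>[^"]*)" "(?P<agent>[^"]*)"'
)


def googlebot_status_counts(lines):
    """Count status codes for requests whose User-Agent mentions Googlebot."""
    counts = Counter()
    for line in lines:
        match = LOG_LINE.match(line)
        if match and "googlebot" in match.group("agent").lower():
            counts[match.group("status")] += 1
    return counts


# Made-up sample lines, for illustration only.
SAMPLE_LINES = [
    '66.249.66.1 - - [23/Aug/2014:10:00:00 +0000] "GET / HTTP/1.1" 200 5120 '
    '"-" "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"',
    '66.249.66.1 - - [23/Aug/2014:10:00:05 +0000] "GET /page/ HTTP/1.1" 503 0 '
    '"-" "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"',
    '198.51.100.7 - - [23/Aug/2014:10:00:09 +0000] "GET / HTTP/1.1" 200 5120 '
    '"-" "Mozilla/5.0 (Windows NT 6.1) Firefox/31.0"',
]

SAMPLE_COUNTS = googlebot_status_counts(SAMPLE_LINES)
```

To run it against a real log, pass an open file handle instead of `SAMPLE_LINES`; the function only needs an iterable of lines. The User-Agent string can be spoofed, so these counts show what claims to be Googlebot, not what verifiably is.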

        • granitgash
          granitgash @Andy.Drinkwater last edited by

          We suspect that CloudFlare might be causing these problems. We are trying everything; in the meantime, I'm asking here to see if anyone has had a similar experience or has an idea for a solution.

          As for warnings, the only one we had was last week (8/23/14), saying that Googlebot can't access our site:

          Over the last 24 hours, Googlebot encountered 316 errors while attempting to connect to your site. Your site's overall connection failure rate is 7.5%.

          -Granit

          • Andy.Drinkwater
            Andy.Drinkwater @granitgash last edited by

            Ah OK - well keep us updated with what you find. Someone else will chip in with other info if they have some 🙂

            -Andy

            • Travis_Bailey
              Travis_Bailey last edited by

              A friend of mine just got back from Kosovo. It was the last stop on a tour of the Balkans. He had a pretty good time. Moving along...

              I crawled about 12K URLs and hit almost 90 Internal Server Errors (500). It's probably not your core problem, but it's something to look at. Here are a few examples:

              http://www.gazetaexpress.com/blihet/?search_category_id=1&searchFilter=1

              http://www.gazetaexpress.com/shitet/?category_id=134&searchFilter=1

              http://www.gazetaexpress.com/me-qera/?category_id=131&searchFilter=1

              There was one actual page that threw a 500 at the time of crawl:

              http://www.gazetaexpress.com/mistere/edhe-kesaj-i-thuhet-veze-22591/

              The edhe kesaj page now resolves fine. (I'm not even going to pretend to understand or write Albanian.)

              So there may be some issues with the server or hosting. If you haven't already, try this troubleshooter from Cloudflare.
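A crawl like the one described above can be approximated for a short list of URLs with a few lines of Python. This is only a sketch under stated assumptions: it uses the standard library's `urllib`, the User-Agent string `status-check-sketch/0.1` is a made-up label, and the function names are hypothetical. It is not how Screaming Frog or any other crawler works internally.

```python
import urllib.error
import urllib.request
from collections import Counter


def fetch_status(url, timeout=10):
    """Return the HTTP status code for a GET request to url."""
    request = urllib.request.Request(
        url, headers={"User-Agent": "status-check-sketch/0.1"}  # made-up label
    )
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as error:
        # urllib raises on 4xx/5xx responses; the code is still what we want.
        return error.code


def tally_statuses(urls, fetch=fetch_status):
    """Map each URL to its status and count how often each status occurs."""
    results = {url: fetch(url) for url in urls}
    return results, Counter(results.values())


def server_errors(results):
    """Return the URLs that answered with a 5xx status."""
    return [url for url, status in results.items() if 500 <= status < 600]
```

Connection failures (timeouts, refused connections) raise `urllib.error.URLError` and are deliberately not caught here, because they are a different symptom from a 500 response and are worth seeing separately. Intermittent 500s, like the page above that later resolved fine, only show up if the check is repeated over time.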

              • granitgash
                granitgash @Travis_Bailey last edited by

                Hi Travis, thank you for your time.

                Glad your friend enjoyed it. I'd suggest you visit Kosovo someday too; you will have a great time here, for sure 🙂

                Back to the issue:

                Here is an interesting issue that is happening with the crawler.

                Our own CMS uses .htaccess for URL rewriting. I created two new files that are independent of the CMS and tried to fetch them with WMT (Webmaster Tools), and it worked like a charm.

                These two independent files are:

                www.gazetaexpress.com/test_manaferra.php

                www.gazetaexpress.com/xhezidja.php

                Then I created an AJAX page with our CMS that contains only plain text and tried to fetch it with WMT; strangely enough, it didn't work. To make sure that the .htaccess file was not causing this behavior, I deleted the .htaccess file and tried the fetch again, but it still didn't work.

                The ajax page is: www.gazetaexpress.com/page/xhezidja/?pageSEO=false

                The site works perfectly for humans who access it via a browser.

                I'm more than confused now!
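When a page works in a browser but not for a crawler, one quick check is to request the same URL with a browser-like User-Agent and with Googlebot's User-Agent string and compare the status codes. The sketch below does that with the standard library; the function names and the browser string are illustrative assumptions. It has a real limit: a fetch from Webmaster Tools comes from Google's own network, so a spoofed User-Agent sent from your own machine can only reveal rules keyed on the User-Agent, not rules keyed on IP address or reputation.

```python
import urllib.error
import urllib.request

# Illustrative browser string; any current browser User-Agent would do.
BROWSER_UA = (
    "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 "
    "(KHTML, like Gecko) Chrome/120.0 Safari/537.36"
)
GOOGLEBOT_UA = "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"


def status_for_agent(url, user_agent, timeout=10):
    """Return the HTTP status code when requesting url with the given User-Agent."""
    request = urllib.request.Request(url, headers={"User-Agent": user_agent})
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as error:
        return error.code


def compare_agents(url, fetch=status_for_agent):
    """Return the status seen by each User-Agent for the same URL."""
    return {
        "browser": fetch(url, BROWSER_UA),
        "googlebot": fetch(url, GOOGLEBOT_UA),
    }


def differs(statuses):
    """True when the two User-Agents received different status codes."""
    return statuses["browser"] != statuses["googlebot"]
```

If both requests succeed from your machine while the fetch in Webmaster Tools still fails, that points away from a User-Agent rule and toward something that treats Google's network differently.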

                • granitgash
                  granitgash last edited by

                  Hi all

                  Just wanted to let you know that we fixed the problem. We disabled CloudFlare, which we found out was blocking Googlebot. More about this issue can be found at: https://support.cloudflare.com/hc/en-us/articles/200169806-I-m-getting-Google-Crawler-Errors-What-should-I-do-
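The thread does not say which setting was responsible. For readers who would rather allow the crawler through than switch a proxy or firewall off entirely, it helps to confirm that a visitor claiming to be Googlebot really is one. Google's documented method is a reverse DNS lookup on the IP, a check that the hostname belongs to googlebot.com or google.com, and a forward lookup to confirm the hostname maps back to the same IP. A minimal sketch, with a hypothetical function name and injectable resolvers so it can be exercised without network access:

```python
import socket

# Hostname suffixes Google documents for its crawlers.
GOOGLE_CRAWLER_SUFFIXES = (".googlebot.com", ".google.com")


def is_verified_googlebot(
    ip, reverse=socket.gethostbyaddr, forward=socket.gethostbyname_ex
):
    """Verify an IPv4 address as Googlebot via reverse and forward DNS."""
    try:
        hostname = reverse(ip)[0]  # (hostname, aliases, addresses)
    except OSError:
        return False  # no reverse record, so it cannot be verified
    if not hostname.lower().endswith(GOOGLE_CRAWLER_SUFFIXES):
        return False
    try:
        addresses = forward(hostname)[2]  # IPv4 addresses for the hostname
    except OSError:
        return False
    # The forward lookup must lead back to the address we started with.
    return ip in addresses
```

The forward lookup matters because anyone who controls the reverse DNS for their own IP range can make it return a googlebot.com name; only the matching forward record proves the claim. `socket.gethostbyname_ex` handles IPv4 only, so IPv6 addresses would need `socket.getaddrinfo` instead.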

                  • KeriMorgret
                    KeriMorgret @granitgash last edited by

                    Great, thanks for letting us know what happened with this!

                    • Travis_Bailey
                      Travis_Bailey @KeriMorgret last edited by

                      This applies to the guy from Albania.

                      Oh, this IS the guy from Albania. Never mind.

                      • Travis_Bailey
                        Travis_Bailey @granitgash last edited by

                        What did you do specifically to mitigate the problem? You can PM me, if you would like.

                        © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.