The Moz Q&A Forum


    Google can't access/crawl my site!

    Intermediate & Advanced SEO
    • granitgash
      granitgash @KeriMorgret last edited by

      No problem. Thanks a lot for your time. Let's just hope that someone in the community can help with a solution 🙂

      • Andy.Drinkwater
        Andy.Drinkwater last edited by

        Hi Granit,

        Has any work been done on the site in the last 2-3 months? Have you had any warnings in Webmaster Tools at all? I did once see a strange problem where Google wasn't crawling a site correctly because it had been compromised, but after checking, I found nothing like that on yours.

        -Andy

        • granitgash
          granitgash @Andy.Drinkwater last edited by

          In mid-March the website changed its CMS, but I don't think that could be the reason, because until this week everything was working perfectly. I don't think it has been compromised either. I still suspect the firewall could be blocking bots from crawling the site, but the server administrator couldn't find any evidence of this.

          • Andy.Drinkwater
            Andy.Drinkwater @granitgash last edited by

            It doesn't look like a firewall, as I can crawl it with Screaming Frog. However, the server logs will be able to answer that one for you.
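One way to ask the server logs that question is to count the status codes returned to requests that identify themselves as Googlebot. The following is only a minimal sketch: it assumes an Apache/nginx "combined" access log format, and the sample lines and the function name `googlebot_status_counts` are made up for illustration. If a proxy or firewall in front of the server is dropping the requests, they may never reach these logs at all, so a lack of Googlebot entries would itself be a clue.

```python
import re
from collections import Counter

# Assumed "combined" log format: ... "METHOD path protocol" status size "referer" "user agent"
LINE_RE = re.compile(
    r'"(?:GET|POST|HEAD) (?P<path>\S+) [^"]*" (?P<status>\d{3}) .*"(?P<agent>[^"]*)"$'
)

def googlebot_status_counts(lines):
    """Count HTTP status codes for requests whose user agent mentions Googlebot."""
    counts = Counter()
    for line in lines:
        match = LINE_RE.search(line.strip())
        if match and "Googlebot" in match.group("agent"):
            counts[match.group("status")] += 1
    return counts

# Hypothetical sample lines, not taken from the real site's logs.
sample = [
    '66.249.66.1 - - [23/Aug/2014:10:00:00 +0000] "GET / HTTP/1.1" 200 5120 "-" '
    '"Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"',
    '66.249.66.1 - - [23/Aug/2014:10:00:05 +0000] "GET /page/ HTTP/1.1" 403 312 "-" '
    '"Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"',
    '203.0.113.7 - - [23/Aug/2014:10:00:09 +0000] "GET / HTTP/1.1" 200 5120 "-" '
    '"Mozilla/5.0 (Windows NT 6.1)"',
]

print(dict(googlebot_status_counts(sample)))
```

To run it against a real log, pass an open file object instead of `sample`; the path to that file depends on the server setup.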

            Without looking in depth, I'm not seeing anything that stands out to me - do you think that there have been changes to the server that could cause issues? What firewall is the server running? Also, if there were errors in crawling the site, you would see a warning about this.

            -Andy

            • granitgash
              granitgash @Andy.Drinkwater last edited by

              We suspect that CloudFlare might be causing these problems. We are trying everything; in the meantime, I'm looking here to see if anyone has had a similar experience or has an idea for a solution.

              As for warnings, the only one we had was last week (8/23/14), saying that Googlebot can't access our site:

              Over the last 24 hours, Googlebot encountered 316 errors while attempting to connect to your site. Your site's overall connection failure rate is 7.5%.
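For a sense of scale, the two figures in that warning can be combined to estimate how many connection attempts Googlebot made. This is a rough sketch that assumes the failure rate is simply errors divided by attempts; the warning does not say how Google computes it, and the function name is made up for illustration.

```python
def estimated_attempts(errors, failure_rate):
    """Estimate total connection attempts, assuming failure_rate = errors / attempts."""
    return errors / failure_rate

attempts = estimated_attempts(316, 0.075)
print(round(attempts))        # roughly 4213 attempts in the 24-hour window
print(round(attempts) - 316)  # roughly 3897 of them succeeded
```

Under that assumption, most of Googlebot's requests were still getting through, which fits an intermittent block better than a complete one.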

              -Granit

              • Andy.Drinkwater
                Andy.Drinkwater @granitgash last edited by

                Ah OK - well keep us updated with what you find. Someone else will chip in with other info if they have some 🙂

                -Andy

                • Travis_Bailey
                  Travis_Bailey last edited by

                  A friend of mine just got back from Kosovo. It was the last stop on a tour of the Balkans. He had a pretty good time. Moving along...

                  I crawled about 12K URLs and hit almost 90 Internal Server Errors (500). It's probably not your core problem, but it's something to look at. Here are a few examples:

                  http://www.gazetaexpress.com/blihet/?search_category_id=1&searchFilter=1

                  http://www.gazetaexpress.com/shitet/?category_id=134&searchFilter=1

                  http://www.gazetaexpress.com/me-qera/?category_id=131&searchFilter=1

                  There was one actual page that threw a 500 at the time of crawl:

                  http://www.gazetaexpress.com/mistere/edhe-kesaj-i-thuhet-veze-22591/

                  The edhe kesaj page now resolves fine. (I'm not even going to pretend to understand or write Albanian.)

                  So there may be some issues with the server or hosting. If you haven't already, try this troubleshooter from Cloudflare.

                  • granitgash
                    granitgash @Travis_Bailey last edited by

                    Hi Travis, thank you for your time.

                    Glad your friend enjoyed it. I suggest you visit Kosovo someday too; you will have a great time here, for sure 🙂

                    Back to the issue:

                    Here is something interesting that is happening with the crawler.

                    Our own CMS uses .htaccess for rewrite purposes. I created 2 new files that are independent of the CMS and tried to fetch them with WMT, and it worked like a charm.

                    These 2 independent files are:

                    www.gazetaexpress.com/test_manaferra.php

                    www.gazetaexpress.com/xhezidja.php

                    Then I created an AJAX page with our CMS, containing only plain text, and tried to fetch it with WMT. Strangely enough, it didn't work. To make sure that the .htaccess file was not affecting this behavior, I deleted the .htaccess file and tried to fetch the page again, but it still didn't work.

                    The ajax page is: www.gazetaexpress.com/page/xhezidja/?pageSEO=false

                    The site works perfectly for humans who access it via the browser.
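A quick way to compare how a URL responds to a browser and to a crawler is to request it twice with different User-Agent headers. The sketch below is only an approximation of the WMT fetch: it tests user-agent rules, not blocks based on Google's IP addresses, since the requests come from your own machine. The browser user-agent string and the function names are illustrative assumptions.

```python
import urllib.error
import urllib.request

GOOGLEBOT_UA = "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"
BROWSER_UA = "Mozilla/5.0 (Windows NT 6.1; rv:31.0) Gecko/20100101 Firefox/31.0"

def build_request(url, user_agent):
    """Build a GET request that presents the given User-Agent header."""
    return urllib.request.Request(url, headers={"User-Agent": user_agent})

def status_for(url, user_agent, timeout=10):
    """Return the HTTP status code, treating 4xx/5xx responses as results, not crashes."""
    try:
        with urllib.request.urlopen(build_request(url, user_agent), timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as error:
        return error.code

def compare_agents(url, status_fn=status_for):
    """Fetch the same URL as a crawler and as a browser and return both status codes."""
    return {
        "googlebot": status_fn(url, GOOGLEBOT_UA),
        "browser": status_fn(url, BROWSER_UA),
    }

# Example (performs real network requests, so it is left commented out):
# print(compare_agents("http://www.gazetaexpress.com/page/xhezidja/?pageSEO=false"))
```

If the two status codes differ, something between the client and the CMS is treating crawler user agents differently; if they match, the block is more likely based on IP address or request rate.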

                    I'm more than confused now!


                    • granitgash
                      granitgash last edited by

                      Hi all

                      Just wanted to let you know that we fixed the problem. We disabled CloudFlare, which we found out was blocking Googlebot. More about this issue can be found at: https://support.cloudflare.com/hc/en-us/articles/200169806-I-m-getting-Google-Crawler-Errors-What-should-I-do-
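For anyone who wants to keep a firewall or proxy in place and allow only genuine Googlebot traffic, Google's documented check is a reverse DNS lookup followed by a forward confirmation. The sketch below is a minimal version of that check, not what was done on this site; the function name and the injectable lookup parameters are assumptions added so the logic can be tried without network access.

```python
import socket

# Hostname suffixes Google documents for its crawlers.
GOOGLE_SUFFIXES = (".googlebot.com", ".google.com")

def is_verified_googlebot(ip,
                          reverse_lookup=socket.gethostbyaddr,
                          forward_lookup=socket.gethostbyname_ex):
    """Return True if ip reverse-resolves to a Google hostname that resolves back to ip."""
    try:
        hostname = reverse_lookup(ip)[0]
        if not hostname.endswith(GOOGLE_SUFFIXES):
            return False
        # Forward-confirm: the hostname must map back to the original address.
        return ip in forward_lookup(hostname)[2]
    except OSError:
        # Covers failed lookups (socket.herror and socket.gaierror are OSError subclasses).
        return False

# Example (performs real DNS lookups, so it is left commented out):
# print(is_verified_googlebot("66.249.66.1"))
```

A request that claims to be Googlebot in its user agent but fails this check is not coming from Google, so it can be blocked without affecting crawling.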

                      • KeriMorgret
                        KeriMorgret @granitgash last edited by

                        Great, thanks for letting us know what happened with this!

                        • Travis_Bailey
                          Travis_Bailey @KeriMorgret last edited by

                          This applies to the guy from Albania.

                          Oh, this IS the guy from Albania. Never mind.

                          • Travis_Bailey
                            Travis_Bailey @granitgash last edited by

                            What did you do specifically to mitigate the problem? You can PM me, if you would like.
