The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Moz Tools
    4. Crawl Diagnostics 403 on home page...

    Crawl Diagnostics 403 on home page...

    Moz Tools
    9 3 420
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • martJ
      martJ last edited by

      In the crawl diagnostics it says oursite.com/ has a 403. doesn't say what's causing it but mentions no robots.txt. There is a robots.txt and I see no problems. How can I find out more information about this error?

      1 Reply Last reply Reply Quote 0
      • jesse-landry
        jesse-landry last edited by

        a 403 is a Forbidden code usually pertaining to Security and Permissions.

        Are you running your server in an Apache or IIS environment? Robots.txt shouldn't affect a site's visibility to the public it only talks to site crawlers.

        martJ 1 Reply Last reply Reply Quote 0
        • martJ
          martJ @jesse-landry last edited by

          apache

          1 Reply Last reply Reply Quote 0
          • jesse-landry
            jesse-landry last edited by

            OH this is only in SEOmoz's crawl diagnostics that you're seeing this error. That explains why robots.txt could be affecting it. I misread this earlier and thought you were finding the 403 on your own in-browser.

            Can you paste the robots.txt file into here so we can see it? I would imagine that has everything to do with it now that I've correctly read your post --my apologies

            martJ 1 Reply Last reply Reply Quote 0
            • martJ
              martJ @jesse-landry last edited by

              No problem. Looking at my Google WM Tools , crawl stats don't show any errors.

              Thanks

              User-Agent: *
              Disallow: /*?zenid=
              Disallow: /editors/
              Disallow: /email/
              Disallow: /googlecheckout/
              Disallow: /includes/
              Disallow: /js/
              Disallow: /manuals/

              1 Reply Last reply Reply Quote 0
              • jesse-landry
                jesse-landry last edited by

                hmmm... not sure why this is happening. maybe add this line to the top of your robots.txt and see if it fixes by next week. it certainly won't hurt anything:

                User-agent: *
                Allow: /
                
                
                martJ 1 Reply Last reply Reply Quote 0
                • martJ
                  martJ @jesse-landry last edited by

                  I think I do. I just (a few minutes ago) went through a 403 problem being reported by another site trying access an html file for verification. Apparently they are connecting with an ip that's blocked by our htaccess. I removed the blocks told them to try again and it worked no problem. I see that SEOMoz has only crawled 1 page. Off to see if I can trigger a re-crawl now...

                  1 Reply Last reply Reply Quote 0
                  • martJ
                    martJ last edited by

                    Okay, so I couldn't find this thread and started a new one. Sorry...

                    ... The problem persists.

                    RECAP

                    I have two blocks in my htaccess both are for amazonaws.com.

                    I have gone over our server block logs and see only amazon addresses and bot names.

                    I did a fetch as google with our WM Tools and fetch it did. Success!

                    Why isn't thiscrawler able to access? Many other bots are crawling right now.

                    Why can I use the seomoz on-page feature to crawl a single page but the automatic crawler wont access the site? Just took a break from typing this to try the on-page on our robots.txt, worked fine. Use the keyword "Disallow" and it gave me a C. =0)

                    ... now if we could just crawl the rest of the site...

                    any help on this would be greatly appreciated.

                    1 Reply Last reply Reply Quote 0
                    • ChiarynMiranda
                      ChiarynMiranda last edited by

                      Hi Dana,

                      Thanks for writing in. The robots.txt file would not cause a 403 error. That type of error is actually related to the way the server responds to our crawler. Basically, this means the server for the site is telling our crawler that we are not allowed to access the site. Here is a resource that explains the 403 http status code pretty thoroughly: http://pcsupport.about.com/od/findbyerrormessage/a/403error.htm

                      I looked at both of the campaigns on your account and I am not seeing a 403 error for either site, though I do see a couple of 404 page not found errors on one of the campaigns, which is a different issue.

                      If you are still seeing the 403 error message on one of your crawls, you would just need to have the webmaster update the server to allow rogerbot to access the site.

                      I hope this helps. Please let me know if you have any other questions.

                      -Chiaryn

                      1 Reply Last reply Reply Quote 1
                      • 1 / 1
                      • First post
                        Last post
                      • Have a Campaign, but only states 1 page has been crawled by SEOmoz bots. What needs to be done to have all the pages crawled?
                        Johnny4B
                        Johnny4B
                        0
                        4
                        147

                      • Only few pages (308 pages of 1000 something pages) have been crawled and diagnosed in 4 days, how many days till the entire website is crawled complete?
                        DarinPirkey
                        DarinPirkey
                        0
                        4
                        288

                      • I've got quite a few "Duplicate Page Title" Errors in my Crawl Diagnostics for my Wordpress Blog
                        Devanur-Rafi
                        Devanur-Rafi
                        0
                        2
                        292

                      • Crawl Disgnosis only crawling 250 pages not 10,000
                        kenneth_martin
                        kenneth_martin
                        0
                        7
                        409

                      • Is it possible to exclude pages from Crawl Diagnostic?
                        GCSMasone
                        GCSMasone
                        1
                        2
                        425

                      • Too Many On-Page Links: Crawl Diag vs On-Page
                        Dryope
                        Dryope
                        0
                        3
                        481

                      • Crawl Diagnostics and missing meta tags on noindex blog pages
                        ShaMenz
                        ShaMenz
                        0
                        2
                        835

                      • My website has 18500 pages but my SEO MOZ campaign is limited to a 10,000 page crawl. How can I get the other 8500 pages crawled? Can I use one of my 3 spare campaigns?
                        kenneth_martin
                        kenneth_martin
                        0
                        5
                        1.0k

                      Get started with Moz Pro!

                      Unlock the power of advanced SEO tools and data-driven insights.

                      Start my free trial
                      Products
                      • Moz Pro
                      • Moz Local
                      • Moz API
                      • Moz Data
                      • STAT
                      • Product Updates
                      Moz Solutions
                      • SMB Solutions
                      • Agency Solutions
                      • Enterprise Solutions
                      • Digital Marketers
                      Free SEO Tools
                      • Domain Authority Checker
                      • Link Explorer
                      • Keyword Explorer
                      • Competitive Research
                      • Brand Authority Checker
                      • Local Citation Checker
                      • MozBar Extension
                      • MozCast
                      Resources
                      • Blog
                      • SEO Learning Center
                      • Help Hub
                      • Beginner's Guide to SEO
                      • How-to Guides
                      • Moz Academy
                      • API Docs
                      About Moz
                      • About
                      • Team
                      • Careers
                      • Contact
                      Why Moz
                      • Case Studies
                      • Testimonials
                      Get Involved
                      • Become an Affiliate
                      • MozCon
                      • Webinars
                      • Practical Marketer Series
                      • MozPod
                      Connect with us

                      Contact the Help team

                      Join our newsletter
                      Moz logo
                      © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
                      • Accessibility
                      • Terms of Use
                      • Privacy