The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Intermediate & Advanced SEO
    4. Can't crawl website with Screaming frog... what is wrong?

    Can't crawl website with Screaming frog... what is wrong?

    Intermediate & Advanced SEO
    3 3 3.8k
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • McTaggart
      McTaggart last edited by

      Hello all - I've just been trying to crawl a site with Screaming Frog and can't get beyond the homepage - have done the usual stuff (turn off JS and so on) and no problems there with nav and so on- the site's other pages have indexed in Google btw.

      Now I'm wondering whether there's a problem with this robots.txt file, which I think may be auto-generated by Joomla (I'm not familiar with Joomla...) - are there any issues here? [just checked... and there isn't!]

      If the Joomla site is installed within a folder such as at

      e.g. www.example.com/joomla/ the robots.txt file MUST be

      moved to the site root at e.g. www.example.com/robots.txt

      AND the joomla folder name MUST be prefixed to the disallowed

      path, e.g. the Disallow rule for the /administrator/ folder

      MUST be changed to read Disallow: /joomla/administrator/

      For more information about the robots.txt standard, see:

      http://www.robotstxt.org/orig.html

      For syntax checking, see:

      http://tool.motoricerca.info/robots-checker.phtml

      User-agent: *
      Disallow: /administrator/
      Disallow: /bin/
      Disallow: /cache/
      Disallow: /cli/
      Disallow: /components/
      Disallow: /includes/
      Disallow: /installation/
      Disallow: /language/
      Disallow: /layouts/
      Disallow: /libraries/
      Disallow: /logs/
      Disallow: /modules/
      Disallow: /plugins/
      Disallow: /tmp/

      1 Reply Last reply Reply Quote 0
      • EcommerceSite
        EcommerceSite last edited by

        This is the best I could find to so someone who had a similar problem with Joomla-

        "In the premium version you can slow down the crawl rate under 'speed' in the configuration. In the free lite version, you can crawl the site and then right click on any URLs with a 403 response and press 're-spider'. The server will generally then allow you to crawl these pages (and return a 200 ok response) as you're not requesting too many at once, so you might have to re-spider them individually."

        1 Reply Last reply Reply Quote 2
        • Singularitie
          Singularitie last edited by

          For anyone wondering; The answer above by Ecommerce Site (odd name btw) works - 21-Nov-2016.

          1 Reply Last reply Reply Quote 1
          • 1 / 1
          • First post
            Last post
          • Crawl and Indexation Error - Googlebot can't/doesn't access specific folders on microsites
            Everett
            Everett
            0
            2
            87

          • Why doesn't my website crawl by Google?
            LoganRay
            LoganRay
            0
            8
            82

          • Why some websites can rank the keywords they don't have in the page?
            TheSymmetran
            TheSymmetran
            0
            6
            115

          • Weird rankings on my website, can't figure it out
            evolvingSEO
            evolvingSEO
            0
            4
            117

          • After Receiving a "Googlebot can't access your site" would this stop your site from being crawled?
            evolvingSEO
            evolvingSEO
            0
            4
            394

          • How can Google index a page that it can't crawl completely?
            OlegKorneitchouk
            OlegKorneitchouk
            0
            4
            75

          • How can I tell if a website is a 'NoFollow'?
            Paul_Tovey
            Paul_Tovey
            0
            4
            8.5k

          • I can't help but think something is wrong with my SEO
            RobertFisher
            RobertFisher
            0
            6
            429

          Get started with Moz Pro!

          Unlock the power of advanced SEO tools and data-driven insights.

          Start my free trial
          Products
          • Moz Pro
          • Moz Local
          • Moz API
          • Moz Data
          • STAT
          • Product Updates
          Moz Solutions
          • SMB Solutions
          • Agency Solutions
          • Enterprise Solutions
          • Digital Marketers
          Free SEO Tools
          • Domain Authority Checker
          • Link Explorer
          • Keyword Explorer
          • Competitive Research
          • Brand Authority Checker
          • Local Citation Checker
          • MozBar Extension
          • MozCast
          Resources
          • Blog
          • SEO Learning Center
          • Help Hub
          • Beginner's Guide to SEO
          • How-to Guides
          • Moz Academy
          • API Docs
          About Moz
          • About
          • Team
          • Careers
          • Contact
          Why Moz
          • Case Studies
          • Testimonials
          Get Involved
          • Become an Affiliate
          • MozCon
          • Webinars
          • Practical Marketer Series
          • MozPod
          Connect with us

          Contact the Help team

          Join our newsletter
          Moz logo
          © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
          • Accessibility
          • Terms of Use
          • Privacy