The Moz Q&A Forum

    Roger bot taking a long time to crawl site

    Technical SEO Issues
    • caterfor
      caterfor last edited by

      Hi all, I've noticed Roger bot is taking a long time to crawl my new site. It started on the 28th Feb 2013 and is still going. There aren't many pages at the moment. Any ideas please?

      thanks a lot, Mark.

      • Mike.Goracke
        Mike.Goracke last edited by

        Hi Mark,

        This sounds like a bug or issue with the SEOmoz software.

        Contact help@seomoz.org and ask one of the help associates to look into this for you.

        If you do not have many pages, it definitely shouldn't take that long.

        The help team responds extremely quickly!

        Good luck.

        Mike

        • Peterli
          Peterli last edited by

          Hi Mark,

          Sorry it's taking a while to crawl your new site.

          While I'm not exactly sure what is causing the delay, one possible reason is your robots.txt. Here's a short snippet of what I see in it:

          # Crawlers Setup
          User-agent: *
          Crawl-delay: 30
          # Allowable Index
          Allow: /*?p=
          Allow: /index.php/blog/
          Allow: /catalog/seo_sitemap/category/
          Allow: /catalogsearch/result/
          Allow: /media/
          # Directories
          Disallow: /404/
          Disallow: /app/
          Disallow: /cgi-bin/
          Disallow: /downloader/
          Disallow: /errors/
          Disallow: /includes/
          Disallow: /js/
          Disallow: /lib/
          Disallow: /magento/
          Disallow: /pkginfo/
          Disallow: /report/
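
A side note on the `Crawl-delay: 30` line in the snippet above: a crawler that honors this directive waits that many seconds between requests, so the minimum crawl time grows linearly with the number of pages. The sketch below is only a rough lower-bound estimate. The page counts are made-up examples, it ignores fetch and processing time, and it does not describe how Roger bot actually schedules its requests.

```python
def estimated_crawl_seconds(page_count: int, crawl_delay: float) -> float:
    """Lower bound on crawl time: one delay between each pair of consecutive requests."""
    if page_count <= 0:
        return 0.0
    return (page_count - 1) * crawl_delay


# Hypothetical site sizes, all using the 30-second delay from the snippet.
for pages in (100, 1000, 5000):
    hours = estimated_crawl_seconds(pages, 30) / 3600
    print(f"{pages} pages at Crawl-delay 30: at least {hours:.1f} hours")
```

Even a modest site can take many hours at this setting, so the delay is worth keeping in mind when a crawl seems slow.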
          
          From here, the formatting looks a little awkward. What's going on is that you're telling Roger bot to only look at these:
          
          

          # Allowable Index

          Allow: /*?p=
          Allow: /index.php/blog/
          Allow: /catalog/seo_sitemap/category/
          Allow: /catalogsearch/result/
          Allow: /media/

          While the syntax is OK, not every crawler out there will follow the Allow directive. Here's an example of something you can use:

          # Crawlers Setup
          User-agent: *
          Crawl-delay: 30
          Disallow: /404/
          Disallow: /app/
          Disallow: /cgi-bin/
          Disallow: /downloader/
          Disallow: /errors/
          Disallow: /includes/
          Disallow: /js/
          
          Here you're telling the crawler to disallow nothing except these directories. Once you've implemented this, please let us know whether it actually fixes the crawl.
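
One way to sanity-check a rule set like this before uploading it is Python's standard-library `urllib.robotparser`. The sketch below feeds it the per-directory rules (crawl delay plus `Disallow` lines for specific directories, with no blanket `Disallow: /`) and asks which paths a crawler may fetch. The host name and the test paths are placeholders, and this parser is just one implementation of the robots.txt conventions; Roger bot and other crawlers may interpret edge cases differently.

```python
from urllib.robotparser import RobotFileParser

# Rules under test: a crawl delay plus per-directory Disallow lines only.
ROBOTS_TXT = """\
User-agent: *
Crawl-delay: 30
Disallow: /404/
Disallow: /app/
Disallow: /cgi-bin/
Disallow: /downloader/
Disallow: /errors/
Disallow: /includes/
Disallow: /js/
"""


def load_rules(text: str) -> RobotFileParser:
    """Parse robots.txt content held in a string instead of fetching it over HTTP."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


rules = load_rules(ROBOTS_TXT)
BASE = "http://www.example.com"  # placeholder host, not the real site

# Illustrative paths: two that should stay crawlable, two under disallowed directories.
for path in ("/", "/index.php/blog/", "/app/etc/local.xml", "/js/"):
    verdict = "allowed" if rules.can_fetch("rogerbot", BASE + path) else "blocked"
    print(f"{path}: {verdict}")

# The delay (in seconds) that applies to this user agent, if any.
print("Crawl delay:", rules.crawl_delay("rogerbot"))
```

If the home page ever shows up as blocked in a check like this, a blanket `Disallow: /` has probably crept back into the file.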
          
          Thanks for reaching out!
          
          Best,
          
          Peter Li
          SEOmoz Help Team
          • caterfor
            caterfor @Mike.Goracke last edited by

            Hi Mike

            The crawl has now completed, thank you. I think the results will keep me occupied 🙂

            all the best, Mark.

            • caterfor
              caterfor @Peterli last edited by

              Hi Peter

              thanks for your reply. The crawl has now completed and given me some more areas to work on, it's a great tool.

              I was so preoccupied with 'hiding' the site over the last couple of months with the easy code:

              User-agent: *
              Disallow: /
              
              

              I hadn't thought beyond this.

              I've noticed Google has now recognised the new robots.txt, which has allowed the sitemap to be accepted.

              I'll look at your notes, thank you, and work out my next move. I'll let you know how I get on too.

              I know (well think) I have to get noindex, follow for 'sorted' category pages...
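
For what it's worth, the "noindex, follow for sorted pages" idea could look something like the sketch below: decide per URL whether the page is a sorted view and emit the matching robots meta tag. The query parameter names (`dir`, `order`) and the example URLs are assumptions for illustration only; the parameters a given store actually emits would need checking first.

```python
from urllib.parse import parse_qs, urlparse

# Hypothetical sort-related query parameters; verify against the real category URLs.
SORT_PARAMS = {"dir", "order"}


def robots_meta(url: str) -> str:
    """Return a robots meta tag: noindex for sorted views, index otherwise."""
    query = parse_qs(urlparse(url).query, keep_blank_values=True)
    is_sorted = any(name in SORT_PARAMS for name in query)
    content = "noindex, follow" if is_sorted else "index, follow"
    return f'<meta name="robots" content="{content}">'


# A sorted category view versus the plain category page (example URLs only).
print(robots_meta("http://www.example.com/category.html?dir=asc&order=price"))
print(robots_meta("http://www.example.com/category.html"))
```

Note that a crawler can only see a meta tag on a page it is allowed to fetch, so these sorted URLs would have to stay out of the robots.txt Disallow list for the tag to have any effect.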

              all the best, Mark.
