The Moz Q&A Forum


    Roger bot taking a long time to crawl site

    • caterfor
      caterfor last edited by

      Hi all, I've noticed Roger bot is taking a long time to crawl my new site. It started on the 28th Feb 2013 and is still going. There aren't many pages at the moment. Any ideas please?

      thanks a lot, Mark.

      • Mike.Goracke
        Mike.Goracke last edited by

        Hi Mark,

        This sounds like a bug or issue with the SEOmoz software.

        Contact help@seomoz.org and ask one of the help associates to look into this for you.

        If you do not have many pages, it definitely shouldn't take that long.

        The help team responds extremely quickly!

        Good luck.

        Mike

        • Peterli
          Peterli last edited by

          Hi Mark,

          Sorry it's taking a while to crawl your new site.

          While I'm not exactly sure what is causing the delay, one possible reason is your robots.txt. Here's a short snippet of what I see in it:

          # Crawlers Setup
          User-agent: *
          Crawl-delay: 30
          # Allowable Index
          Allow: /*?p=
          Allow: /index.php/blog/
          Allow: /catalog/seo_sitemap/category/
          Allow: /catalogsearch/result/
          Allow: /media/
          # Directories
          Disallow: /404/
          Disallow: /app/
          Disallow: /cgi-bin/
          Disallow: /downloader/
          Disallow: /errors/
          Disallow: /includes/
          Disallow: /js/
          Disallow: /lib/
          Disallow: /magento/
          Disallow: /pkginfo/
          Disallow: /report/
          
          From here, the formatting looks a little awkward. What's going on is that you're telling Roger bot to only look at these:

          # Allowable Index
          Allow: /*?p=
          Allow: /index.php/blog/
          Allow: /catalog/seo_sitemap/category/
          Allow: /catalogsearch/result/
          Allow: /media/

          While the syntax is OK, not every crawler out there will follow the Allow directive. Here's an example of something you can use.

          # Crawlers Setup
          User-agent: *
          Crawl-delay: 30
          # Directories
          Disallow: /404/
          Disallow: /app/
          Disallow: /cgi-bin/
          Disallow: /downloader/
          Disallow: /errors/
          Disallow: /includes/
          Disallow: /js/
          
          From here you're telling the crawler to disallow nothing except these directories. Once you've implemented this, please let us know whether it actually fixes the crawl.
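
A quick way to sanity-check a rule set like this before uploading it is to run it through a parser locally. The sketch below uses Python's standard `urllib.robotparser`. The user-agent string `rogerbot`, the sample paths, and the page count are assumptions for illustration only, and the rule set is written without any blanket `Disallow: /` line. The parser matches plain path prefixes, so it will not reflect how an individual crawler treats wildcards or the Allow directive.

```python
from urllib.robotparser import RobotFileParser

# Rule set modelled on the suggestion above, with no blanket "Disallow: /" line.
SUGGESTED_RULES = """\
User-agent: *
Crawl-delay: 30
Disallow: /404/
Disallow: /app/
Disallow: /cgi-bin/
Disallow: /downloader/
Disallow: /errors/
Disallow: /includes/
Disallow: /js/
"""


def load_rules(text):
    """Parse robots.txt text locally, without fetching anything."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def blocked_paths(parser, agent, paths):
    """Return the paths the given user agent may not fetch."""
    return [path for path in paths if not parser.can_fetch(agent, path)]


def estimated_wait_seconds(parser, agent, page_count):
    """Rough total wait implied by Crawl-delay, assuming one delay per page
    and a crawler that honours the directive."""
    delay = parser.crawl_delay(agent) or 0
    return delay * page_count


if __name__ == "__main__":
    rules = load_rules(SUGGESTED_RULES)
    # Hypothetical paths; replace with real URLs from the site.
    sample = ["/", "/media/logo.png", "/js/app.js", "/404/"]
    print("Blocked:", blocked_paths(rules, "rogerbot", sample))
    # 200 pages is a made-up figure for illustration.
    print("Wait (s):", estimated_wait_seconds(rules, "rogerbot", 200))
```

The wait estimate only accounts for the pause between requests, so treat it as a lower bound on crawl time rather than a prediction.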
          
          Thanks for reaching out!
          
          Best,
          
          Peter Li
          SEOmoz Help Team
          • caterfor
            caterfor @Mike.Goracke last edited by

            Hi Mike

            The crawl has now completed, thank you. I think the results will keep me occupied 🙂

            all the best, Mark.

            • caterfor
              caterfor @Peterli last edited by

              Hi Peter

              Thanks for your reply. The crawl has now completed and given me some more areas to work on; it's a great tool.

              I was so preoccupied with 'hiding' the site over the last couple of months with the easy code:

              User-agent: *
              Disallow: /
              
              

              I hadn't thought beyond this.
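
For anyone checking whether a site is still "hidden" in this way, here is a minimal sketch using Python's standard `urllib.robotparser`. The user-agent name `rogerbot` and the sample paths are assumptions for illustration. It only reports what a compliant crawler would be told by the two-line file, not what any search engine has actually indexed.

```python
from urllib.robotparser import RobotFileParser

# The two-line "hide everything" file quoted above.
HIDE_ALL = ["User-agent: *", "Disallow: /"]


def is_fully_blocked(robots_lines, agent, paths):
    """True if every given path is disallowed for the given user agent."""
    parser = RobotFileParser()
    parser.parse(robots_lines)
    return all(not parser.can_fetch(agent, path) for path in paths)


if __name__ == "__main__":
    # Hypothetical paths; any path on the site should behave the same way.
    print(is_fully_blocked(HIDE_ALL, "rogerbot", ["/", "/media/logo.png"]))
```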

              I've noticed Google has now recognised the new robots.txt, which has allowed the sitemap to be accepted.

              I'll look at your notes, thank you, and work out my next move. I'll let you know how I get on too.

              I know (well think) I have to get noindex, follow for 'sorted' category pages...

              all the best, Mark.

              © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.