The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Moz Pro
    4. Crawl Diagnostics - Crawling way more pages than my site has?

    Crawl Diagnostics - Crawling way more pages than my site has?

    Moz Pro
    5 3 410
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • LodestoneGen
      LodestoneGen last edited by

      Hello all,

      I'm fairly new here, more of a paid search guy dabbling in SEO on the side. I have a client that I have in SEOMoz and the Crawl Diagnostics report is showing 10,000+ pages crawled and I think the site has at most 800 pages (e-commerce site using freewebstore.org as the platform).

      Any reasons this would be happening?

      1 Reply Last reply Reply Quote 0
      • NakulGoyal
        NakulGoyal last edited by

        You can download the entire crawl and see if there's actually that many pages. Or post the URL here.

        You can also test using a crawling software tool like Xenu or Screaming Frog to test it.

        You can also post/private message the link here and I can take a look.

        1 Reply Last reply Reply Quote 1
        • DougRoberts
          DougRoberts last edited by

          I'm guessing that as an ecommerce site you've got multiple ways to browse your content, by category / brand / special offers etc. The thing to watch out for is interesting URLs with categories or lots of parameters.As a result, chances are you've got a duplicate content problem.

          As Nakul mentioned a good first step is to take a look at your crawl report or use one of the tools he mentioned to see if you've got the same content being indexed multiple times.

          Once you've done that, check is to see how many of these pages being crawled are appearing in Google's index. Is Google doing a reasonable job identifying the right version? How many pages are there in the index. Are recently added products being discovered quickly?

          The Site: operators will be your friend here and Dr Pete did a great article on ways you can use it.

          http://www.seomoz.org/blog/25-killer-combos-for-googles-site-operator

          Once you understand what is being crawled and what's making it to the index you need to decide what pages you really do want to be indexed and make sure that these become the canonical versions and block parts of your site using robots.txt. (But understand the problem and what you want to achieve before you start doing this.)

          Hope this helps.

          <object id="plugin0" style="position: absolute; z-index: 1000;" width="0" height="0" type="application/x-dgnria"><param name="tabId" value="ff-tab-10"> <param name="counter" value="138"></object>

          LodestoneGen 1 Reply Last reply Reply Quote 1
          • LodestoneGen
            LodestoneGen @DougRoberts last edited by

            Thanks to both of you. I will start to dig in to your suggested steps later today.

            I just took this one and they really don't have anything set-up. I just got them set-up on Webmaster tools as well so not even sure if they had their site indexed before.

            The Crawl Diagnostics doesn't show much duplicate content (60 pages?) but the Too Many On Page Links, Overly Dynamic URL, Duplicate Title, Long URL warnings are all showing 6000-10000 pages.

            The site sells crystals, each item is unique and as I did my first review they don't really even have item descriptions written let alone page titles and meta-descriptions.

            I am in analysis mode working up my comments in review and detailing an action plane to help them focus moving forward. I was just shocked by the 10,000 pages listed in one of the crawl warnings.

            anyway, I'll dig into this info and let you know what I find. It's an adventure!

            1 Reply Last reply Reply Quote 0
            • LodestoneGen
              LodestoneGen last edited by

              Ok - Here is an update. I found that it has a basketful of entries for each Category and I have a pretty good list of categories.

              Attached is an image showing what is happening in one category. There is an entry for each sort option which I understand where this is coming from (Sort Name, Sort Price Ascending, Sort Price Descending) what i don't understand are all the "rw=1" entries. And why they stack up like they do.

              Is this an issue? I am assuming it is because there seems to be no real reason for it.

              VH2Cjst

              1 Reply Last reply Reply Quote 0
              • 1 / 1
              • First post
                Last post
              • Why my site not crawl?
                jahanidawodi
                jahanidawodi
                0
                5
                68

              • What to do with a site of >50,000 pages vs. crawl limit?
                scienceisrad
                scienceisrad
                0
                5
                625

              • Pagination Issues on E-commerce Site: Duplicate Page Title and Content on Moz Crawl
                Dr-Pete
                Dr-Pete
                0
                3
                1.4k

              • Moz crawl only shows 2 pages, but we have more than 1000 pages.
                DavidLee
                DavidLee
                0
                8
                209

              • Crawled pages are missing and showing just 1 page crawled
                KeriMorgret
                KeriMorgret
                0
                3
                296

              • "Issue: Duplicate Page Content " in Crawl Diagnostics - but sample pages are not related to page indicated with duplicate content
                cbielich
                cbielich
                0
                2
                345

              • Only 1 page has been crawled. Why?
                tompollard
                tompollard
                0
                5
                557

              Get started with Moz Pro!

              Unlock the power of advanced SEO tools and data-driven insights.

              Start my free trial
              Products
              • Moz Pro
              • Moz Local
              • Moz API
              • Moz Data
              • STAT
              • Product Updates
              Moz Solutions
              • SMB Solutions
              • Agency Solutions
              • Enterprise Solutions
              • Digital Marketers
              Free SEO Tools
              • Domain Authority Checker
              • Link Explorer
              • Keyword Explorer
              • Competitive Research
              • Brand Authority Checker
              • Local Citation Checker
              • MozBar Extension
              • MozCast
              Resources
              • Blog
              • SEO Learning Center
              • Help Hub
              • Beginner's Guide to SEO
              • How-to Guides
              • Moz Academy
              • API Docs
              About Moz
              • About
              • Team
              • Careers
              • Contact
              Why Moz
              • Case Studies
              • Testimonials
              Get Involved
              • Become an Affiliate
              • MozCon
              • Webinars
              • Practical Marketer Series
              • MozPod
              Connect with us

              Contact the Help team

              Join our newsletter
              Moz logo
              © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
              • Accessibility
              • Terms of Use
              • Privacy