The Moz Q&A Forum


    Big discrepancies between pages in Google's index and pages in sitemap

    Intermediate & Advanced SEO
    • Digirank
      Digirank last edited by

      Hi,

      I'm noticing a huge difference between the number of pages in Google's index (using a 'site:' search) and the number of pages reported in Webmaster Tools (i.e. 20,600 in the 'site:' search vs. 5,100 submitted via the dynamic sitemap).

      Does anyone know possible causes for this and how I can fix it?

      It's an ecommerce site, but I can't see any issues with duplicate content; they employ a very good canonical tag strategy. Could it be that Google has decided to ignore the canonical tag?

      Any help appreciated,

      Karen
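
One way to narrow down a gap like this is to list the indexed URLs that the sitemap does not contain. The sketch below is only an illustration: it assumes you have the sitemap XML as text and a list of indexed URLs collected by hand, and the sample URLs and the names `sitemap_urls` and `compare` are hypothetical.

```python
import xml.etree.ElementTree as ET

# Namespace used by the sitemaps.org protocol.
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def sitemap_urls(xml_text):
    """Return the set of <loc> URLs listed in a sitemap document."""
    root = ET.fromstring(xml_text)
    return {
        loc.text.strip()
        for loc in root.findall(".//sm:loc", SITEMAP_NS)
        if loc.text
    }


def compare(sitemap_xml, indexed_urls):
    """Report URLs that appear on only one side of the comparison."""
    listed = sitemap_urls(sitemap_xml)
    indexed = set(indexed_urls)
    return {
        "indexed_not_in_sitemap": sorted(indexed - listed),
        "in_sitemap_not_indexed": sorted(listed - indexed),
    }


# Hypothetical sample data, not taken from the site discussed here.
SAMPLE_SITEMAP = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://www.example.com/widgets/blue</loc></url>
  <url><loc>https://www.example.com/widgets/red</loc></url>
</urlset>"""

SAMPLE_INDEXED = [
    "https://www.example.com/widgets/blue",
    "https://www.example.com/widgets/blue?sort=price",
    "https://www.example.com/search?q=widgets",
]

if __name__ == "__main__":
    for label, urls in compare(SAMPLE_SITEMAP, SAMPLE_INDEXED).items():
        print(label, urls)
```

The URLs under `indexed_not_in_sitemap` are the ones worth inspecting first, since they hint at parameterised or site-search pages that were never meant to be submitted.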

      • Humix
        Humix last edited by

        Are you using just one sitemap or multiple?

        • Digirank
          Digirank @Humix last edited by

          Just 1 at the moment.

          • Humix
            Humix @Digirank last edited by

            Sorry, I misinterpreted your question. Have you excluded the site search pages using robots.txt? If not, this might be the reason why you have that many pages indexed.

            Anyway, this discussion might give you more answers:
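
To check whether a robots.txt file already excludes site search pages, something like the following could be used. It is a rough sketch using Python's standard `urllib.robotparser`; the `/search` path, the sample rules and the helper name `is_crawlable` are assumptions for illustration, so substitute whatever path your cart or CMS uses for search results.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt that blocks a site-search path for all crawlers.
ROBOTS_TXT = """\
User-agent: *
Disallow: /search
"""


def is_crawlable(robots_txt, url, user_agent="Googlebot"):
    """Return True if the given robots.txt text lets user_agent fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for url in (
        "https://www.example.com/search?q=widgets",
        "https://www.example.com/widgets/blue",
    ):
        print(url, "crawlable" if is_crawlable(ROBOTS_TXT, url) else "blocked")
```

Keep in mind that a Disallow rule stops crawling rather than guaranteeing removal from the index, so URLs that are already indexed may take a while to drop out.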

            • Digirank
              Digirank @Humix last edited by

              Hi,

              Thanks so much for that, it's really interesting.

              I've resolved the issue now. It turned out that some (a lot!) of the canonical tags were missing. Phew!

              Thanks for your help!
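
For anyone hitting the same problem, missing canonical tags can be spotted with a small script. This is a minimal sketch, assuming you already have the HTML of each page as a string; the class `CanonicalFinder`, the helper names and the sample pages are hypothetical.

```python
from html.parser import HTMLParser


class CanonicalFinder(HTMLParser):
    """Collect the href of every <link rel="canonical"> tag in a page."""

    def __init__(self):
        super().__init__()
        self.canonicals = []

    def handle_starttag(self, tag, attrs):
        if tag != "link":
            return
        attr = dict(attrs)
        rel_values = (attr.get("rel") or "").lower().split()
        if "canonical" in rel_values and attr.get("href"):
            self.canonicals.append(attr["href"])


def find_canonical(html):
    """Return the first canonical URL declared in html, or None."""
    finder = CanonicalFinder()
    finder.feed(html)
    return finder.canonicals[0] if finder.canonicals else None


def pages_missing_canonical(pages):
    """pages maps URL -> HTML source; return the URLs with no canonical tag."""
    return sorted(url for url, html in pages.items() if find_canonical(html) is None)


# Hypothetical sample pages.
SAMPLE_PAGES = {
    "https://www.example.com/widgets/blue": (
        '<html><head><link rel="canonical" '
        'href="https://www.example.com/widgets/blue" /></head></html>'
    ),
    "https://www.example.com/widgets/blue?sort=price": (
        "<html><head><title>Blue widgets</title></head></html>"
    ),
}

if __name__ == "__main__":
    print(pages_missing_canonical(SAMPLE_PAGES))
```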

              • David-Kley
                David-Kley last edited by

                Take a look at the pages that are indexed. Since it is a cart- or CMS-based site, chances are you just need to use robots.txt to block some areas you don't want indexed. You also need to check whether any of the indexed pages are duplicates, meaning two or more URLs that display the same content.

                "It's an ecommerce site but I can't see any issues with duplicate content - they employ a very good canonical tag strategy. Could it be that Google has decided to ignore the canonical tag?"

                It could be that your CMS or cart is not forwarding all the pages to the canonical version. Again, check whether you can access multiple versions of the same page. Ecommerce and CMS sites tend to have these types of errors if you don't keep a close eye on the URLs, since they are database-driven rather than static HTML. Look for www and non-www versions of pages, URLs with and without index.php, etc.

                Once you have identified the offending URLs, use redirects to forward them to the proper, search-engine-friendly version.
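
As a rough illustration of the last two steps, the sketch below maps each URL variant to one preferred form and lists the variants that would need a redirect. It assumes every input URL belongs to the same site and that the preferred form is https on the www host without index.php; those choices, the host `www.example.com` and the function names are assumptions, not something stated in this thread.

```python
from urllib.parse import urlsplit, urlunsplit


def preferred_url(url, host="www.example.com"):
    """Map a URL variant to the assumed preferred form.

    Assumed preference: https scheme, a single host, and no trailing
    index.php in the path. The query string is kept; fragments are dropped.
    """
    parts = urlsplit(url)
    path = parts.path
    if path.endswith("/index.php"):
        path = path[: -len("index.php")]
    return urlunsplit(("https", host, path or "/", parts.query, ""))


def redirect_map(urls, host="www.example.com"):
    """Return {variant: preferred} for every URL that is not already preferred."""
    mapping = {}
    for url in urls:
        target = preferred_url(url, host)
        if target != url:
            mapping[url] = target
    return mapping


# Hypothetical URL variants that all serve the same content.
SAMPLE_URLS = [
    "https://www.example.com/shoes/",
    "http://example.com/shoes/",
    "https://www.example.com/shoes/index.php",
    "https://example.com/index.php",
]

if __name__ == "__main__":
    for source, target in redirect_map(SAMPLE_URLS).items():
        print(source, "->", target)
```

Each pair in the resulting map is a candidate for a redirect rule in whatever form your server or cart supports.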
