The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Intermediate & Advanced SEO
    4. More Indexed Pages than URLs on site.

    More Indexed Pages than URLs on site.

    Intermediate & Advanced SEO
    6 3 358
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • DavidLenehan
      DavidLenehan last edited by

      According to webmaster tools, the number of pages indexed by Google on my site doubled yesterday (gone from 150K to 450K). Usually I would be jumping for joy but now I have more indexed pages than actual pages on my site.

      I have checked for duplicate URLs pointing to the same product page but can't see any, pagination in category pages doesn't seem to be indexed nor does parameterisation in URLs from advanced filtration.

      Using the site: operator we get a different result on google.com (450K)  to google.co.uk (150K).

      Anyone got any ideas?

      1 Reply Last reply Reply Quote 0
      • irvingw
        irvingw last edited by

        Can you see in the search the pages which are indexed and look for duplicates or technical issues causing improper indexing? Do you have other sites like subdomains Google might be counting as pages.

        DavidLenehan 1 Reply Last reply Reply Quote 0
        • LynnPatchett
          LynnPatchett last edited by

          Hi David,

          Not sure why they started showing up now (some recent changes to the site?) but I suspect your problem is indexed urls that you are trying to block with robots.txt but are finding their way into the index somehow.

          If you do a search for: site:nicontrols.com inurl:/manufacturer/ and then click on the show omitted results you will see a whole bunch (31000!) of 'content blocked by robots.txt' notices but the urls are still in the index. If you do a couple more similar searches looking for other likely url paths you  will likely find some more.

          If you can get a no-index meta tag into these pages I think it will be more effective in keeping them out of the index. If you have in mind some recent changes you have done to the site that might have introduced internal links to these pages then it would be worth looking to see if you can get the links removed or replaced with the 'proper' link format.

          Hope that helps!

          DavidLenehan 1 Reply Last reply Reply Quote 2
          • DavidLenehan
            DavidLenehan @irvingw last edited by

            Hi Irving

            We checked everything obvious and cannot explain what is going on. I cannot see any major duplicate content issues and we do not have any subdomains active.  The Moz crawler also doesn't highlight any major duplicate content issues.

            1 Reply Last reply Reply Quote 0
            • DavidLenehan
              DavidLenehan @LynnPatchett last edited by

              Thanks Lynn.  The 31,000 was a bit of a legacy of issue and something we have solved. The robots file was changed a couple of weeks ago. So fingers crossed Google will deindex them soon. We get the same result when using inurl: where.

              Any idea where the rest have come from?

              LynnPatchett 1 Reply Last reply Reply Quote 0
              • LynnPatchett
                LynnPatchett @DavidLenehan last edited by

                Hi David,

                Its tough to say without some more digging and information, it certainly looks like you have most of the common problem areas covered from what I can see. I will throw out an idea: I see you have a few 301 redirects in place switching from .html to non html versions. If this was done on a massive scale then possibly you have a google index with both versions of the pages in the index? If so it might not really be a big issue and over the next weeks/months the old .html versions will fall out of the index and your numbers will begin to look more normal again, Just a thought.

                1 Reply Last reply Reply Quote 0
                • 1 / 1
                • First post
                  Last post
                • How long will old pages stay in Google's cache index. We have a new site that is two months old but we are seeing old pages even though we used 301 redirects.
                  DonnaDuncan
                  DonnaDuncan
                  0
                  3
                  81

                • How do we decide which pages to index/de-index? Help for a 250k page site
                  julie-getonthemap
                  julie-getonthemap
                  0
                  2
                  63

                • Dfferent url of some other site is shown by Google in cace copy of our site's page
                  Dr-Pete
                  Dr-Pete
                  0
                  4
                  215

                • When Mobile and Desktop sites have the same page URLs, how should I handle the 'View Desktop Site' link on a mobile site to ensure a smooth crawl?
                  DirkC
                  DirkC
                  0
                  3
                  1.4k

                • Do I need to re-index the page after editing URL?
                  Chemometec
                  Chemometec
                  0
                  4
                  142

                • Indexed non existent pages, problem appeared after we 301d the url/index to the url.
                  ThompsonPaul
                  ThompsonPaul
                  0
                  4
                  331

                • Why are new pages not being indexed, and old pages (now in robots.txt) remain in the index?
                  KeriMorgret
                  KeriMorgret
                  0
                  3
                  378

                • Pages un-indexed in my site
                  AdamThompson
                  AdamThompson
                  0
                  6
                  999

                Get started with Moz Pro!

                Unlock the power of advanced SEO tools and data-driven insights.

                Start my free trial
                Products
                • Moz Pro
                • Moz Local
                • Moz API
                • Moz Data
                • STAT
                • Product Updates
                Moz Solutions
                • SMB Solutions
                • Agency Solutions
                • Enterprise Solutions
                • Digital Marketers
                Free SEO Tools
                • Domain Authority Checker
                • Link Explorer
                • Keyword Explorer
                • Competitive Research
                • Brand Authority Checker
                • Local Citation Checker
                • MozBar Extension
                • MozCast
                Resources
                • Blog
                • SEO Learning Center
                • Help Hub
                • Beginner's Guide to SEO
                • How-to Guides
                • Moz Academy
                • API Docs
                About Moz
                • About
                • Team
                • Careers
                • Contact
                Why Moz
                • Case Studies
                • Testimonials
                Get Involved
                • Become an Affiliate
                • MozCon
                • Webinars
                • Practical Marketer Series
                • MozPod
                Connect with us

                Contact the Help team

                Join our newsletter
                Moz logo
                © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
                • Accessibility
                • Terms of Use
                • Privacy