The Moz Q&A Forum


    Problems with too many indexed pages

    Technical SEO Issues
    • Inevo
      Inevo last edited by

      A client of ours has not been able to rank very well over the last few years. They are a big brand in our country, have more than 100 offline stores, and have plenty of inbound links.

      Our main issue has been that they have too many indexed pages. Before we started, they had around 750,000 pages in the Google index. After a bit of work we got it down to 400,000-450,000. During our latest push we used the robots meta tag with "noindex, nofollow" on all pages we wanted to get out of the index, along with a canonical to the correct URL. Nothing was done to robots.txt to block the crawlers from entering the pages we wanted out.

      Our aim is to get it down to roughly 5000+ pages. They just passed 5000 products + 100 categories.

      I added this about 10 days ago, but nothing has happened yet. Is there anything I can do to speed up the process of getting all the pages out of the index?

      The page is vita.no if you want to have a look!

      • DonnaDuncan
        DonnaDuncan last edited by

        Hi Inevo,

        I had a similar situation last year and am not aware of a faster way to get pages deindexed. You're feeding WMT an updated sitemap, right?

        It took 8 months for the excess pages to get dropped off my client's site. I'll be listening to hear if anyone knows a faster way.

        • MoosaHemani
          MoosaHemani last edited by

          My advice would be to create a fresh sitemap and upload it to Google Webmaster Tools. I'm not sure about the timing, but I will second Donna: it will take time for the pages to drop out of the Google index.

          There is one hack that I used for one page on my website, but I'm not sure if it will work for 1000+ pages.

          I actually removed a page on my website using Google’s temporary removal request. It kicked the page out of the index for 90 days, and in the meantime I added the URL to the robots.txt file, so it was gone quickly and never returned to the Google listings.
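A minimal sketch of how the robots.txt part of that hack could be checked, assuming a hypothetical site at www.example.com and a hypothetical /old-page/ path (neither comes from this thread). It uses Python's standard urllib.robotparser to confirm that a rule blocks the URL. One thing to keep in mind: a crawler that is blocked by robots.txt cannot fetch the page, so it also cannot see a noindex tag placed on it.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content; the path is an assumption for illustration.
RULES = """\
User-agent: *
Disallow: /old-page/
"""

parser = RobotFileParser()
parser.parse(RULES.splitlines())


def can_crawl(url, agent="Googlebot"):
    """Return True if the parsed rules allow `agent` to fetch `url`."""
    return parser.can_fetch(agent, url)


blocked = can_crawl("http://www.example.com/old-page/")
allowed = can_crawl("http://www.example.com/some-category/")
```

Here `blocked` should come out False and `allowed` True, which is a quick way to verify a rule before uploading the file.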

          Hope this helps.

          • Everett
            Everett last edited by

            Hello Inevo,

            Most of the time when this happens it's just because Google hasn't gotten around to recrawling the pages and updating their index after seeing the new robots meta tag. It can take several months for this to happen on a large site. Submit an XML sitemap and/or create an HTML sitemap that makes it easy for them to get to these pages if you need it to go faster.
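As a rough illustration of the XML sitemap idea, here is a minimal sketch that builds a sitemap from a list of URLs using only Python's standard library. The function name build_sitemap is made up for this example, and how the URL list is produced (e.g. exported from the shop's database) is an assumption left out of the sketch.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(urls):
    """Return a sitemap XML string with one <url><loc> entry per URL."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for url in urls:
        entry = ET.SubElement(urlset, "url")
        ET.SubElement(entry, "loc").text = url
    return ET.tostring(urlset, encoding="unicode")


# URLs taken from examples in this thread.
sitemap_xml = build_sitemap([
    "http://www.vita.no/duft?p=2",
    "http://www.vita.no/gavesett/herre/filter/price-400-/",
])
```

The resulting string can be saved as a .xml file and submitted in Webmaster Tools.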

            I had a look and see some conflicting instructions that Google could possibly be having a problem with.

            The paginated version (e.g. http://www.vita.no/duft?p=2) of the page has a rel canonical tag pointing to the first page (e.g. http://www.vita.no/duft/). Yet it also has a noindex tag, while the canonical page has an index tag. And each page has its own unique title (Side 2 ... Side 3 | ...). I would remove the rel canonical tag on the paginated pages, since they probably don't have any PageRank worth passing to the canonical page. This makes it even clearer to Google that the canonical page is to be indexed and the others are not, instead of saying they are the same page. The same is true of filter pages (e.g. http://www.vita.no/gavesett/herre/filter/price-400-/).
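One way to find pages that send this mixed signal is to parse each page's head and flag any page that is noindexed while its canonical points somewhere else. The sketch below uses only Python's standard library. The class and function names are made up for illustration, and the sample HTML is an assumption modelled on the description above, not a copy of the live page.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class HeadSignals(HTMLParser):
    """Collects robots meta directives and the rel=canonical href."""

    def __init__(self):
        super().__init__()
        self.robots = set()
        self.canonical = None

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "meta" and (attrs.get("name") or "").lower() == "robots":
            content = attrs.get("content") or ""
            self.robots.update(
                part.strip().lower() for part in content.split(",") if part.strip()
            )
        elif tag == "link" and "canonical" in (attrs.get("rel") or "").lower().split():
            self.canonical = attrs.get("href")


def conflicting_signals(page_url, html):
    """True if the page is noindexed AND canonicalised to a different URL."""
    parser = HeadSignals()
    parser.feed(html)
    if parser.canonical is None:
        return False
    canonical = urljoin(page_url, parser.canonical)
    points_elsewhere = canonical.rstrip("/") != page_url.rstrip("/")
    return "noindex" in parser.robots and points_elsewhere


# Assumed markup for a paginated page, based on the description in this post.
SAMPLE = (
    '<head><meta name="robots" content="noindex, nofollow">'
    '<link rel="canonical" href="http://www.vita.no/duft/"></head>'
)
has_conflict = conflicting_signals("http://www.vita.no/duft?p=2", SAMPLE)
```

Running this over a crawl export would give a list of URLs where the canonical tag could be dropped.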

            I don't know if that has anything to do with your issue of index bloat, but it's worth a try. I did find some paginated pages in the index.

            There also appears to be about 520 blog tag pages indexed. I typically set those to be noindex,follow.

            Also remove all paginated pages and any other page that you don't want indexed from your XML sitemaps if you haven't already.
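A small sketch of that cleanup step, assuming paginated URLs can be recognised by a `p` query parameter and filter pages by a `/filter/` path segment, as in the example URLs earlier in this post. Other URL patterns on the site would need their own rules, and the function names are made up for illustration.

```python
from urllib.parse import parse_qs, urlsplit


def belongs_in_sitemap(url):
    """Return False for paginated (?p=N) and /filter/ URLs, True otherwise."""
    parts = urlsplit(url)
    if "p" in parse_qs(parts.query):
        return False
    if "/filter/" in parts.path:
        return False
    return True


def filter_sitemap_urls(urls):
    """Keep only the URLs that should stay in the XML sitemap."""
    return [url for url in urls if belongs_in_sitemap(url)]


kept = filter_sitemap_urls([
    "http://www.vita.no/duft/",
    "http://www.vita.no/duft?p=2",
    "http://www.vita.no/gavesett/herre/filter/price-400-/",
])
```

With these three inputs, only the category URL http://www.vita.no/duft/ remains in `kept`.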

            At least for the filter pages, since /filter/ is its own directory, you can use the URL removal tool in GWT. It does have a directory-level removal feature. Of course there are only 75 of these indexed at this moment.

            • Inevo
              Inevo @Everett last edited by

              Isn't the whole point of using canonical to give Google a pointer of what page it is originally meant to be?

              So if you have a category on shop.com/sub..

              Using filters and/or pagination you then get:

              shop.com/sub?p=1
              shop.com/sub?color=blue

              .. and so on! Both of those pages need a canonical, and we don't want either of them indexed. So by using both canonical and noindex, we tell Google: "don't index this page (noindex), here is the original version of it (canonical)".

              Or did I misunderstand something? 🙂

              • Everett
                Everett @Inevo last edited by

                "Google: Do Not No Index Pages With Rel Canonical Tags"
                https://www.seroundtable.com/noindex-canonical-google-18274.html

                https://productforums.google.com/forum/?hl=en#!category-topic/webmasters/crawling-indexing--ranking/0sqRrolO_Ss

                This is still being debated by people and I'm not saying it is "definitely" your problem. But if you're trying to figure out why those noindexed pages aren't coming out of the index this could be one thing to look into.

                John Mueller (see screenshot below) is a Webmaster Trends Analyst for Google.

                Good luck.

                [Screenshot: Noindex-no-follow-rel-canonical-same-page-1395178226.png]

                • Inevo
                  Inevo @Everett last edited by

                  Thanks for that! What you are saying makes sense, so I'm going to go ahead and give it a try.

                  • Everett
                    Everett @Inevo last edited by

                    Great! Please let us know how it goes so we can all learn more about it.

                    Thanks!
