The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Intermediate & Advanced SEO
    4. Old pages still in index

    Old pages still in index

    Intermediate & Advanced SEO
    5 3 134
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • ssiebn7
      ssiebn7 last edited by

      Hi Guys,

      I've been working on a E-commerce site for a while now. Let me sum it up :

      • February new site is launched
      • Due to lack of resources we started 301's of old url's in March
      • Added rel=canonical end of May because of huge index numbers (developers forgot!!)
      • Added noindex and robots.txt on at least 1000 urls.
      • Index numbers went down from 105.000 tot 55.000 for now, see screenshot (actual number in sitemap is 13.000)

      Now when i do site:domain.com there are still old url's in the index while there is a 301 on the url since March!

      I know this can take a while but I wonder how I can speed this up or am doing something wrong. Hope anyone can help because I simply don't know how the old url's can still be in the index.

      4cArHPH.png

      1 Reply Last reply Reply Quote 0
      • Chris.Menke
        Chris.Menke last edited by

        It can take months for pages to fall out of Google's index have you looked at your log files to verify that googlebot is crawling those pages?.  Things to keep in mind:

        • If you 301 a page, the rel=canonical on that page will not be seen by the bot (no biggie in your case)
        • If you 301 a page, a meta noindex will not be seen by the bot
        • It is suggested not to use the robots.txt to no index a page  that is being 301 redirected--as the redirect may not be seen by Google.
        1 Reply Last reply Reply Quote 1
        • ssiebn7
          ssiebn7 last edited by

          Hi Chris,

          Thanks for your answer.

          I'm either using a 301 or noindex, not both of course.

          Still have to check the server logs, thanks for that!

          Another weird thing. While the old url is still in the index, when i check the cache date it's a week old. That's what i don't get. Cache date is a week old but Google still has the old url in the index.

          1 Reply Last reply Reply Quote 0
          • evolvingSEO
            evolvingSEO last edited by

            Hi There

            To noindex pages there are a few methods;

            • use a meta noindex without robots.txt - I think that is why some may not be removed. The robots.txt block crawling so they can not see the noindex.

            • use a 301 redirect - this will eventually kill off the old pages, but it can definitely take a while.

            • canonical it to another page. and as Chris says, don't block the page or add extra directives. If you canonical the page (correctly), I find it usually drops out of the index fairly quickly after being crawled.

            • use the URL removal tool in webmaster tools + robots.txt or 404. So if you 404 a page or block it with robots.txt you can then go into webmaster tools and do a URL removal. This is NOT recommended though in most normal cases, as Google prefers this be for "emergencies".

            The only method that removes pages within a day or two guaranteed is the URL removal tool.

            I would also examine your site since it is new, for something that is causing additional pages to be generated and indexed. I see this a lot with ecommerce sites where they have lots of pagination, facets, sorting, etc and those can generate lots of other pages which get indexed.

            Again, as Chris says, you want to be careful to not mix signals. Hope this all helps!

            -Dan

            1 Reply Last reply Reply Quote 1
            • ssiebn7
              ssiebn7 last edited by

              Hi Dan,

              Thanks for the answer!

              Indexation is already back to 42.000 so slowly going back to normal 🙂

              And thanks for the last tip, that's totally right. I just discovered that several pages had duplicate url's generated so by continually monitoring we'll fix it !

              1 Reply Last reply Reply Quote 0
              • 1 / 1
              • First post
                Last post
              • I still see the old page in index
                seoanalytics
                seoanalytics
                0
                7
                60

              • How do we decide which pages to index/de-index? Help for a 250k page site
                julie-getonthemap
                julie-getonthemap
                0
                2
                63

              • Robots.txt Disallowed Pages and Still Indexed
                Igor.Go
                Igor.Go
                0
                3
                2.9k

              • Our client's web property recently switched over to secure pages (https) however there non secure pages (http) are still being indexed in Google. Should we request in GWMT to have the non secure pages deindexed?
                N1ghteyes
                N1ghteyes
                0
                3
                128

              • "No index" page still shows in search results and paginated pages shows page 2 in results
                khi5
                khi5
                0
                3
                114

              • Incorrect cached page indexing in Google while correct page indexes intermittently
                MikeRoberts
                MikeRoberts
                0
                2
                298

              • I have removed over 2000+ pages but Google still says i have 3000+ pages indexed
                evolvingSEO
                evolvingSEO
                0
                6
                157

              • Should pages of old news articles be indexed?
                gcdtechnologies
                gcdtechnologies
                0
                6
                357

              Get started with Moz Pro!

              Unlock the power of advanced SEO tools and data-driven insights.

              Start my free trial
              Products
              • Moz Pro
              • Moz Local
              • Moz API
              • Moz Data
              • STAT
              • Product Updates
              Moz Solutions
              • SMB Solutions
              • Agency Solutions
              • Enterprise Solutions
              • Digital Marketers
              Free SEO Tools
              • Domain Authority Checker
              • Link Explorer
              • Keyword Explorer
              • Competitive Research
              • Brand Authority Checker
              • Local Citation Checker
              • MozBar Extension
              • MozCast
              Resources
              • Blog
              • SEO Learning Center
              • Help Hub
              • Beginner's Guide to SEO
              • How-to Guides
              • Moz Academy
              • API Docs
              About Moz
              • About
              • Team
              • Careers
              • Contact
              Why Moz
              • Case Studies
              • Testimonials
              Get Involved
              • Become an Affiliate
              • MozCon
              • Webinars
              • Practical Marketer Series
              • MozPod
              Connect with us

              Contact the Help team

              Join our newsletter
              Moz logo
              © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
              • Accessibility
              • Terms of Use
              • Privacy