The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Intermediate & Advanced SEO
    4. Disallowed Pages Still Showing Up in Google Index. What do we do?

    Disallowed Pages Still Showing Up in Google Index. What do we do?

    Intermediate & Advanced SEO
    6 4 896
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • udemy
      udemy last edited by

      We recently disallowed a wide variety of pages for www.udemy.com which we do not want google indexing (e.g., /tags or /lectures). Basically we don't want to spread our link juice around to all these pages that are never going to rank. We want to keep it focused on our core pages which are for our courses.

      We've added them as disallows in robots.txt, but after 2-3 weeks google is still showing them in it's index. When we lookup "site: udemy.com", for example, Google currently shows ~650,000 pages indexed... when really it should only be showing ~5,000 pages indexed.

      As another example, if you search for "site:udemy.com/tag", google shows 129,000 results. We've definitely added "/tag" into our robots.txt properly, so this should not be happening... Google showed be showing 0 results.

      Any ideas re: how we get Google to pay attention and re-index our site properly?

      1 Reply Last reply Reply Quote 0
      • john4math
        john4math last edited by

        Disallowing in your robots.txt keeps the bots from indexing your pages going forward, but Google may keep returning them in search results.  This post has great explanations about ways to remove pages from indices: http://www.seomoz.org/blog/robot-access-indexation-restriction-techniques-avoiding-conflicts

        The surefire way to get them out of the index is to remove the disallow from your robots.txt, and add a meta noindex tags on all the pages you want removed.  Once they're reindexed by Google, they'll no longer appear in SERPs.

        loopyal 1 Reply Last reply Reply Quote 1
        • loopyal
          loopyal @john4math last edited by

          I would have said the same thing, except that a few weeks ago, I removed a rule from the robots file and I changed the affected pages to have a noindex.nofollow and the next day, tens of thousands of those pages appeared in the index and overpowered the content pages.

          So my advice, is don't trust noindex,nofollow and just stop the robot going down that tree (as you are doing) and find another way to get those pages out of the index.

          You can use the URL removal request tool.

          It only seems to allow you to remove 1000 per day.

          I have done this before by automating the removal using a macro program.

          I think I removed about 15,000 over the space of a month, doing that.

          They are fairly fast at removing URLs these days, 24 hours or less.

          KeriMorgret loopyal 3 Replies Last reply Reply Quote 0
          • KeriMorgret
            KeriMorgret @loopyal last edited by

            The last time I looked, you can request removal of an entire directory as well, which should work for the OP.

            1 Reply Last reply Reply Quote 1
            • loopyal
              loopyal @loopyal last edited by

              Thank you Keri.

              Yes, good idea, but whatever you request, that page or directory must respond with a 404, otherwise, it will be ignored.

              • that is why I couldn't do that with the send to a friend URLs

              (would have been a nice thing to do)

              I guess I could have cheated, and made them return a 404 if it was google, just to dump them all out of the index.

              The 15,000 I did request to be removed were individual pages, that returned 404 response code, so thats why I did them one at a time. I could have waited, but if you wait, then google keeps trying to fetch those missing pages and they keep reporting them in your GWT.

              That is a good reason to request the removals.

              I actually gave up when the number of deletions got to 1.5 million. I figured it was just too hard to do.

              1 Reply Last reply Reply Quote 0
              • KeriMorgret
                KeriMorgret @loopyal last edited by

                The last time I used a tool, excluding via robots.txt was also sufficient for URL removal.

                Recently, Google has updated their documentation to strongly encourage you to use URL removal only for things like exposing confidential information, and not to clean up old pages or errors in your GWT account (see http://support.google.com/webmasters/bin/answer.py?hl=en&answer=1269119). I know many people still use the tool for that type of stuff, but wanted to point out that change.

                1 Reply Last reply Reply Quote 0
                • 1 / 1
                • First post
                  Last post
                • Paginated category pages still showing in Google
                  NickSamuel
                  NickSamuel
                  0
                  5
                  89

                • Google is indexing wrong page for search terms not on that page
                  katemorris
                  katemorris
                  0
                  6
                  1.1k

                • Link Removal Request Sent to Google, Bad Pages Gone from Index But Still Appear in Webmaster Tools
                  Kingalan1
                  Kingalan1
                  0
                  3
                  146

                • Does Google still don't index Hashtag Links ? No chance to get a Search Result that leads directly to a section of a page? or to one of numeras Hashtag Pages in a single HTML page?
                  Muhammad_Jabali
                  Muhammad_Jabali
                  0
                  3
                  748

                • "No index" page still shows in search results and paginated pages shows page 2 in results
                  khi5
                  khi5
                  0
                  3
                  114

                • Incorrect cached page indexing in Google while correct page indexes intermittently
                  MikeRoberts
                  MikeRoberts
                  0
                  2
                  298

                • Why the archive sub pages are still indexed by Google?
                  Stramark
                  Stramark
                  1
                  5
                  241

                • Why is Google displaying inside pages for our sites rather than the index pages?
                  aloley
                  aloley
                  0
                  7
                  430

                Get started with Moz Pro!

                Unlock the power of advanced SEO tools and data-driven insights.

                Start my free trial
                Products
                • Moz Pro
                • Moz Local
                • Moz API
                • Moz Data
                • STAT
                • Product Updates
                Moz Solutions
                • SMB Solutions
                • Agency Solutions
                • Enterprise Solutions
                • Digital Marketers
                Free SEO Tools
                • Domain Authority Checker
                • Link Explorer
                • Keyword Explorer
                • Competitive Research
                • Brand Authority Checker
                • Local Citation Checker
                • MozBar Extension
                • MozCast
                Resources
                • Blog
                • SEO Learning Center
                • Help Hub
                • Beginner's Guide to SEO
                • How-to Guides
                • Moz Academy
                • API Docs
                About Moz
                • About
                • Team
                • Careers
                • Contact
                Why Moz
                • Case Studies
                • Testimonials
                Get Involved
                • Become an Affiliate
                • MozCon
                • Webinars
                • Practical Marketer Series
                • MozPod
                Connect with us

                Contact the Help team

                Join our newsletter
                Moz logo
                © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
                • Accessibility
                • Terms of Use
                • Privacy