The Moz Q&A Forum


    Removing indexed pages

    • Jettynz

      Hi all, this is my first post, so be kind 🙂 I have a one-page WordPress site with the Yoast plugin installed. Unfortunately, when I first submitted the site's XML sitemap to Google Search Console, I hadn't checked the Yoast settings, and the sitemap included some example pages from a theme demo I was using. These got indexed, which is a pain, so now I am trying to remove them. Originally I set up a bunch of 301s, but that didn't remove them from the index (at least not after about a month), so now I have set up 410s. These don't seem to be working either, and I am wondering whether it is because I re-submitted the sitemap with only the index page on it (as it is just a single-page site). Could that have stopped Google from crawling the original pages and actually seeing the 410s?
      Thanks in advance for any suggestions.

      • seoman10

        A couple of ideas spring to mind:

        1. Use the robots.txt file
        2. Demote the sitelink in Google Search Console (see https://support.google.com/webmasters/answer/47334)

        Example of a robots.txt file:

        User-agent: *
        Disallow: /the-link/you-dont/want-to-show.html
        Disallow: /the-link/you-dont/want-to-show2.html

        Don't include the domain, just the path to the page. There are plenty of tutorials out there; http://www.robotstxt.org is worth a look.

        • ViviCa1

          I'd suggest adding a noindex robots meta tag to the affected pages (see how to do this here: https://support.google.com/webmasters/answer/93710?hl=en) and, until Google recrawls them, using the Remove URLs tool (see how to use this here: https://support.google.com/webmasters/answer/1663419?hl=en).

          If you use the noindex robots meta tag, don't disallow the pages in your robots.txt, or Google won't even see the tag. Disallowing crawling doesn't keep a page out of the index, and it doesn't remove a page that is already indexed; it only means Google won't crawl the page.
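
To illustrate the point about robots.txt hiding the noindex tag, here is a minimal sketch using Python's standard `urllib.robotparser`. It only models how a rule-following crawler reads robots.txt; it is not Googlebot's actual logic. The `example.com` URLs and the `crawler_can_fetch` helper are hypothetical names made up for this example.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt, modelled on the example earlier in the thread.
ROBOTS_TXT = """\
User-agent: *
Disallow: /the-link/you-dont/want-to-show.html
"""


def crawler_can_fetch(robots_txt: str, url: str, agent: str = "Googlebot") -> bool:
    """Return True if a crawler that obeys robots.txt may request `url`."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


# Hypothetical URLs on a placeholder domain.
blocked = "https://example.com/the-link/you-dont/want-to-show.html"
allowed = "https://example.com/"

print(crawler_can_fetch(ROBOTS_TXT, blocked))  # False: the page is never requested
print(crawler_can_fetch(ROBOTS_TXT, allowed))  # True
```

If the crawler never requests the blocked page, it never reads the page's HTML, so a noindex tag (or a 410 response) on that URL goes unseen.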

          • Joe.Robison

            I agree with ViviCa1's methods, so go with that.

            One thing I wanted to bring up, though: unless people are actually visiting the pages you don't want indexed, or those pages do some kind of brand damage, you don't really need to make this a priority.

            Just because they're indexed doesn't mean they're showing up for any searches - and most likely they aren't - so people will realistically never see them. And if you only have a one-page site, you're not wasting much crawl budget on those.

            I just bring this up since sometimes we (I'm guilty of it too) can get bogged down by small distractions in SEO that don't really help much, when we should be creating and producing new things!

            "These also seem to not be working and I am wondering if it is because I re-submitted the sitemap with only the index page on it (as it is just a single page site) could that have now stopped Google indexing the original pages to actually see the 410's?"

            There was a good related response from Google employee Susan Moskwa:

            “The best way to stop Googlebot from crawling URLs that it has discovered in the past is to make those URLs (such as your old Sitemaps) 404. After seeing that a URL repeatedly 404s, we stop crawling it. And after we stop crawling a Sitemap, it should drop out of your "All Sitemaps" tab.”

            It's a bit older, but it shows how Google discovers URLs through the sitemap. Take a look at the rest of that thread as well.

            • Jettynz

              Thanks for all the responses! 🙂

              At the moment I am serving the 410s from the .htaccess file, as I removed the actual pages a while ago. The pages don't show in most searches; however, two of them do show up in some instances under the sitelinks, which is the main pain. I manually asked for them to be removed using 'Remove URLs', but that only lasts a couple of months and they are now back.
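
Before changing approach, it may be worth confirming what status code the removed URLs really return. The following is a rough sketch using only Python's standard library; `fetch_status` and `describe` are hypothetical helper names, and the labels are informal, not anything Google defines.

```python
import urllib.error
import urllib.request


def fetch_status(url: str, timeout: float = 10.0) -> int:
    """Return the HTTP status code the server sends for `url`.

    Note: urlopen follows redirects, so for a redirected URL this reports
    the status of the final response, not the 301 itself.
    """
    request = urllib.request.Request(url, method="HEAD")
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as error:
        # urllib raises on 4xx/5xx, but the code is exactly what we want.
        return error.code


def describe(status: int) -> str:
    """Give an informal label for the status codes discussed in this thread."""
    if status == 410:
        return "gone"
    if status == 404:
        return "not found"
    if 200 <= status < 300:
        return "still served"
    return "other"
```

For example, `describe(fetch_status(url))` should come back as "gone" for each removed page if the .htaccess rules are matching the URLs Google has indexed.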

              So I guess the best way is to recreate the pages and insert a noindex?
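
If the pages are recreated with a noindex tag, a quick check that the tag is really present in the served HTML could look like the sketch below. It uses Python's standard `html.parser`; the class and function names are hypothetical, and the check only looks at `<meta name="robots">` and `<meta name="googlebot">` tags.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collect directives from robots/googlebot meta tags."""

    def __init__(self) -> None:
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        # Attribute values can be None (e.g. a bare attribute), so default to "".
        attributes = {name.lower(): (value or "") for name, value in attrs}
        if attributes.get("name", "").lower() in {"robots", "googlebot"}:
            for directive in attributes.get("content", "").split(","):
                self.directives.add(directive.strip().lower())


def has_noindex(html: str) -> bool:
    """Return True if the HTML contains a robots meta tag with noindex."""
    parser = RobotsMetaParser()
    parser.feed(html)
    return "noindex" in parser.directives


page = '<html><head><meta name="robots" content="noindex, follow"></head></html>'
print(has_noindex(page))  # True
```

The page has to return a normal 200 response and must not be disallowed in robots.txt, otherwise the tag is never read.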

              Thanks again for everyone's time, it's much appreciated.
