The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Technical SEO Issues
    4. 404's being re-indexed

    404's being re-indexed

    Technical SEO Issues
    10 3 139
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • Tom3_15
      Tom3_15 last edited by

      Hi All,

      We are experiencing issues with pages that have been 404'd being indexed. Originally, these were /wp-content/ index pages, that were included in Google's index. Once I realized this, I added in a directive into our htaccess to 404 all of these pages - as there were hundreds. I tried to let Google crawl and remove these pages naturally but after a few months I used the URL removal tool to remove them manually.

      However, Google seems to be continually re/indexing these pages, even after they have been manually requested for removal in search console. Do you have suggestions? They all respond to 404's.

      Thanks

      1 Reply Last reply Reply Quote 1
      • gjordan90
        gjordan90 last edited by

        Back to basics. Have you marked those pages/posts as 'no-index'. With many wp plugins, you can no-index them in bulk then submit for re-indexation.

        Tom3_15 1 Reply Last reply Reply Quote 2
        • Tom3_15
          Tom3_15 @gjordan90 last edited by

          Yes, all pages have a noindex. I have also tried to noindex them using htaccess, to add an extra layer of security, but it seems to be incorrect. I believe it is an issue with the regex. Attempting to match anything with wp-content.

          <filesmatch "(wp-content)"="">Header set X-Robots-Tag: "noindex, nofollow"</filesmatch>

          effectdigital 1 Reply Last reply Reply Quote 0
          • effectdigital
            effectdigital @Tom3_15 last edited by

            You know that 404 means "temporarily gone but will be coming back" right? By saying a page is temporarily unavailable, you actively encourage Google to come back later

            If you want to say that the page is permanently gone use status code 410 (gone)

            Leave the Meta no-index stuff in the HTTP header via X-Robots, that was a good call. But it was a bad call to combine Meta no-index and 404, as they contradict each other ("don't index me now but then do come back and index me later as I'll probably be back at some point")

            Use Meta no-index and 410, which agree with each other ("don't index me now and don't bother coming back")

            Tom3_15 1 Reply Last reply Reply Quote 2
            • Tom3_15
              Tom3_15 @effectdigital last edited by

              Thank you! I am in the process of doing so, however with a 410 I can not leave my JS redirect after the page loads, this creates some UX issues. Do you have any suggestions to remedy this?

              Additionally, after the 410 the non x-robots noindex is now being stripped so it only resolves to a 410 with no noindex or redirect. I am still working on a noindex header, as the 410 is server-side, I assume this would be the only way, correct?

              effectdigital 1 Reply Last reply Reply Quote 1
              • effectdigital
                effectdigital @Tom3_15 last edited by

                Well if a page has been removed and has not been moved to a new destination - you shouldn't redirect a user anyway (which kind of 'tricks' users into thinking the content was found). That's actually bad UX

                If the content has been properly removed or was never supposed to be there, just leave it at a 410 (but maybe create a nice custom 410 page, in the same vein as a decent UX custom 404 page). Use the page to admit that the content is gone (without shady redirects) but to point to related posts or products. Let the user decide, but still be useful

                If the content is actually still there and, hence you are doing a redirect - then you shouldn't be serving 404s or 410s in the first place. You should be serving 301s, and just doing HTTP redirects to the content's new (or revised) destination URL

                Yes, the HTTP header method is the correct replacement when the HTML implementation gets stripped out. HTTP Header X-Robots is the way for you!

                Tom3_15 1 Reply Last reply Reply Quote 1
                • Tom3_15
                  Tom3_15 @effectdigital last edited by

                  Thank you for the quick response,

                  The pages are truly removed, however, because there were so many of these types of pages that leaked into the index, I added a redirect to keep users on our site - no intentions of being "shady", I just didn't want hundreds of 404's getting clicked and causing a very high bounce rate.

                  For the x-robots header, could you offer some insight into why my directive isn't working? I believe it's a regex issue on the wp-content. I have tried to troubleshoot to no avail.

                  <filesmatch <strong="">"(wp-content)">
                  Header set X-Robots-Tag: "noindex, nofollow"</filesmatch>

                  I appreciate the help!

                  effectdigital 1 Reply Last reply Reply Quote 0
                  • effectdigital
                    effectdigital @Tom3_15 last edited by

                    First it would be helpful to know how you are detecting that it isn't working. What indexation tool are you using to see whether the blocks are being detected? I personally really like this one: https://chrome.google.com/webstore/detail/seo-indexability-check/olojclckfadnlhnlmlekdihebmjpjnoa?hl=en-GB

                    Or obviously at scale - Screaming Frog

                    Tom3_15 2 Replies Last reply Reply Quote 0
                    • Tom3_15
                      Tom3_15 @effectdigital last edited by

                      I'll check this one out as well, thanks! I used a header response extension which reveals the presence of x-botots headers called web developer.

                      1 Reply Last reply Reply Quote 0
                      • Tom3_15
                        Tom3_15 @effectdigital last edited by

                        Just to follow up - I have now actually 410'd the pages and the 410's are still being re-indexed.

                        1 Reply Last reply Reply Quote 0
                        • 1 / 1
                        • First post
                          Last post
                        • Strange URL's for client's site
                          everestagency
                          everestagency
                          0
                          3
                          456

                        • Test site got indexed in Google - What's the best way of getting the pages removed from the SERP's?
                          Yarden_Uitvaartorganisatie
                          Yarden_Uitvaartorganisatie
                          0
                          5
                          1.3k

                        • Sitemap issue? 404's & 500's are regenerating?
                          jeff-rackaid.com
                          jeff-rackaid.com
                          0
                          5
                          263

                        • Unfindable 404's
                          Adamshowbiz
                          Adamshowbiz
                          0
                          5
                          110

                        • What's the best way to handle Overly Dynamic Url's?
                          GKLA
                          GKLA
                          0
                          2
                          309

                        • Website's stability and it's affect on SEO
                          AlanMosley
                          AlanMosley
                          0
                          2
                          1.0k

                        • Can I format my H1 to be smaller than H2's and H3's on the same page?
                          theLotter
                          theLotter
                          0
                          5
                          2.8k

                        • 404's and duplicate content.
                          AnthonyLasVegas
                          AnthonyLasVegas
                          0
                          5
                          868

                        Get started with Moz Pro!

                        Unlock the power of advanced SEO tools and data-driven insights.

                        Start my free trial
                        Products
                        • Moz Pro
                        • Moz Local
                        • Moz API
                        • Moz Data
                        • STAT
                        • Product Updates
                        Moz Solutions
                        • SMB Solutions
                        • Agency Solutions
                        • Enterprise Solutions
                        • Digital Marketers
                        Free SEO Tools
                        • Domain Authority Checker
                        • Link Explorer
                        • Keyword Explorer
                        • Competitive Research
                        • Brand Authority Checker
                        • Local Citation Checker
                        • MozBar Extension
                        • MozCast
                        Resources
                        • Blog
                        • SEO Learning Center
                        • Help Hub
                        • Beginner's Guide to SEO
                        • How-to Guides
                        • Moz Academy
                        • API Docs
                        About Moz
                        • About
                        • Team
                        • Careers
                        • Contact
                        Why Moz
                        • Case Studies
                        • Testimonials
                        Get Involved
                        • Become an Affiliate
                        • MozCon
                        • Webinars
                        • Practical Marketer Series
                        • MozPod
                        Connect with us

                        Contact the Help team

                        Join our newsletter
                        Moz logo
                        © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
                        • Accessibility
                        • Terms of Use
                        • Privacy