The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Intermediate & Advanced SEO
    4. Old URLs that have 301s to 404s not being de-indexed.

    Old URLs that have 301s to 404s not being de-indexed.

    Intermediate & Advanced SEO
    4 3 119
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • boxclever
      boxclever last edited by

      We have a scenario on a domain that recently moved to enforcing SSL. If a page is requested over non-ssl (http) requests, the server automatically redirects to the SSL (https) URL using a good old fashioned 301. This is great except for any page that no longer exists, in which case you get a 301 going to a 404.

      Here's what I mean.

      Case 1 - Good page:

      http://domain.com/goodpage -> 301 -> https://domain.com/goodpage -> 200

      Case 2 - Bad page that no longer exists:

      http://domain.com/badpage -> 301 -> https://domain.com/badpage -> 404

      Google is correctly re-indexing all the "good" pages and just displaying search results going directly to the https version.

      Google is stubbornly hanging on to all the "bad" pages and serving up the original URL (http://domain.com/badpage) unless we submit a removal request. But there are hundreds of these pages and this is starting to suck. Note: the load balancer does the SSL enforcement, not the CMS. So we can't detect a 404 and serve it up first. The CMS does the 404'ing.

      Any ideas on the best way to approach this problem? Or any idea why Google is holding on to all the old "bad" pages that no longer exist, given that we've clearly indicated with 301s that no one is home at the old address?

      1 Reply Last reply Reply Quote 0
      • TheSymmetran
        TheSymmetran last edited by

        Read these three, then do what you got to do...

        https://www.searchcommander.com/how-to-bulk-remove-urls-google/

        https://productforums.google.com/forum/#!topic/webmasters/uYFJnsyiH8w

        https://moz.com/community/q/404-redirects-to-the-homepage-is-this-good-bad-ugly

        For proper removal, please ensure that there are no INTERNAL links anywhere on your website to 404 addresses, from sitemap, buttons, text, or images (the whole 9 yards).

        Good luck!

        1 Reply Last reply Reply Quote 0
        • becole
          becole last edited by

          sym_admin is right--you'll want to find the source of those pages, as Google apparently is seeing them from somewhere and still requesting them. If there are links to those pages somewhere, you will need to remove them. Also, if you're able, I would change those URLs so that they serve up a "410 Gone" error, and not a 404.

          1 Reply Last reply Reply Quote 0
          • boxclever
            boxclever last edited by

            I don't think 404 vs 410 is the answer here.The basis for this thought is the following:

            https://searchenginewatch.com/sew/how-to/2340728/matt-cutts-on-how-google-handles-404-410-status-codes

            ========

            "if we see a page and we get a 404, we are gonna protect that page for 24 hours in the crawling system, so we sort of wait and we say maybe that was a transient 404, maybe it really wasn’t intended to be a page not found.”

            “If we see a 410, then the site crawling system says, OK we assume the webmasters knows what they’re doing because they went off the beaten path to deliberately say this page is gone,” he said. “So they immediately convert that 410 to an error, rather than protecting it for 24 hours."

            ========

            I'm thinking the deeper issue is why the 301s are not being respected. If a link points to http://domain.com/badpage  and we use a 301 to point to https://domain.com/badpage - shouldn't the crawler (Google or otherwise) respect the 301? Why still index and serve up a page that responds with the 301? To me, this is baffling.  If we serve up a 404 or a 410 - either way we are saying "this page is gone" but we're still seeing the original http://domain.com/badpage in the index?

            Does that make sense? Or is there more clarification required?

            1 Reply Last reply Reply Quote 0
            • 1 / 1
            • First post
              Last post
            • Old product URLs still indexed and maybe causing problems?
              mcampanaro
              mcampanaro
              0
              5
              87

            • Client wants to remove mobile URLs from their sitemap to avoid indexing issues. However this will require SEVERAL billing hours. Is having both mobile/desktop URLs in a sitemap really that detrimental to search indexing?
              RosemaryB
              RosemaryB
              0
              7
              89

            • How long does google index old urls?
              daracreative
              daracreative
              0
              3
              108

            • Google Indexed my Site then De-indexed a Week After
              Jonathan1979
              Jonathan1979
              0
              4
              397

            • What may cause a page not to be indexed (be de-indexed)?
              OlegKorneitchouk
              OlegKorneitchouk
              0
              6
              152

            • How do you de-index and prevent indexation of a whole domain?
              SebastianCowie
              SebastianCowie
              0
              3
              430

            • Indexed non existent pages, problem appeared after we 301d the url/index to the url.
              ThompsonPaul
              ThompsonPaul
              0
              4
              331

            • How to deal with old, indexed hashbang URLs?
              belasco
              belasco
              0
              4
              2.0k

            Get started with Moz Pro!

            Unlock the power of advanced SEO tools and data-driven insights.

            Start my free trial
            Products
            • Moz Pro
            • Moz Local
            • Moz API
            • Moz Data
            • STAT
            • Product Updates
            Moz Solutions
            • SMB Solutions
            • Agency Solutions
            • Enterprise Solutions
            • Digital Marketers
            Free SEO Tools
            • Domain Authority Checker
            • Link Explorer
            • Keyword Explorer
            • Competitive Research
            • Brand Authority Checker
            • Local Citation Checker
            • MozBar Extension
            • MozCast
            Resources
            • Blog
            • SEO Learning Center
            • Help Hub
            • Beginner's Guide to SEO
            • How-to Guides
            • Moz Academy
            • API Docs
            About Moz
            • About
            • Team
            • Careers
            • Contact
            Why Moz
            • Case Studies
            • Testimonials
            Get Involved
            • Become an Affiliate
            • MozCon
            • Webinars
            • Practical Marketer Series
            • MozPod
            Connect with us

            Contact the Help team

            Join our newsletter
            Moz logo
            © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
            • Accessibility
            • Terms of Use
            • Privacy