The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Intermediate & Advanced SEO
    4. After hack and remediation, thousands of URL's still appearing as 'Valid' in google search console. How to remedy?

    After hack and remediation, thousands of URL's still appearing as 'Valid' in google search console. How to remedy?

    Intermediate & Advanced SEO
    2 2 155
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • rickyporco
      rickyporco last edited by

      I'm working on a site that was hacked in March 2019 and in the process, nearly 900,000 spam links were generated and indexed. After remediation of the hack in April 2019, the spammy URLs began dropping out of the index until last week, when Search Console showed around 8,000 as "Indexed, not submitted in sitemap" but listed as "Valid" in the coverage report and many of them are still hack-related URLs that are listed as being indexed in March 2019, despite the fact that clicking on them leads to a 404. As of this Saturday, the number jumped up to 18,000, but I have no way of finding out using the search console reports why the jump happened or what are the new URLs that were added, the only sort mechanism is last crawled and they don't show up there.

      How long can I expect it to take for these remaining urls to also be removed from the index? Is there any way to expedite the process? I've submitted a 'new' sitemap several times, which (so far) has not helped.

      Is there any way to see inside the new GSC view why/how the number of valid URLs in the indexed doubled over one weekend?

      1 Reply Last reply Reply Quote 0
      • effectdigital
        effectdigital last edited by

        Google Search Console actually has a URL removal tool built into it, unfortunately it's not really scaleable (mostly it's one at a time submissions) and in addition to that the effect of using the tool is only temporary (the URLs come back again)

        In your case I reckon' that changing the status code of the 'gone' URLs from 404 ("temporarily not found, but will be returning soon") to 410 ("GONE!") might be a good idea. Google might digest that better as it's a harder indexation directive and a very strong crawl directive ("go away, don't come back!")

        You could also serve the Meta no-index directive on those URLs. Obviously you're unlikely to have access to the HTML of non-existent pages, but did you know Meta no-index can also be fired through x-robots, through the HTTP header? So it's not impossible

        https://developer.mozilla.org/en-US/docs/Web/HTTP/Status/404

        (Ctrl+F for "X-Robots-Tag HTTP header")

        Another option is this form to let Google know outdated content is gone, has been removed, and isn't coming back:

        https://www.google.com/webmasters/tools/removals

        ... but again, URLs one at a time is going to be mega-slow. It does work pretty well though (at least in my experience)

        In any eventuality I think you're looking at, a week or two for Google to start noticing in a way that you can see visually - and then maybe a month or two until it rights itself (caveat: it's different for all sites and URLs, it's variable)

        1 Reply Last reply Reply Quote 0
        • 1 / 1
        • First post
          Last post
        • A client rebranded a few years ago and doesn't want to be associated with it's old brand name. He wishes not to appear when the old brand is searched in Google, is there something we can do?
          0
          1
          28

        • Canonical URL's searchable in Google?
          DonnaDuncan
          DonnaDuncan
          0
          3
          92

        • Will disallowing URL's in the robots.txt file stop those URL's being indexed by Google
          Martijn_Scheijbeler
          Martijn_Scheijbeler
          0
          11
          1.6k

        • "Null" appearing as top keyword in "Content Keywords" under Google index in Google Search Console
          Tom-Anthony
          Tom-Anthony
          0
          5
          588

        • Is there any negative SEO effect of having comma's in URL's?
          gcdtechnologies
          gcdtechnologies
          0
          3
          2.6k

        • Posing QU's on Google Variables "aclk", "gclid" "cd", "/aclk" "/search", "/url" etc
          0
          1
          2.3k

        • Best solution to get mass URl's out the SE's index
          James77
          James77
          0
          3
          583

        • Export list of urls in google's index?
          nicole.healthline
          nicole.healthline
          0
          3
          2.6k

        Get started with Moz Pro!

        Unlock the power of advanced SEO tools and data-driven insights.

        Start my free trial
        Products
        • Moz Pro
        • Moz Local
        • Moz API
        • Moz Data
        • STAT
        • Product Updates
        Moz Solutions
        • SMB Solutions
        • Agency Solutions
        • Enterprise Solutions
        • Digital Marketers
        Free SEO Tools
        • Domain Authority Checker
        • Link Explorer
        • Keyword Explorer
        • Competitive Research
        • Brand Authority Checker
        • Local Citation Checker
        • MozBar Extension
        • MozCast
        Resources
        • Blog
        • SEO Learning Center
        • Help Hub
        • Beginner's Guide to SEO
        • How-to Guides
        • Moz Academy
        • API Docs
        About Moz
        • About
        • Team
        • Careers
        • Contact
        Why Moz
        • Case Studies
        • Testimonials
        Get Involved
        • Become an Affiliate
        • MozCon
        • Webinars
        • Practical Marketer Series
        • MozPod
        Connect with us

        Contact the Help team

        Join our newsletter
        Moz logo
        © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
        • Accessibility
        • Terms of Use
        • Privacy