The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Intermediate & Advanced SEO
    4. How to get a large number of urls out of Google's Index when there are no pages to noindex tag?

    How to get a large number of urls out of Google's Index when there are no pages to noindex tag?

    Intermediate & Advanced SEO
    2 2 50
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • 94501
      94501 last edited by

      Hi,

      I'm working with a site that has created a large group of urls (150,000)  that have crept into Google's index. If these urls actually existed as pages, which they don't, I'd just noindex tag them and over time the number would drift down.

      The thing is, they created them through a complicated internal linking arrangement that adds affiliate code to the links and forwards them to the affiliate. GoogleBot would crawl a link that looks like it's to the client's same domain and wind up on Amazon or somewhere else with some affiiiate code. GoogleBot would then grab the original link on the clients domain and index it... even though the page served is on Amazon or somewhere else. Ergo, I don't have a page to noindex tag.

      I have to get this 150K block of cruft out of Google's index, but without actual pages to noindex tag, it's a bit of a puzzler.

      Any ideas? Thanks! Best... Michael

      P.S.,

      All 150K urls seem to share the same url pattern... exmpledomain.com/item/...   so /item/ is common to all of them, if that helps.

      1 Reply Last reply Reply Quote 0
      • effectdigital
        effectdigital last edited by

        If no pages which support web coding actually exist for the URLs you want to remove from Google's index, I'd probably use the HTTP header instead. Maybe use the X-Robots directives:

        • https://yoast.com/x-robots-tag-play/
        • https://www.searchenginejournal.com/x-robots-tag-simple-alternate-robots-txt-meta-tag/67138/

        Even if you have no page with web-code, you can always have a HTTP Header. A HTTP header simply allows a client and / or server to fire additional information through 'requests' (post / get etc).

        This is the only thing I can think of which would really help. Some people might suggest robots.txt wildcards, but robots.txt handles crawling and not indexation (so those answers wouldn't really be worth anything to you)

        The other thing you could do (maybe combine this with the X-Robots stuff) is to get all of those URLs to serve status code 410 (gone) instead of 404 (temporarily gone, but coming back)

        1 Reply Last reply Reply Quote 0
        • 1 / 1
        • First post
          Last post
        • URL Injection Hack - What to do with spammy URLs that keep appearing in Google's index?
          Dezzign
          Dezzign
          0
          7
          3.6k

        • Does Google Read URL's if they include a # tag? Re: SEO Value of Clean Url's
          Atlanta-SMO
          Atlanta-SMO
          0
          6
          1.6k

        • Is it a problem that Google's index shows paginated page urls, even with canonical tags in place?
          94501
          94501
          0
          3
          249

        • Big discrepancies between pages in Google's index and pages in sitemap
          David-Kley
          David-Kley
          0
          6
          218

        • How can I get a list of every url of a site in Google's index?
          KaneJamison
          KaneJamison
          0
          8
          1.1k

        • Will Canonical tag on parameter URLs remove those URL's from Index, and preserve link juice?
          StreamlineMetrics
          StreamlineMetrics
          0
          2
          113

        • Sitemap - % of URL's in Google Index?
          irvingw
          irvingw
          0
          7
          677

        • Export list of urls in google's index?
          nicole.healthline
          nicole.healthline
          0
          3
          2.6k

        Get started with Moz Pro!

        Unlock the power of advanced SEO tools and data-driven insights.

        Start my free trial
        Products
        • Moz Pro
        • Moz Local
        • Moz API
        • Moz Data
        • STAT
        • Product Updates
        Moz Solutions
        • SMB Solutions
        • Agency Solutions
        • Enterprise Solutions
        • Digital Marketers
        Free SEO Tools
        • Domain Authority Checker
        • Link Explorer
        • Keyword Explorer
        • Competitive Research
        • Brand Authority Checker
        • Local Citation Checker
        • MozBar Extension
        • MozCast
        Resources
        • Blog
        • SEO Learning Center
        • Help Hub
        • Beginner's Guide to SEO
        • How-to Guides
        • Moz Academy
        • API Docs
        About Moz
        • About
        • Team
        • Careers
        • Contact
        Why Moz
        • Case Studies
        • Testimonials
        Get Involved
        • Become an Affiliate
        • MozCon
        • Webinars
        • Practical Marketer Series
        • MozPod
        Connect with us

        Contact the Help team

        Join our newsletter
        Moz logo
        © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
        • Accessibility
        • Terms of Use
        • Privacy