The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Intermediate & Advanced SEO
    4. Please help :) Troubles getting 3 types of content de-indexed

    Please help :) Troubles getting 3 types of content de-indexed

    Intermediate & Advanced SEO
    14 2 174
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • evolvingSEO
      evolvingSEO last edited by

      Hi There

      For all these cases above, this may be a situation where you've BOTH blocked these in robots.txt and added noindex tags. You can not block the directories in robots.txt and get them deindexed, because Google can not then crawl the URLs to see the noindex tag.

      If this is the case, I would remove any disallows to /tag/ etc in robots.txt, allow Google to crawl the URLs to see the nodinex tags - wait a few weeks and see what happens.

      As far as the URL removal not working, make sure you have the correct subdomain registered - www or non-www etc for the URLs you want removed.

      If neither one of those is the issue, please write back so I can try to help you more with that. Google should noindex the pages in a week or two under normal situations. The other thing is, check the cache date of the pages. If the cache dates are prior to the date you added the noindex, Google might not have seen the noindex directives yet.

      -Dan

      Ltsmz 1 Reply Last reply Reply Quote 1
      • Ltsmz
        Ltsmz @evolvingSEO last edited by

        Hey Dan thanks,
        well, so google had indexed all my tags, categories and stuff.

        The only things I had blocked in my robots was
        /go/ for affiliate links
        and
        /plugins/ for plugins

        so I did let google see that categories and archives pages were no-indexed.

        I have also submit the removal request many months ago but I haven't quite understood what you say about the cache dates. What should I check?

        Thanks for your help!

        evolvingSEO 1 Reply Last reply Reply Quote 0
        • evolvingSEO
          evolvingSEO @Ltsmz last edited by

          Hi There

          Should have explained better 🙂

          if you type cache: in front of any web URL for example cache:apple.com you get;

          http://webcache.googleusercontent.com/search?q=cache%3Aapple.com&oq=cache%3Aapple.com&aqs=chrome..69i57j69i58.8119j0j7&sourceid=chrome&espv=210&es_sm=91&ie=UTF-8

          And see the "cache" date? This is not the same as the crawl date, but it can give you a rough indication of how often Google might be looking at your pages.

          So try that on some of your tag archives and if the cache date is say 4+ weeks ago maybe Google isn't looking at the site very often.

          But it's odd they haven't been removed yet, especially with the URL removal tool - that tool usually only takes a day. Noindex tags usually only take a week or two.

          Have you examined the source code to make sure it does in fact say "noindex" by the robots tag - or that there is not a conflicting duplicate robots noindex tag? Sometimes wordpress themes and plugins both try adding SEO tags and you can end up with duplicates.

          -Dan

          Ltsmz 1 Reply Last reply Reply Quote 1
          • Ltsmz
            Ltsmz @evolvingSEO last edited by

            Hey Dan, thanks a lot for your help.

            I have tried the cache trick on my home page and the cached version was about 4-5 days old.

            I have then tried to cache:mywebsite/tag/ and it gives me a google 404 not found which I suppose is a good sign.

            But if they have been de-indexed why do they appear in search results then?

            I am not sure how to check the double SEO no-index in the source code though. How do I do that exactly? What should I look for after right-clicking -> source code?

            Thanks for your help!

            My MOZ account ends in two days so I may not be able to reply back next time.

            evolvingSEO 1 Reply Last reply Reply Quote 0
            • evolvingSEO
              evolvingSEO @Ltsmz last edited by

              Hey There

              You want to look for this;

              You can just do a cntrl-f (to search text in the source) and type in "noindex" and it should be present on the Tag archives.

              -Dan

              Ltsmz 1 Reply Last reply Reply Quote 1
              • Ltsmz
                Ltsmz @evolvingSEO last edited by

                Hey Dan,
                thanks for the quick reply.

                I have gone trough site:mywebsite.com and I found that tags and categories disappeared but there still is some content that shouldn't be indexed like this:

                mywebsite.com/wp-content/plugins/wp-flash-countdown/counter_cs3_v2_NoReflectLight.swf

                and this:
                mywebsite.com/go/affiliate-product/

                and I found this:Disallow: /wp-content/plugins/
                in my robots.txt

                Thing is that:

                1. I have deleted that wp-flash-countdown plugin at least 9 months ago
                2. I have manually removed all the urls with /go/ from GWMT and when I search for a cached version of them they are not there
                3. If I remove Disallow: /wp-content/plugins/ from my robots.txt won't that get all my plugins' pages to be indexed? So how do I make sure they are not indexed?

                Thank you so much for your help!So far you have been the most helpful answerer in this forum.

                evolvingSEO 1 Reply Last reply Reply Quote 0
                • evolvingSEO
                  evolvingSEO @Ltsmz last edited by

                  Hi There

                  1. For the flash file NoReflectLight.swf - I would do a removal request in WMT and maintain the blocking in robots.txt of /plugins/

                  2. When you do a URL removal in WMT the files need to either be blocked in robots.txt or have a noindex on them or 404. Doesn't that sort of link redirect to your affiliate product? In other words, if I were to try to visit /go/affiliate-product/ it would redirect to www.affiliateproductwebsite.com ?Or does /go/affiliate-product/ load it's on page on your site?

                  3. I would maintain the robots.txt bloking on /plugins/ - if no other files from there are indexed, they will not be in the future.

                  -Dan

                  Ltsmz 1 Reply Last reply Reply Quote 1
                  • Ltsmz
                    Ltsmz @evolvingSEO last edited by

                    Hi Dan,

                    1. Ok! I will.

                    2. When I click on the /go/ link in search results it redirects me to the affiliate website. I asked for the removal of /go/ a few days ago, but they (about 30 results) still appear in google when I search with the site:mywebsite.com trick.

                    What should I do about it? How can I get rid of them? They were created with the SimpleUrl plugin which I deleted about 3 months ago though.

                    3. Got it!

                    Thanks!
                    Fabio

                    evolvingSEO 1 Reply Last reply Reply Quote 0
                    • evolvingSEO
                      evolvingSEO @Ltsmz last edited by

                      Hey Fabio

                      Regarding #2 I'd give it a little bit more time. 301's take a little longer to drop out, so maybe check back in a week or two 🙂 Technically the URL removal will mainly work if the content now 404's, is noindexed or blocked in robots.txt but with a redirdect you can do none of those, so you just have to wait for them to pick up on the redirects.

                      -Dan

                      Ltsmz 1 Reply Last reply Reply Quote 1
                      • Ltsmz
                        Ltsmz @evolvingSEO last edited by

                        Hey Dan thanks a lot for all your help!
                        There still is a problem though. A while ago I had created an adult subdomain: adult.mywebsite.com

                        Then I completely deleted everything inside it (even though I noticed the subfolder is still in my account).
                        A few days ago, when I started this thread, I also created a GWMT account for adult.mywebsite.com and submitted a removal request for all those URLs (about 15).

                        Now today when I check:
                        site:mywebsite.com
                        or
                        site.adult.mywebsite.com

                        the URLs still appear in search results.

                        When I check
                        cache:adult.mywebsite.comit sends me to a google 404 page:
                        http://webcache.googleusercontent.com/search?/complete/search?client=hp&hl=en&gs_rn=31&gs_ri=hp&cp=26&gs_id=s xxxxxxxxxxxxxxxxxxxxxxxx

                        So I don't know what this means...
                        Does it mean google hasn't deindexed them?
                        How do I get them deindexed?
                        Is it possible google is having troubles de-indexing them because they have no content in them or something like that?

                        What should I do to get rid of them?

                        Thanks a lot!!!!!!!!!!
                        Fabio

                        evolvingSEO 1 Reply Last reply Reply Quote 0
                        • evolvingSEO
                          evolvingSEO @Ltsmz last edited by

                          Hi There

                          You should ensure the content either;

                          • has meta noindex tags
                          • or is blocked with robots.txt
                          • or 404's or 410's (is missing)

                          And then use the URL removal tool again and see if that works.

                          Ltsmz 1 Reply Last reply Reply Quote 1
                          • Ltsmz
                            Ltsmz @evolvingSEO last edited by

                            Hey Dan,there is no content.
                            The whole website has been deleted, but it still appears in search results.

                            What should I do?
                            should I put back some content and then de-index it?

                            Thanks!
                            fabio

                            evolvingSEO 1 Reply Last reply Reply Quote 0
                            • evolvingSEO
                              evolvingSEO @Ltsmz last edited by

                              Hi Fabio

                              If the content is gone when you visit your old URLs do you get a 404 code? You can plug the old URLs into urivalet.com to see what code is returned. If you do, then you're all set. If you don't, see if you can just upload a robots.txt file to that subdomain and block all search engines. Here's info on how to do that http://www.robotstxt.org/robotstxt.html

                              -Dan

                              1 Reply Last reply Reply Quote 0
                              • 1 / 1
                              • First post
                                Last post
                              • My url disappeared from Google but Search Console shows indexed. This url has been indexed for more than a year. Please help!
                                jacobmartinnn
                                jacobmartinnn
                                0
                                3
                                74

                              • Queries on sitemap and indexing. Please help
                                gowthamsm
                                gowthamsm
                                0
                                4
                                123

                              • Website Ranks and gets de indexed ??
                                Mobilio
                                Mobilio
                                0
                                3
                                126

                              • Complicated Duplicate Content Question...but it's fun, so please help.
                                DmitriiK
                                DmitriiK
                                0
                                4
                                128

                              • Content question please help
                                DirkC
                                DirkC
                                0
                                2
                                106

                              • Blog Content In different language not indexed - HELP PLEASE!
                                MattAntonino
                                MattAntonino
                                0
                                6
                                107

                              • Website is not indexed in Google, please help with suggestions
                                jesse-landry
                                jesse-landry
                                0
                                7
                                175

                              • Best way to de-index content from Google and not Bing?
                                ShaMenz
                                ShaMenz
                                0
                                5
                                1.4k

                              Get started with Moz Pro!

                              Unlock the power of advanced SEO tools and data-driven insights.

                              Start my free trial
                              Products
                              • Moz Pro
                              • Moz Local
                              • Moz API
                              • Moz Data
                              • STAT
                              • Product Updates
                              Moz Solutions
                              • SMB Solutions
                              • Agency Solutions
                              • Enterprise Solutions
                              • Digital Marketers
                              Free SEO Tools
                              • Domain Authority Checker
                              • Link Explorer
                              • Keyword Explorer
                              • Competitive Research
                              • Brand Authority Checker
                              • Local Citation Checker
                              • MozBar Extension
                              • MozCast
                              Resources
                              • Blog
                              • SEO Learning Center
                              • Help Hub
                              • Beginner's Guide to SEO
                              • How-to Guides
                              • Moz Academy
                              • API Docs
                              About Moz
                              • About
                              • Team
                              • Careers
                              • Contact
                              Why Moz
                              • Case Studies
                              • Testimonials
                              Get Involved
                              • Become an Affiliate
                              • MozCon
                              • Webinars
                              • Practical Marketer Series
                              • MozPod
                              Connect with us

                              Contact the Help team

                              Moz logo
                              © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
                              • Accessibility
                              • Terms of Use
                              • Privacy