The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Intermediate & Advanced SEO
    4. Panda Recovery - What is the best way to shrink your index and make Google aware?

    Panda Recovery - What is the best way to shrink your index and make Google aware?

    Intermediate & Advanced SEO
    20 9 3.3k
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • Kerry22
      Kerry22 @sp80 last edited by

      Hi

      No problem, I am happy to help!

      Yes. graph declined sloooowly but only when we started removing pages. This is half the problem - you have to wait for Google to find the changes. The waiting is frustrating as you don't know if what you have done is right, but the stuff I listed will help speed it up. We literally had to wait until none of the pages could be found in the index.

      I see a big increase in your indexation from April to May 2012. When did you get hit and what happened over that month - did you add a lot of new pages/products? Are those drops in indexation from June to Dec 2012 you removing pages or did the drop just start to 'happen' and then you got hit?

      1 Reply Last reply Reply Quote 0
      • sp80
        sp80 @sp80 last edited by

        Hey Kerry,

        There was addition of additional pages in April which is also when our sites started seeing a decrease in rankings - so the timing adds up.

        The drops starting June have no clear root for us - we started our de-indexation process starting of December.

        We are thinking to speed up indexation exclusively through a second Google Sitemap as anything else would need to be a very artificial landing page with a high number of links at this point. Would you be concerned exclusively using a Sitemap over keeping the unwanted pages linked from your linking structure?

        Further, I am interested in how you determined the set of pages you know were part of the Google index to be delisted? It appears the best way to do so is to scrape the Google search results of pages returned for a domain and build up a list this way.

        Did you recover completely to prior Panda?

        Best /Thomas

        1 Reply Last reply Reply Quote 0
        • Kerry22
          Kerry22 @sp80 last edited by

          Think second sitemap will be fine. Wouldn't add a page with just links as that is the type of page Panda doesn't like.

          Regarding sets of pages - we started by going into the search results - found a lot of content that shouldn't have been indexed.

          We  then looked  manually at the content on subsets of pages and found pages that were thin and very similar to others (at the product level) and either made them more unique or removed them. Tools like this also help identify similar pages across products/categories http://www.copyscape.com/compare.php

          It's only been 2 weeks, so it looks like we have pretty much 80% recovered and still improving - still looking at numbers and over Christmas and NY obviously traffic is quiet. I think 100% recovery is dependent on too many variables, like whether you continue link building during your time fixing the site, losing links by removing pages, adding more pages, competitors gaining authority/rankings etc

          bobforesi 1 Reply Last reply Reply Quote 0
          • Dr-Pete
            Dr-Pete last edited by

            If you want to completely remove these pages, I think Kerry22 is spot on. A 410 is about the fastest method we know of, and her points about leaving the crawl paths open are very important. I completely agree with leaving them in a stand-alone sitemap - that's good advice.

            Saw your other answer, so I assume you don't want to 301 or canonical these pages. The only caveat I'd add is user value. Even if the pages have no links, make sure people aren't trying to visit them.

            This can take time, especially at large scale, and a massive removal can look odd to Google. This doesn't generally result in a penalty or major problems, but it can cause short-term issues as Google re-evaluates the site.

            The only option to speed it up is, if the pages have a consistent URL parameter or folder structure, you may be able to do a mass removal in Google Webmaster Tools. This can be faster, but it's constrained to similar-looking URLs. In other words, there has to be a pattern. The benefit is that you can make the GWT request on top of the 410s, so that can sometimes help. Any massive change takes time, though, and often requires some course correction, I find.

            sp80 Eric_R 2 Replies Last reply Reply Quote 1
            • sp80
              sp80 @Dr-Pete last edited by

              Thanks Pete,

              I appreciate your input. Next to the additional sitemap with the known Google-indexed URLs we want deindexed, we also have reopened some crawl paths to these pages to see if there is a speed up.

              This is an undertaking carried out across 30 international properties so we will be able to experiment with measures for certain domains and see how it affects de-indexing speed as we are tracking the numbers reported by Google daily.

              I agree about the bad user experience of 410s as a dead end. We are mostly de-indexing as a mean of recovery from Panda but the content pages that we try to deindex are actually still useful to the users, just thin and partially duplicative in content. We have decided to still display the content when such page is reached but return a status code of 410. Alternatively it seems we could just set the robot tag to noindex but my feeling is the 410 approach will lead to faster deindexing - would you agree?

              Also if you have any expertise to share on how to compile a more ocomprehensive list of URLs indexed by Google for a particular domain other than scraping the web interface using the site:domain.com query approach (only returns a small subset compared to the stated total number of indexed pages) please let me know.

              Thanks again /Thomas

              Dr-Pete 1 Reply Last reply Reply Quote 0
              • Dr-Pete
                Dr-Pete @sp80 last edited by

                It might be worth exploring NOINDEX'ing the useful pages and 410'ing the non-useful ones, if only because sometimes a mix of signals is more palatable to Google. Any time you remove a swatch of content with one method, it can trigger alarm bells. I'll be honest, though - these situations are almost always tricky and you almost always have to measure and adjust. I've never found a method that's right for all situations.

                1 Reply Last reply Reply Quote 0
                • Eric_R
                  Eric_R @Dr-Pete last edited by

                  Hi Dr. Pete,

                  I know this is a late entry into this thread, but.. what if we did all our content cutting in the wrong ways over the past year – is there something we could/should do now to correct for this?  Our site was hit by panda back in March 2012, and since then we've cut content several times.  But we didn’t use this good process you advocate – here’s what we did when we cut pages:

                  1. We set up permanent 301 redirects for all of them immediately
                  2. Simultaneously, we always removed all links pointing to cut pages (we wanted to make sure users didn’t get redirected all the time)

                  This is a far cry from what you recommend and what Kerry22 did to recover successfully. If you have some advice on the following questions, I’d definitely appreciate it:

                  - Is it possible Google still thinks we have this content on our site or intend to bring it back, and as a result we continue to suffer?

                  • If that is a possibility, then what can we do now (if anything) to correct the damage we did?

                  We're thinking about removing all of those 301s now, letting all cut content return 404s and making a separate sitemap of cut content to submit it to Google.  Do you think it's too late or otherwise inadvisable for us to do this kind of thing?

                  Thanks in advance,
                  Eric

                  Dr-Pete 1 Reply Last reply Reply Quote 0
                  • Dr-Pete
                    Dr-Pete @Eric_R last edited by

                    Did Google process the 301s? In other words, are the old pages still in the index or not? If they processed the 301s eventually, you generally should be ok. If the old URLs seem stranded, then you might be best setting up the XML sitemap with those old URLs to just kick Google a little. I don't think I'd switch signals and move from a 301 to 404, unless the old pages are low quality, had bad links, etc.

                    Unfortunately, these things are very situational, so it can be hard to speak in generalities.

                    1 Reply Last reply Reply Quote 0
                    • bobforesi
                      bobforesi @Kerry22 last edited by

                      Hi kerry,

                      Your post gives me some hope. I was hit by Panda in Feb. 2011 and lost 85% of my google traffic Made many changes to my site -- page deletions re-directs added content etc. Got a bump of 25% in September 2011 but lost that and more afterward.

                      We have an e-commerce gift site with 6000 pages. Is your site an e-commerce site?

                      I have not found a recovery story from any sites like mine that were hit with that large a drop.

                      I hope your recovery would relate to my situation.

                      1 Reply Last reply Reply Quote 0
                      • TomNYC
                        TomNYC last edited by

                        Hi Sp80 (and group),

                        It's been about six months since you posted your Panda recovery question. I'm curious if you implemented Kerry22's suggestions, and what results you've seen. I hope it's worked out for you.

                        We're also dealing with removing thousands of pages of thin content (through 410s, keeping links up and sitemaps, as per Kerry's suggestion). This was a very helpful discussion to read.

                        Thanks,

                        Tom

                        1 Reply Last reply Reply Quote 0
                        • ABK717
                          ABK717 last edited by

                          Hi. I would be curious to know if anyone else has experienced something similar and recovered from Panda. How long did it take you? Did you manually remove the pages, set up 410s or 404s, or create 301s?

                          I've been working on a site for sometime now which has lost a great of traffic since July 2013. Over the past 2 months, a process has gone underway to manually remove the URLs from the index. The index has been cut in half, but still not at what it was pre-penalty. About 20,000 more pages to figure out what needs to be removed before it reaches the level it was before the massive traffic drop.

                          Any recovery or insight would be helpful.

                          1 Reply Last reply Reply Quote 0
                          • 1 / 1
                          • First post
                            Last post
                          • Google Rank 0 - Best way?
                            paints-n-design
                            paints-n-design
                            0
                            2
                            67

                          • New Subdomain & Best Way To Index
                            0
                            1
                            87

                          • How to best add affiliate links in a way that minimizes panda risk?
                            lcourse
                            lcourse
                            0
                            3
                            95

                          • Best way to remove full demo (staging server) website from Google index
                            EugeneSong
                            EugeneSong
                            0
                            8
                            6.5k

                          • Best way to permanently remove URLs from the Google index?
                            MoosaHemani
                            MoosaHemani
                            0
                            4
                            7.2k

                          • Post your 3 best ways to rank well on Google
                            Horizon
                            Horizon
                            0
                            2
                            1.8k

                          • Best way to de-index content from Google and not Bing?
                            ShaMenz
                            ShaMenz
                            0
                            5
                            1.4k

                          • Best way to stop pages being indexed and keeping PageRank
                            Sebes
                            Sebes
                            0
                            2
                            825

                          Get started with Moz Pro!

                          Unlock the power of advanced SEO tools and data-driven insights.

                          Start my free trial
                          Products
                          • Moz Pro
                          • Moz Local
                          • Moz API
                          • Moz Data
                          • STAT
                          • Product Updates
                          Moz Solutions
                          • SMB Solutions
                          • Agency Solutions
                          • Enterprise Solutions
                          • Digital Marketers
                          Free SEO Tools
                          • Domain Authority Checker
                          • Link Explorer
                          • Keyword Explorer
                          • Competitive Research
                          • Brand Authority Checker
                          • Local Citation Checker
                          • MozBar Extension
                          • MozCast
                          Resources
                          • Blog
                          • SEO Learning Center
                          • Help Hub
                          • Beginner's Guide to SEO
                          • How-to Guides
                          • Moz Academy
                          • API Docs
                          About Moz
                          • About
                          • Team
                          • Careers
                          • Contact
                          Why Moz
                          • Case Studies
                          • Testimonials
                          Get Involved
                          • Become an Affiliate
                          • MozCon
                          • Webinars
                          • Practical Marketer Series
                          • MozPod
                          Connect with us

                          Contact the Help team

                          Join our newsletter
                          Moz logo
                          © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
                          • Accessibility
                          • Terms of Use
                          • Privacy