The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Intermediate & Advanced SEO
    4. Best way to "Prune" bad content from large sites?

    Best way to "Prune" bad content from large sites?

    Intermediate & Advanced SEO
    3 3 131
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • atomiconline
      atomiconline last edited by

      I am in process of pruning my sites for low quality/thin content. The issue is that I have multiple sites with 40k + pages and need a more efficient way of finding the low quality content than looking at each page individually. Is there an ideal way to find the pages that are worth no indexing that will speed up the process but not potentially harm any valuable pages?

      Current plan of action is to pull data from analytics and if the url hasn't brought any traffic in the last 12 months then it is safe to assume it is a page that is not beneficial to the site. My concern is that some of these pages might have links pointing to them and I want to make sure we don't lose that link juice. But, assuming we just no index the pages we should still have the authority pass along...and in theory, the pages that haven't brought any traffic to the site in a year probably don't have much authority to begin with.

      Recommendations on best way to prune content on sites with hundreds of thousands of pages efficiently?  Also, is there a benefit to no indexing the pages vs deleting them? What is the preferred method, and why?

      1 Reply Last reply Reply Quote 0
      • ChrisAshton
        ChrisAshton last edited by

        It's hard to say exactly without seeing your site since there are so many potential variables (e.g. are most of your blog posts low quality or just a minority? etc) that would define the best way to go about it.

        What I can say though is that you're on the right track as far as using analytics data to determine which ones are providing value right now. There is a danger in losing some rankings if you go removing a huge volume of these posts. Unless they're utter rubbish posts, they'll likely be providing relevance signals to Google on what your site is about. That said, I do think it's a necessary evil and I'd expect you'll be rewarded for it in the long run provided you start replacing the trash with high quality posts in the future.

        As for the benefits, if they really are low quality then user engagement is going to be terrible which is obviously not what you should be aiming for. It's also going to be chewing up your crawl budget for no good reason so the leaner your site is, the better base you have to start rebuilding with quality instead of quantity. For the same reason, I generally suggest removing tags and categories that aren't providing any actual benefit too - in most cases I see they're just there either "for good SEO" or because the site owners things that's how users are browsing their site but in almost all cases, that's not true. As always, check your own data on this to be sure.

        As for removing vs noindex, this one is always contentious but I lean toward removing simply because it's going to clean things up for the user too and ultimately they should be your primary focus. Having 40,000+ pages of trash on your website is a fantastic indicator to them that your site may not be somewhere they want to be and noindexing them won't do anything to change the user's experience.

        Hope that helps!

        1 Reply Last reply Reply Quote 1
        • julie-getonthemap
          julie-getonthemap last edited by

          I have a section of my website where I heavily use embedded content.  Embeds from Youtube, Slideshare, Twitter, Quora etc.  Google thinks they're thin, and they don't show up in my analytics because you can read the content without clicking on the page.

          http://getonthemap.us/twitter/blog

          But I like them, and I think they're helpful. So I no-indexed all but one of the blog posts in that section.  It retains the backlinks to the posts, but cleans me up with Google.

          If you're deleting, can't you do that quickly from your console?

          1 Reply Last reply Reply Quote 0
          • 1 / 1
          • First post
            Last post
          • Best to Combine Listing URLs? Are 300 Listing Pages a "Thin Content" Risk?
            julie-getonthemap
            julie-getonthemap
            0
            4
            88

          • Why does old "Free" site ranks better than new "Optimized" site?
            WhatUpHud
            WhatUpHud
            0
            5
            100

          • How should I react to my site being "attacked" by bad links?
            DarrenX
            DarrenX
            0
            4
            218

          • Best to Post Dynamic Content (Listings) under "Posts" in Wordpress?
            evolvingSEO
            evolvingSEO
            0
            2
            299

          • Best strategy for "product blocks" linking to sister site? Penguin Penalty?
            Thos003
            Thos003
            0
            4
            661

          • What's the best way to manage content that is shared on two sites and keep both sites in search results?
            BostonWright
            BostonWright
            0
            13
            285

          • Our Site's Content on a Third Party Site--Best Practices?
            nicole.healthline
            nicole.healthline
            1
            4
            269

          • Best way to find broken links on a large site?
            nicole.healthline
            nicole.healthline
            1
            3
            703

          Get started with Moz Pro!

          Unlock the power of advanced SEO tools and data-driven insights.

          Start my free trial
          Products
          • Moz Pro
          • Moz Local
          • Moz API
          • Moz Data
          • STAT
          • Product Updates
          Moz Solutions
          • SMB Solutions
          • Agency Solutions
          • Enterprise Solutions
          • Digital Marketers
          Free SEO Tools
          • Domain Authority Checker
          • Link Explorer
          • Keyword Explorer
          • Competitive Research
          • Brand Authority Checker
          • Local Citation Checker
          • MozBar Extension
          • MozCast
          Resources
          • Blog
          • SEO Learning Center
          • Help Hub
          • Beginner's Guide to SEO
          • How-to Guides
          • Moz Academy
          • API Docs
          About Moz
          • About
          • Team
          • Careers
          • Contact
          Why Moz
          • Case Studies
          • Testimonials
          Get Involved
          • Become an Affiliate
          • MozCon
          • Webinars
          • Practical Marketer Series
          • MozPod
          Connect with us

          Contact the Help team

          Join our newsletter
          Moz logo
          © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
          • Accessibility
          • Terms of Use
          • Privacy