The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Technical SEO Issues
    4. Huge number of indexed pages with no content

    Huge number of indexed pages with no content

    Technical SEO Issues
    5 2 390
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • Dilbak
      Dilbak last edited by

      Hi,

      We have accidentally had Google indexed lots os our pages with no useful content at all on them.

      The site in question is a directory site, where we have tags and we have cities. Some cities have suppliers for almost all the tags, but there are lots of cities, where we have suppliers for only a handful of tags.

      The problem occured, when we created a page for each cities, where we list the tags as links.

      Unfortunately, our programmer listed all the tags, so not only the ones, where we have businesses, offering their services, but all of them!

      We have 3,142 cities and 542 tags. I guess, that you can imagine the problem this caused!

      Now I know, that Google might simply ignore these empty pages and not crawl them again, but when I check a city (city site:domain) with only 40 providers, I still have 1,050 pages indexed. (Yes, we have some issues between the 550 and the 1050 as well, but first things first:))

      These pages might not be crawled again, but will be clicked, and bounces and the whole user experience in itself will be terrible.

      My idea is, that I might use meta noindex for all of these empty pages and perhaps also have a 301 redirect from all the empty category pages, directly to the main page of the given city.

      Can this work the way I imagine? Any better solution to cut this really bad nightmare short?

      Thank you in advance.

      Andras

      1 Reply Last reply Reply Quote 0
      • JoshPugh
        JoshPugh last edited by

        I would agree, just setup a 301 redirect so that users don't bounce and actually get directed to something remotely useful, even just a listing of all the tags around the site or a home page or something (even if you do the below, to ensure users who stumble on these pages are still happy).

        You could also use a robots.txt file to show which ones you don't want to be indexed, and finally you may also use Google's Webmaster Tools to manually remove particular pages!

        A combo of all of those will work a treat!

        Dilbak 1 Reply Last reply Reply Quote 0
        • Dilbak
          Dilbak @JoshPugh last edited by

          Thank you for your reply, Josh.

          I will then use the 301, but should I also use the noindex tag for these pages to be removed from the index?

          Does it make an emphasis on my intention, or it adds no extra to the process? Perhaps, they should not be used together at all, as basically they are meant for different tasks.

          (Unfortunatyly, robots.txt is not really a solution, as we have the following url structure:

          www.example.com/city/tag

          Since all the cities have at least a couple of valid tags, I can't specify the path to be excluded from indexing. I would also try not to add 2,000+ cities individually.

          As for GWT, url removal for this number of pages might also not be an option, as I have minimum 100,000+ no-value pages to be removed (the limit is 500 per month).)

          JoshPugh Dilbak 2 Replies Last reply Reply Quote 0
          • JoshPugh
            JoshPugh @Dilbak last edited by

            NoIndex I think is slightly superfluous as the 301 will take care of it and also point people to a proper result and give Google a redirected result.

            However SEOMoz's Robots information page page suggests:

            "In most cases, meta robots with parameters "noindex, follow" should be employed as a way to to restrict crawling or indexation."

            • So maybe consider that...

            As for Robots, you can check out SEOMoz's Robots information page where it has information on wildcards, which you could use, which I THINK would work (i.e. http://domain.com/*/tags ?

            Not quite sure on that last bit though...

            1 Reply Last reply Reply Quote 1
            • Dilbak
              Dilbak @Dilbak last edited by

              Thank you again, John. I will fix this, based on our discussion.

              1 Reply Last reply Reply Quote 0
              • 1 / 1
              • First post
                Last post
              • Number of index pages in web master is different from site:mydomainname
                DirkC
                DirkC
                0
                2
                113

              • Number of indexed pages dropped dramatically
                MikeRoberts
                MikeRoberts
                0
                4
                1.1k

              • Home page indexed but not ranking...interior pages with thin content outrank home page??
                DougHosmer
                DougHosmer
                0
                3
                294

              • Duplicate page content - index.html
                Collie
                Collie
                0
                5
                1.2k

              • The number of pages indexed on Bing DROPPED significantly.
                joony2008
                joony2008
                0
                4
                298

              • I am trying to correct error report of duplicate page content. However I am unable to find in over 100 blogs the page which contains similar content to the page SEOmoz reported as having similar content is my only option to just dlete the blog page?
                evolvingSEO
                evolvingSEO
                0
                4
                344

              • Number of Indexed Pages in Webmaster Tools
                ResslerMotors
                ResslerMotors
                0
                6
                441

              • Discrepency between # of pages and # of pages indexed
                Dan-Petrovic
                Dan-Petrovic
                0
                14
                990

              Get started with Moz Pro!

              Unlock the power of advanced SEO tools and data-driven insights.

              Start my free trial
              Products
              • Moz Pro
              • Moz Local
              • Moz API
              • Moz Data
              • STAT
              • Product Updates
              Moz Solutions
              • SMB Solutions
              • Agency Solutions
              • Enterprise Solutions
              • Digital Marketers
              Free SEO Tools
              • Domain Authority Checker
              • Link Explorer
              • Keyword Explorer
              • Competitive Research
              • Brand Authority Checker
              • Local Citation Checker
              • MozBar Extension
              • MozCast
              Resources
              • Blog
              • SEO Learning Center
              • Help Hub
              • Beginner's Guide to SEO
              • How-to Guides
              • Moz Academy
              • API Docs
              About Moz
              • About
              • Team
              • Careers
              • Contact
              Why Moz
              • Case Studies
              • Testimonials
              Get Involved
              • Become an Affiliate
              • MozCon
              • Webinars
              • Practical Marketer Series
              • MozPod
              Connect with us

              Contact the Help team

              Join our newsletter
              Moz logo
              © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
              • Accessibility
              • Terms of Use
              • Privacy