The Moz Q&A Forum

    Block search engines from URLs created by internal search engine?

    Intermediate & Advanced SEO
    • Voonie

      Hey guys,

      I've got a question that I've been pondering for a few days now. I'm currently doing a technical SEO audit for a large-scale directory.

      One major issue is that their internal search system (Directory Search) creates a new URL every time a user enters a search query. This creates huge amounts of duplication on the website.

      I'm wondering if it would be best to block search engines from crawling these URLs entirely with robots.txt?

      What do you guys think, bearing in mind there are probably thousands of these pages already in the Google index?

      Thanks

      Kim

      • Mark_Jay_Apsey_Jr.

        What does the content look like on the new URL? Can you give us an example?

          • Voonie, replying to @Mark_Jay_Apsey_Jr.

          Sure, see below for some examples of the duplication I mean:

          Capitalization Duplication

          http://yellow.co.nz/yellow+pages/Car+dealer/Auckland+Region

          http://yellow.co.nz/yellow+pages/Car+Dealer/Auckland+Region

          With a few URL parameters

          http://yellow.co.nz/yellow+pages/Car+Dealer/Auckland+Region?encodedRefinement=refineterms..%3D..%5E%22Car+%26+Truck+Dealers+-+Used.%3A.Makes.%3D.Toyota%22%24..%26..Makes+%28Toyota%29&display=&stageName=Composite+search&suppressMobileListings=false

          And with location duplication

          http://yellow.co.nz/yellow+pages/Car+Dealer/Auckland

          Let me know if you need any more info!

          Cheers

          Kim
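The three kinds of duplication listed above (capitalization, URL parameters, location names) can be collapsed by mapping every variant to one normalized URL. The following is a minimal sketch, not the site's actual logic: it assumes the lowercase form is the preferred one, that the query string never changes which page should be indexed, and that a hand-maintained alias table (`LOCATION_ALIASES`, a hypothetical name) covers location variants.

```python
from urllib.parse import urlsplit, urlunsplit

# Hypothetical alias table: short location names mapped to the preferred form.
LOCATION_ALIASES = {"auckland": "auckland+region"}


def normalize_search_url(url: str) -> str:
    """Map capitalization, parameter and location variants to one URL."""
    parts = urlsplit(url)
    # Assumption: the lowercase path is the preferred version of each page.
    segments = parts.path.lower().split("/")
    # Assumption: the location is always the last path segment.
    if segments[-1] in LOCATION_ALIASES:
        segments[-1] = LOCATION_ALIASES[segments[-1]]
    # Drop the query string and fragment entirely.
    return urlunsplit((parts.scheme, parts.netloc.lower(), "/".join(segments), "", ""))
```

Under these assumptions, all four example URLs in the post normalize to the same string, which could then serve as the target of a redirect or a rel="canonical" link.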

          • Dr-Pete

            It can be a complicated question on a very large site, but in most cases I'd META NOINDEX those pages. Robots.txt isn't great at removing content that's already been indexed: it stops crawling, but it doesn't tell Google to drop URLs it already knows about. Admittedly, NOINDEX will take a while to work (virtually any solution will), as Google probably doesn't crawl these pages very often.

            Generally, though, the risk of having your index explode with custom search pages is too high for a site like yours (especially post-Panda). I do think blocking those pages somehow is a good bet.

            The only exception I would add is if some of the more popular custom searches are getting traffic and/or links. I assume you have a solid internal link structure and other paths to these listings, but if it looks like a few searches (or a few dozen) have attracted traffic and backlinks, you'll want to preserve those somehow.
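One practical consequence of choosing NOINDEX over robots.txt: a crawler that is disallowed by robots.txt never fetches the page, so it never sees the noindex tag. The sketch below uses Python's standard `urllib.robotparser` to check that a URL is still crawlable before relying on the tag. The `/search/` path, the `example.com` host and both robots.txt bodies are made up for illustration and are not taken from the site in question.

```python
from urllib.robotparser import RobotFileParser

# Standard robots meta tag; it goes in the <head> of each page to be dropped.
NOINDEX_TAG = '<meta name="robots" content="noindex">'

# Hypothetical robots.txt variants for an assumed /search/ path.
ROBOTS_BLOCKING = """User-agent: *
Disallow: /search/
"""
ROBOTS_OPEN = """User-agent: *
Disallow:
"""


def crawler_can_see_noindex(robots_txt: str, url: str, agent: str = "Googlebot") -> bool:
    """Return True if robots.txt lets the agent fetch the page (and its tag)."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)
```

With `ROBOTS_BLOCKING`, the check returns False for `http://example.com/search/car-dealer`, meaning a noindex tag on that page would go unread. With `ROBOTS_OPEN` it returns True, which is the state the pages need to stay in until they have dropped out of the index.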

            • Voonie, replying to @Dr-Pete

              Thanks for your reply Dr. Meyers. I think you're probably right.

              Yes, I'm recommending they define a canonical set of pages (the most popular searches, categories and locations) that can be reached via internal links, and we'll get all the duplicates redirected back to that canonical set.

              But for pages that fall outside those categories and locations, I'll recommend a meta noindex tag.

                • Dr-Pete, replying to @Voonie

                That sounds perfect. If the user-generated URLs are getting enough traffic, make them permanent pages and 301-redirect or canonicalize the duplicates to them. If not, weed them out of the index.
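The plan agreed on in this thread comes down to a three-way decision per request: serve canonical pages normally, 301-redirect duplicates of a canonical page, and serve everything else with noindex. Here is a rough sketch of that decision, assuming the canonical set is stored as lowercase paths and that case and query string are the only differences between a duplicate and its canonical page. `CANONICAL_PATHS` and `plan_response` are illustrative names, not part of any real system.

```python
from typing import Optional, Tuple
from urllib.parse import urlsplit

# Hypothetical canonical set: popular searches kept as permanent pages.
CANONICAL_PATHS = {"/yellow+pages/car+dealer/auckland+region"}


def plan_response(url: str) -> Tuple[int, Optional[str], Optional[str]]:
    """Return (status code, redirect target, robots directive) for a URL."""
    parts = urlsplit(url)
    normalized = parts.path.lower()
    if normalized in CANONICAL_PATHS:
        if parts.path == normalized and not parts.query:
            # Already the canonical URL: serve it and let it be indexed.
            return (200, None, "index")
        # A case or parameter variant: send it to the canonical page.
        return (301, normalized, None)
    # Outside the canonical set: serve the page but keep it out of the index.
    return (200, None, "noindex")
```

A long-tail search such as `/yellow+pages/Plumber/Taupo` falls through to the last branch and gets `noindex`, while `/yellow+pages/Car+Dealer/Auckland+Region` is redirected to its lowercase canonical path.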
