The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Technical SEO Issues
    4. Should I set up a disallow in the robots.txt for catalog search results?

    Should I set up a disallow in the robots.txt for catalog search results?

    Technical SEO Issues
    5 3 1.7k
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • JordanJudson
      JordanJudson last edited by

      When the crawl diagnostics came back for my site its showing around 3,000 pages of duplicate content. Almost all of them are of the catalog search results page. I also did a site search on Google and they have most of the results pages in their index too. I think I should just disallow the bots in the /catalogsearch/ sub folder, but I'm not sure if this will have any negative effect?

      1 Reply Last reply Reply Quote 0
      • AlanBleiweiss
        AlanBleiweiss last edited by

        Jordan,

        Others might have a different view, however that's exactly what I recommend to clients.  but only if you've got other html link based ways for bots to get to all the content in a direct manner, and have a good sitemap.xml file to reinforce that.

        I am happy to see that you have a sound overall site architecture, however I see no robots.txt file at your root so I'm not sure what's up with that.  Also your sitemap.xml file only has 43 URLs in it.  that's a problem not because google can't find content by other means, it's just that I've found Google likes that reinforcement, and Bing especially does a better job discovering content with a proper sitemap.xml submitted through their webmaster system (they're less efficient at discovering content by other means).

        I'd also suggest you have a big push ahead in dealing with near-duplicate content.

        For example:

        http://www.durafaucet.com/mk850-orb.html

        http://www.durafaucet.com/kitchen-faucets/mk850.html

        Sure, these are unique products.  Except there's already so little unique content on either page that the common content compounded by the site-wide replication of top, sidebar and footer content means the total weight of uniqueness is on the very minor end of the spectrum.

        And then there's the issue of a complete lack of inbound link authority - OpenSiteExplorer.org might be wrong, but currently shows almost no inbound links.  Not only will you need inbound links to the home page, but also to as many inner pages as is realistic in terms of implementation capabilities go.  This is especially true for category level pages. (including a variety of inbound link anchor text - brand, domain, keyword phrase and generic text).

        So if you don't address those type of issues, removing all the dupes that show up in search now won't result in as much long-term value as you'll need.

        SteveOllington 1 Reply Last reply Reply Quote 2
        • SteveOllington
          SteveOllington @AlanBleiweiss last edited by

          Totally agree with Alan, it can cause circular navigation problems for crawlers too.

          1 Reply Last reply Reply Quote 1
          • JordanJudson
            JordanJudson last edited by

            Thanks Alan, you are right this site has quite a long way to go. The first crawl was just finished and I notice that the most errors were due to dupe content so I decided I would try and tackle that first. Thank you for all the pointers, I will be taking a look at all those as soon as I can.

            AlanBleiweiss 1 Reply Last reply Reply Quote 0
            • AlanBleiweiss
              AlanBleiweiss @JordanJudson last edited by

              One step at a time = long term success.  I wish you the best with it Jordan.

              1 Reply Last reply Reply Quote 0
              • 1 / 1
              • First post
                Last post
              • Robots.txt Disallow: / in Search Console
                GastonRiera
                GastonRiera
                0
                2
                124

              • Will a robots.txt disallow apply to a 301ed URL?
                Martijn_Scheijbeler
                Martijn_Scheijbeler
                0
                3
                158

              • Googlebot does not obey robots.txt disallow
                Cyrus-Shepard
                Cyrus-Shepard
                0
                12
                1.4k

              • Disallow: /search/ in robots but soft 404s are still showing in GWT and Google search?
                timhatton
                timhatton
                0
                7
                374

              • Allow or Disallow First in Robots.txt
                Net66SEO
                Net66SEO
                0
                12
                27.8k

              • Robots.txt Showing in SERP Results
                bobjones
                bobjones
                0
                5
                1.4k

              • Can I Disallow Faceted Nav URLs - Robots.txt
                AlanMosley
                AlanMosley
                0
                5
                914

              • How does google know a search result is a search result?
                Damien-Anderson
                Damien-Anderson
                0
                2
                642

              Get started with Moz Pro!

              Unlock the power of advanced SEO tools and data-driven insights.

              Start my free trial
              Products
              • Moz Pro
              • Moz Local
              • Moz API
              • Moz Data
              • STAT
              • Product Updates
              Moz Solutions
              • SMB Solutions
              • Agency Solutions
              • Enterprise Solutions
              • Digital Marketers
              Free SEO Tools
              • Domain Authority Checker
              • Link Explorer
              • Keyword Explorer
              • Competitive Research
              • Brand Authority Checker
              • Local Citation Checker
              • MozBar Extension
              • MozCast
              Resources
              • Blog
              • SEO Learning Center
              • Help Hub
              • Beginner's Guide to SEO
              • How-to Guides
              • Moz Academy
              • API Docs
              About Moz
              • About
              • Team
              • Careers
              • Contact
              Why Moz
              • Case Studies
              • Testimonials
              Get Involved
              • Become an Affiliate
              • MozCon
              • Webinars
              • Practical Marketer Series
              • MozPod
              Connect with us

              Contact the Help team

              Join our newsletter
              Moz logo
              © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
              • Accessibility
              • Terms of Use
              • Privacy