The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Technical SEO Issues
    4. Robots.txt on refinements

    Robots.txt on refinements

    Technical SEO Issues
    4 4 176
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • Gordian
      Gordian last edited by

      In dealing with Panda do you think it is a good idea to put all refinements for category pages in the robots.txt file? We already have a lot as noindex, follow but I am wondering if it would be better to address from a crawl perspective as the pages are probably thin duplicate content to Google.

      1 Reply Last reply Reply Quote 0
      • LesleyPaone
        LesleyPaone last edited by

        I don't know if you have those taco commercials where you live that have the little girl in them that says "Why not both!", but you might do that, it would not hurt and it would make you sleep better at night.

        Oh, here is a link to the commercial, https://www.youtube.com/watch?v=vqgSO8_cRio

        1 Reply Last reply Reply Quote 0
        • ThompsonPaul
          ThompsonPaul last edited by

          One of the most common mistakes I see in SEO... There's nothing about the robots.txt that keeps pages from being indexed. In fact, just the opposite. If you have existing pages to which you've added no-index, but you also block them with robots.txt, then the search crawler will never see them to pick up the no-index and therefore won't know it's supposed to remove them. So they would still count against you as thin content even though they're not being crawled. NOT the result you're looking for.

          If you can no-index them, great. If not, at least use canonical tags to point them to the primary version of the category page. (Remember no-index is a FAR stronger command to the search engines then canonical tags, which they take as "suggestions".)

          The only time it's appropriate to block no-indexed pages with robots is if you're absolutely certain the pages have never made it into the index in the first place. If they've never been indexed, you can no-index them for security, and then drop them behind the robots.txt to save crawl budget.

          Hope that makes sense?

          Paul

          1 Reply Last reply Reply Quote 1
          • evolvingSEO
            evolvingSEO last edited by

            Hi There

            In general you probably don't need to do that. Here's how I would normally deal with indexation in WordPress (assuming you're using WordPress);

            • Categories - index
            • Tags - noindex
            • Date archives - noindex
            • Author (single author blogs) - noindex
            • Author (multi-author) - index
            • Subpages - noindex

            Basically all these settings are shown in my post here on setting up WordPress: http://moz.com/blog/setup-wordpress-for-seo-success

            Yoast is the best plugin to do all this with!

            1 Reply Last reply Reply Quote 1
            • 1 / 1
            • First post
              Last post
            • Robots.txt
              MarieHaynes
              MarieHaynes
              0
              8
              115

            • Robots.txt
              Dan-Lawrence
              Dan-Lawrence
              0
              5
              99

            • Robots txt
              LadyApollo
              LadyApollo
              0
              3
              427

            • Robots.txt usage
              sesertin
              sesertin
              0
              6
              527

            • Invisible robots.txt?
              AjazMozPro
              AjazMozPro
              0
              7
              3.1k

            • Robots.txt
              JordanGodbey
              JordanGodbey
              0
              6
              619

            • Robots.txt and robots meta
              TheEspresseo
              TheEspresseo
              0
              5
              1.1k

            • Robots.txt
              Tom-Anthony
              Tom-Anthony
              0
              4
              1.1k

            Get started with Moz Pro!

            Unlock the power of advanced SEO tools and data-driven insights.

            Start my free trial
            Products
            • Moz Pro
            • Moz Local
            • Moz API
            • Moz Data
            • STAT
            • Product Updates
            Moz Solutions
            • SMB Solutions
            • Agency Solutions
            • Enterprise Solutions
            • Digital Marketers
            Free SEO Tools
            • Domain Authority Checker
            • Link Explorer
            • Keyword Explorer
            • Competitive Research
            • Brand Authority Checker
            • Local Citation Checker
            • MozBar Extension
            • MozCast
            Resources
            • Blog
            • SEO Learning Center
            • Help Hub
            • Beginner's Guide to SEO
            • How-to Guides
            • Moz Academy
            • API Docs
            About Moz
            • About
            • Team
            • Careers
            • Contact
            Why Moz
            • Case Studies
            • Testimonials
            Get Involved
            • Become an Affiliate
            • MozCon
            • Webinars
            • Practical Marketer Series
            • MozPod
            Connect with us

            Contact the Help team

            Join our newsletter
            Moz logo
            © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
            • Accessibility
            • Terms of Use
            • Privacy