The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Technical SEO Issues
    4. How to block "print" pages from indexing

    How to block "print" pages from indexing

    Technical SEO Issues
    23 5 4.5k
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • dreadmichael
      dreadmichael last edited by

      I have a fairly large FAQ section and every article has a "print" button. Unfortunately, this is creating a page for every article which is muddying up the index - especially on my own site using Google Custom Search.

      Can you recommend a way to block this from happening?

      Example Article:

      http://www.knottyboy.com/lore/idx.php/11/183/Maintenance-of-Mature-Locks-6-months-/article/How-do-I-get-sand-out-of-my-dreads.html

      Example "Print" page:

      http://www.knottyboy.com/lore/article.php?id=052&action=print

      1 Reply Last reply Reply Quote 0
      • SEODinosaur
        SEODinosaur last edited by

        you can block in .robot text, every page that ends in action=print

        dreadmichael 1 Reply Last reply Reply Quote 0
        • dreadmichael
          dreadmichael @SEODinosaur last edited by

          That would be great. Do you mind giving me an example?

          SEODinosaur 2 Replies Last reply Reply Quote 0
          • jennita
            jennita last edited by

            Rather than using robots.txt I'd use a noindex,follow tag instead to the page. This code goes into the tag for each print page. And it will ensure that the pages don't get indexed but that the links are followed.

            SEODinosaur Dr-Pete 2 Replies Last reply Reply Quote 1
            • SEODinosaur
              SEODinosaur @jennita last edited by

              Theres more then one way to skin a chicken.

              jennita SEODinosaur 2 Replies Last reply Reply Quote 0
              • SEODinosaur
                SEODinosaur @dreadmichael last edited by

                Try This.

                User-agent: *

                Disallow: /*&action=print

                1 Reply Last reply Reply Quote 0
                • SEODinosaur
                  SEODinosaur @dreadmichael last edited by

                  http://www.seomoz.org/learn-seo/robotstxt

                  1 Reply Last reply Reply Quote 1
                  • NakulGoyal
                    NakulGoyal last edited by

                    I actually remember Lore from a while ago. It's an interesting, easy to use FAQ CMS.

                    Anyways, I would also recommend implementing Canonical Tags for any possible duplicate content issues. So whether it's the print or the web version, each one of them will contain a canonical tag pointing to the web url of that article in the section of your website.

                    rel="canonical" href="http://www.knottyboy.com/lore/idx.php/11/183/Maintenance-of-Mature-Locks-6-months-/article/How-do-I-get-sand-out-of-my-dreads.html" />
                    dreadmichael SEODinosaur 2 Replies Last reply Reply Quote 1
                    • dreadmichael
                      dreadmichael last edited by

                      Thanks Donnie. Much appreciated!

                      SEODinosaur 1 Reply Last reply Reply Quote 1
                      • jennita
                        jennita @SEODinosaur last edited by

                        True but using robots.txt does not keep them out of the index. Only using "noindex" will do that.

                        1 Reply Last reply Reply Quote 1
                        • dreadmichael
                          dreadmichael @NakulGoyal last edited by

                          Ya it is actually really useful. Unfortunately they are out of business now - so I'm hacking it on my own.

                          I will take your advice. I've shamefully never used rel= canonical before - so now is a good time to start.

                          NakulGoyal SEODinosaur 3 Replies Last reply Reply Quote 0
                          • NakulGoyal
                            NakulGoyal @dreadmichael last edited by

                            Yes, it's strongly recommended. It should be fairly simple to populate this tag with the "full" URL of the article based on the article ID. This approach will not only help you get rid of the duplicate content issue, but a canonical tag essentially works like a 301 redirect. So from all search engine perspective you are 301'ing your print pages to the real web urls without redirecting the actual user's who are browsing the print pages if they need to.

                            1 Reply Last reply Reply Quote 0
                            • Dr-Pete
                              Dr-Pete @jennita last edited by

                              I have to agree with Jen - Robots.txt isn't great for getting indexed pages out. It's good for prevention, but tends to be unreliable as a cure. META NOINDEX is probably more reliable.

                              One trick - DON'T nofollow the print links, at least not yet. You need Google to crawl and read the NOINDEX tags. Once the ?print pages are de-indexed, you could nofollow the links, too.

                              1 Reply Last reply Reply Quote 0
                              • SEODinosaur
                                SEODinosaur @NakulGoyal last edited by

                                Yes, but Rel=Canonical does not block a page it only tells google which page to follow out of two pages.The question was how to block, not how to tell google which link to follow. I believe you gave credit to the wrong answer.

                                http://en.wikipedia.org/wiki/Canonical_link_element

                                This is not fair. lol

                                dreadmichael Dr-Pete jennita 5 Replies Last reply Reply Quote 0
                                • SEODinosaur
                                  SEODinosaur @dreadmichael last edited by

                                  But the spiders still run on the page and read the canonical link, however with the robot text the spiders will not.

                                  1 Reply Last reply Reply Quote 0
                                  • SEODinosaur
                                    SEODinosaur @SEODinosaur last edited by

                                    Although you are correct... there is still more then one way to skin a chicken.

                                    1 Reply Last reply Reply Quote 0
                                    • SEODinosaur
                                      SEODinosaur @dreadmichael last edited by

                                      Your welcome : )

                                      1 Reply Last reply Reply Quote 0
                                      • dreadmichael
                                        dreadmichael @SEODinosaur last edited by

                                        You are right Donnie. I've "good answered" you too.

                                        I've gone ahead and updated my robots.txt file. As soon as I am able, I will use no indexon the page, no follow on the links, and rel=canonical.

                                        This is just what I needed, a quick fix until I can make a more permanent solution.

                                        1 Reply Last reply Reply Quote 0
                                        • Dr-Pete
                                          Dr-Pete @SEODinosaur last edited by

                                          Rel-canonical, in practice, does essentially de-index the non-canonical version. Technically, it's not a de-indexation method, but it works that way.

                                          1 Reply Last reply Reply Quote 0
                                          • jennita
                                            jennita @SEODinosaur last edited by

                                            Josh, please read my and Dr. Pete's comments below. Don't nofollow the links, but do use the meta noindex,follow on the page.

                                            1 Reply Last reply Reply Quote 0
                                            • 1
                                            • 2
                                            • 1 / 2
                                            • First post
                                              Last post
                                            • How to de-index a page with a search string with the structure domain.com/?"spam"
                                              CopyChrisSEO
                                              CopyChrisSEO
                                              1
                                              4
                                              113

                                            • My sites "pages indexed by Google" have gone up more than qten-fold.
                                              MibuKotaro
                                              MibuKotaro
                                              0
                                              4
                                              89

                                            • How do I get my pages to go from "Submitted" to "Indexed" in Google Webmaster Tools?
                                              Nate_D
                                              Nate_D
                                              0
                                              7
                                              1.6k

                                            • If I want clean up my URLs and take the "www.site.com/page.html" and make it "www.site.com/page" do I need a redirect?
                                              Booj
                                              Booj
                                              0
                                              4
                                              113

                                            • How to block text on a page to be indexed?
                                              khi5
                                              khi5
                                              0
                                              8
                                              510

                                            • How Does Google's "index" find the location of pages in the "page directory" to return?
                                              reidsteven75
                                              reidsteven75
                                              0
                                              9
                                              215

                                            • Same URL in "Duplicate Content" and "Blocked by robots.txt"?
                                              alsvik
                                              alsvik
                                              0
                                              3
                                              502

                                            Get started with Moz Pro!

                                            Unlock the power of advanced SEO tools and data-driven insights.

                                            Start my free trial
                                            Products
                                            • Moz Pro
                                            • Moz Local
                                            • Moz API
                                            • Moz Data
                                            • STAT
                                            • Product Updates
                                            Moz Solutions
                                            • SMB Solutions
                                            • Agency Solutions
                                            • Enterprise Solutions
                                            • Digital Marketers
                                            Free SEO Tools
                                            • Domain Authority Checker
                                            • Link Explorer
                                            • Keyword Explorer
                                            • Competitive Research
                                            • Brand Authority Checker
                                            • Local Citation Checker
                                            • MozBar Extension
                                            • MozCast
                                            Resources
                                            • Blog
                                            • SEO Learning Center
                                            • Help Hub
                                            • Beginner's Guide to SEO
                                            • How-to Guides
                                            • Moz Academy
                                            • API Docs
                                            About Moz
                                            • About
                                            • Team
                                            • Careers
                                            • Contact
                                            Why Moz
                                            • Case Studies
                                            • Testimonials
                                            Get Involved
                                            • Become an Affiliate
                                            • MozCon
                                            • Webinars
                                            • Practical Marketer Series
                                            • MozPod
                                            Connect with us

                                            Contact the Help team

                                            Join our newsletter
                                            Moz logo
                                            © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
                                            • Accessibility
                                            • Terms of Use
                                            • Privacy