The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Technical SEO Issues
    4. Google index dymamic webpages after block in robots.txt...

    Google index dymamic webpages after block in robots.txt...

    Technical SEO Issues
    6 4 247
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • YogeshG
      YogeshG last edited by

      This post is deleted!
      1 Reply Last reply Reply Quote 0
      • Davenport-Tractor
        Davenport-Tractor last edited by

        Atul,

        We have experienced the same issue with our shopping cart paginating the product results. Even though robots.txt has specifically disallowed the crawling of certain pages, doesn't mean they wont be indexed. After all when you think about it you are providing a link to each of the pages you told the spider to disallow. Lets not forget that Google, and other robots are controlled by their respective company's, and their job is to gather content, even if we don't like or want it.

        So the best answer we have found is to embrace it! You can redirect the link juice back to the beginning URL with the "REL=Canonical" tag. Its fairly easy to create a statement in your template file that will test the current URL and build a dynamic URL that points the link juice back to the base URL. Here is an example:

        <linkrel="canonical"href="http://www.yourdomaine.com"/>

        Actually Google says you can point that link juice to any other Domain as well

        1 Reply Last reply Reply Quote 0
        • AndersS
          AndersS last edited by

          Hi!

          Could it be that the pages where already crawled by Google before you added the directives to robots.txt. Perhaps you could remove it, and add the rel="canonical", as Allen suggests. That way you will allow Google to reindex the pages and fetch the changes.

          Hope this helps 🙂

          1 Reply Last reply Reply Quote 0
          • YogeshG
            YogeshG last edited by

            This post is deleted!
            1 Reply Last reply Reply Quote 0
            • Dr-Pete
              Dr-Pete last edited by

              Unfortunately, Robots.txt is a poor choice for content that may have already been indexed, including dynamic content. It's good for blocking specific pages and folders (especially prior to Google crawling them), but it tends to be unreliable in these situations.

              Pagination is a tricky topic, and the "best" solution varies a lot with the situation, but the basic options are:

              (1) Use rel="prev" and rel="next", which helps Google handle the paginated series properly, but still allows it to rank.

              (2) Use META NOINDEX, FOLLOW on pages 2+ of search results (this was probably the most popular method before rel=prev/next).

              (3) Use rel=canonical to point all paginated results to a "View All" page. This page should be available to users and not be too large. It's a decent option if you have a few dozen results, but not 100s or 1000s.

              (4) Use Google Webmaster Tools parameter handling on the "page=" parameter. It seems to work, but since it's Google-specific, it's not the go-to option for most SEOs.

              YogeshG 1 Reply Last reply Reply Quote 1
              • YogeshG
                YogeshG @Dr-Pete last edited by

                This post is deleted!
                1 Reply Last reply Reply Quote 0
                • 1 / 1
                • First post
                  Last post
                • No index tag robots.txt
                  Nigel_Carr
                  Nigel_Carr
                  0
                  11
                  3.3k

                • Should you use robots.txt for pages within your site which do not have high quality content or are not contributing a great deal so when Google crawls your site the best performing content has a higher chance of being indexed?
                  Jacksons_Fencing
                  Jacksons_Fencing
                  0
                  5
                  44

                • Google Webmaster Tools is saying "Sitemap contains urls which are blocked by robots.txt" after Https move...
                  vetofunk
                  vetofunk
                  0
                  5
                  11.2k

                • Block Domain in robots.txt
                  donford
                  donford
                  0
                  6
                  2.2k

                • I accidentally blocked Google with Robots.txt. What next?
                  SebastianCowie
                  SebastianCowie
                  0
                  7
                  2.1k

                • Blocking robots.txt
                  de4e
                  de4e
                  0
                  4
                  432

                • Robots.txt blocking site or not?
                  RyanKent
                  RyanKent
                  0
                  2
                  445

                • Blocking other engines in robots.txt
                  RyanKent
                  RyanKent
                  0
                  2
                  581

                Get started with Moz Pro!

                Unlock the power of advanced SEO tools and data-driven insights.

                Start my free trial
                Products
                • Moz Pro
                • Moz Local
                • Moz API
                • Moz Data
                • STAT
                • Product Updates
                Moz Solutions
                • SMB Solutions
                • Agency Solutions
                • Enterprise Solutions
                • Digital Marketers
                Free SEO Tools
                • Domain Authority Checker
                • Link Explorer
                • Keyword Explorer
                • Competitive Research
                • Brand Authority Checker
                • Local Citation Checker
                • MozBar Extension
                • MozCast
                Resources
                • Blog
                • SEO Learning Center
                • Help Hub
                • Beginner's Guide to SEO
                • How-to Guides
                • Moz Academy
                • API Docs
                About Moz
                • About
                • Team
                • Careers
                • Contact
                Why Moz
                • Case Studies
                • Testimonials
                Get Involved
                • Become an Affiliate
                • MozCon
                • Webinars
                • Practical Marketer Series
                • MozPod
                Connect with us

                Contact the Help team

                Join our newsletter
                Moz logo
                © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
                • Accessibility
                • Terms of Use
                • Privacy