The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Technical SEO Issues
    4. Robots.txt usage

    Robots.txt usage

    Technical SEO Issues
    6 4 527
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • holidayseo
      holidayseo last edited by

      Hey Guys,

      I am about make an important improvement to our site's robots.txt

      we have large number of properties on our site and we have different views for them. List, gallery and map view. By default list view shows up and user can navigate through gallery view.

      We donot want gallery pages to get indexed and want to save our crawl budget for more important pages.

      this is one example of our site:

      http://www.holiday-rentals.co.uk/France/r31.htm

      When you click on "gallery view" URL of this site will remain same in your address bar: but when you mouse over the "gallery view" tab it will show you URL with parameter "view=g". there are number of parameters: "view=g, view=l and view=m".

      http://www.holiday-rentals.co.uk/France/r31.htm?view=l

      http://www.holiday-rentals.co.uk/France/r31.htm?view=g

      http://www.holiday-rentals.co.uk/France/r31.htm?view=m

      Now my question is:

      I If restrict bots by adding "Disallow: ?view=" in our robots.txt will it effect the list view too?

      Will be very thankful if yo look into this for us.

      Many thanks

      Hassan

      I will test this on some other site within our network too before putting it to important one's. to measure the impact but will be waiting for your recommendations. Thanks

      1 Reply Last reply Reply Quote 0
      • SuperlativB
        SuperlativB last edited by

        Sounds like this is something canonical could solve for you. If you disallow ?view=* you would disallow all "?view" on your homepage, if you are unsure you should go for exact match rather that all.

        1 Reply Last reply Reply Quote 0
        • sesertin
          sesertin last edited by

          You can do the restriction you want but if i get it right m stands for map view g stands for gallery view and l stands for list view. So if you want list view to be indexed and map and gallery view not to be indexed you should add two lines of distriction:

          disallow:?view=m disallow:?view=g

          if these paratmeters are not at the very end os the url you should add * after the letter of the parameter as well in the restriction

          holidayseo 1 Reply Last reply Reply Quote 0
          • holidayseo
            holidayseo @sesertin last edited by

            For  these paratmeters are not at the very end os the url you should add * after the letter of the parameter as well in the restriction

            you got my point, thanks for looking into this. Since our search page load with list view by default and it is not in URL but still v=l represents the list view.

            I want to disallow both parameters "view=g, view=m" in any URL from bots.

            If these parameters are sometimes in between and some time at the end of URL what will be the work around for for both cases, you suggest?

            Thanks for looking into this...

            sesertin 1 Reply Last reply Reply Quote 0
            • QPLF
              QPLF last edited by

              I had a similar issue with my website: there were many ways of sorting a likst of items (date, title, etc) which ended up causing duplicate content, we solved the issue a couple of days ago by restricting the "sorted" pages using the robots.txt file. HOWEVER, this morning i found this text in the Google Webmaster Tools support section:

              Google no longer recommends blocking crawler access to duplicate content on your website, whether with a robots.txt file or other methods. If search engines can't crawl pages with duplicate content, they can't automatically detect that these URLs point to the same content and will therefore effectively have to treat them as separate, unique pages. A better solution is to allow search engines to crawl these URLs, but mark them as duplicates by using the rel="canonical" link element, the URL parameter handling tool, or 301 redirects. In cases where duplicate content leads to us crawling too much of your website, you can also adjust the crawl rate setting in Webmaster Tools.

              source:
              http://www.google.com/support/webmasters/bin/answer.py?answer=66359

              I havent seen any negative effect on my site (yet), but I would agree with SuperlativB in the sense that YOU might be better off using "canonical" tags on these links

              http://www.holiday-rentals.co.uk/...?view=l

              http://www.holiday-rentals.co.uk/...?view=g

              http://www.holiday-rentals.co.uk/...?view=m

              1 Reply Last reply Reply Quote 0
              • sesertin
                sesertin @holidayseo last edited by

                Others are right by the way canonical may be better, but if you insist on robots restriction you should add two schemas to each parameter:

                disallow:?view=m disallow:?view=m*

                so that you block the urls that contain the parameter at the end and block the ones that have it in the middle as well.

                1 Reply Last reply Reply Quote 0
                • 1 / 1
                • First post
                  Last post
                • Robots.txt
                  MichaelC-15022
                  MichaelC-15022
                  0
                  7
                  1.0k

                • Robots.txt
                  Dan-Lawrence
                  Dan-Lawrence
                  0
                  5
                  99

                • Meta Robots Noindex and Robots.txt File
                  Devanur-Rafi
                  Devanur-Rafi
                  0
                  2
                  125

                • Robots.txt anomaly
                  Dan-Lawrence
                  Dan-Lawrence
                  0
                  10
                  127

                • Do I need robots.txt and meta robots?
                  Cyrus-Shepard
                  Cyrus-Shepard
                  0
                  7
                  1.1k

                • Robots.txt
                  Entrusteddev
                  Entrusteddev
                  0
                  3
                  642

                • What is the sense of robots.txt?
                  RyanKent
                  RyanKent
                  0
                  3
                  702

                • Robots.txt and robots meta
                  TheEspresseo
                  TheEspresseo
                  0
                  5
                  1.1k

                Get started with Moz Pro!

                Unlock the power of advanced SEO tools and data-driven insights.

                Start my free trial
                Products
                • Moz Pro
                • Moz Local
                • Moz API
                • Moz Data
                • STAT
                • Product Updates
                Moz Solutions
                • SMB Solutions
                • Agency Solutions
                • Enterprise Solutions
                • Digital Marketers
                Free SEO Tools
                • Domain Authority Checker
                • Link Explorer
                • Keyword Explorer
                • Competitive Research
                • Brand Authority Checker
                • Local Citation Checker
                • MozBar Extension
                • MozCast
                Resources
                • Blog
                • SEO Learning Center
                • Help Hub
                • Beginner's Guide to SEO
                • How-to Guides
                • Moz Academy
                • API Docs
                About Moz
                • About
                • Team
                • Careers
                • Contact
                Why Moz
                • Case Studies
                • Testimonials
                Get Involved
                • Become an Affiliate
                • MozCon
                • Webinars
                • Practical Marketer Series
                • MozPod
                Connect with us

                Contact the Help team

                Join our newsletter
                Moz logo
                © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
                • Accessibility
                • Terms of Use
                • Privacy