The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Intermediate & Advanced SEO
    4. Blocking Pages Via Robots, Can Images On Those Pages Be Included In Image Search

    Blocking Pages Via Robots, Can Images On Those Pages Be Included In Image Search

    Intermediate & Advanced SEO
    6 3 682
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • HD_Leona
      HD_Leona last edited by

      Hi!

      I have pages within my forum where visitors can upload photos.  When they upload photos they provide a simple statement about the photo but no real information about the image,definitely not enough for the page to be deemed worthy of being indexed.  The industry however is one that really leans on images and having the images in Google Image search is important to us.

      The url structure is like such:  domain.com/community/photos/~username~/picture111111.aspx

      I wish to block the whole folder from Googlebot to prevent these low quality pages from being added to Google's main SERP results.  This would be something like this:

      User-agent: googlebot

      Disallow: /community/photos/

      Can  I disallow Googlebot specifically rather than just using User-agent:  * which would then allow googlebot-image to pick up the photos?  I plan on configuring a way to add meaningful alt attributes and image names to assist in visibility, but the actual act of blocking the pages and getting the images picked up... Is this possible?

      Thanks!

      Leona

      1 Reply Last reply Reply Quote 0
      • Matt-Williamson
        Matt-Williamson last edited by

        Hi Leona

        Googlebot-image and any of the other bots that Google uses follow the rules set out for Googlebot so blocking Googlebot would block your images as it overrides Googlebot-image. I don't think that there is a way around this using the disallow directive as you are blocking the directory which contains your images so they won't be indexed using specific images.

        Something you may want to consider is the Allow directive -

        Disallow: /community/photos/

        Allow: /community/photos/~username~/

        that is if Google is already indexing images under the username path?

        The allow directive will only be successful if it contains more or equal number of characters as the disallow path, so bare in mind that if you had the following;

        Disallow: /community/photos/

        Allow: /community/photos/

        the allow will win out and nothing will be blocked. please note that i haven't actioned the allow directive myself but looked into it in depth when i studied the robots.txt for my own sites it would be good if someone else had an experience of this directive. Hope this helps.

        1 Reply Last reply Reply Quote 1
        • HD_Leona
          HD_Leona last edited by

          Hi Matt,

          Thanks for your feedback!

          It is not my belief that Googlebot overwrides googlebot-images otherwise specifying something for a specific bot of Google's wouldn't work, correct?

          I setup the following:

          User-agent: googlebot

          Disallow: /community/photos/

          User-agent: googlebot-Image

          Allow: /community/photos/

          I tested the results in Google Webmaster Tools which indicated:

          Googlebot:  Blocked by line 26: Disallow: /community/photos/Detected as a directory; specific files may have different restrictions

          Googlebot-Image:  Allowed by line 29: Allow: /community/photos/Detected as a directory; specific files may have different restrictions

          Thanks for your help!

          Leona

          Matt-Williamson Dr-Pete 2 Replies Last reply Reply Quote 0
          • Matt-Williamson
            Matt-Williamson @HD_Leona last edited by

            Hi Leona - what you have done is something along the lines of what I thought would work for you - sorry if I wasn't clear in my original response - I thought you meant if you created a robots.txt and specified Googlebot to be disallowed then Googlebot-image would pick up the photos still and as I said this wouldn't be the case as it Googlebot-image will follow what it set out for Googlebot unless you specify otherwise using the allow directive as I mentioned. Glad it has worked for you - keep us posted on your results.

            HD_Leona 1 Reply Last reply Reply Quote 1
            • HD_Leona
              HD_Leona @Matt-Williamson last edited by

              Thanks Matt for your time and assistance! Leona

              1 Reply Last reply Reply Quote 0
              • Dr-Pete
                Dr-Pete @HD_Leona last edited by

                Are you seeing the images getting indexed, though? Even if GWT recognize the Robots.txt directives, blocking the pages may essentially keep the images from having any ranking value. Like Matt, I'm not sure this will work in practice.

                Another option would be to create an alternate path to just the images, like an HTML sitemap with just links to those images and decent anchor text. The ranking power still wouldn't be great (you'd have a lot of links on this page, most likely), but it would at least kick the crawlers a bit.

                1 Reply Last reply Reply Quote 1
                • 1 / 1
                • First post
                  Last post
                • Robots blocked by pages webmasters tools
                  mihoreis
                  mihoreis
                  0
                  7
                  76

                • Can you disallow links via Search Console?
                  Roman-Delcarmen
                  Roman-Delcarmen
                  0
                  4
                  61

                • Blocking Dynamic Search Result Pages From Google
                  badgergravling
                  badgergravling
                  0
                  2
                  52

                • What can you do when Google can't decide which of two pages is the better search result
                  David-Kley
                  David-Kley
                  0
                  3
                  85

                • Should all pages on a site be included in either your sitemap or robots.txt?
                  RossFruin
                  RossFruin
                  1
                  8
                  162

                • Does using robots.txt to block pages decrease search traffic?
                  KeriMorgret
                  KeriMorgret
                  0
                  4
                  520

                • Search Engine Blocked by robots.txt for Dynamic URLs
                  KeriMorgret
                  KeriMorgret
                  0
                  2
                  689

                • Block all search results (dynamic) in robots.txt?
                  onwebtoday
                  onwebtoday
                  0
                  9
                  4.8k

                Get started with Moz Pro!

                Unlock the power of advanced SEO tools and data-driven insights.

                Start my free trial
                Products
                • Moz Pro
                • Moz Local
                • Moz API
                • Moz Data
                • STAT
                • Product Updates
                Moz Solutions
                • SMB Solutions
                • Agency Solutions
                • Enterprise Solutions
                • Digital Marketers
                Free SEO Tools
                • Domain Authority Checker
                • Link Explorer
                • Keyword Explorer
                • Competitive Research
                • Brand Authority Checker
                • Local Citation Checker
                • MozBar Extension
                • MozCast
                Resources
                • Blog
                • SEO Learning Center
                • Help Hub
                • Beginner's Guide to SEO
                • How-to Guides
                • Moz Academy
                • API Docs
                About Moz
                • About
                • Team
                • Careers
                • Contact
                Why Moz
                • Case Studies
                • Testimonials
                Get Involved
                • Become an Affiliate
                • MozCon
                • Webinars
                • Practical Marketer Series
                • MozPod
                Connect with us

                Contact the Help team

                Join our newsletter
                Moz logo
                © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
                • Accessibility
                • Terms of Use
                • Privacy