The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Technical SEO Issues
    4. Sitemap Contains Blocked Resources

    Sitemap Contains Blocked Resources

    Technical SEO Issues
    6 3 403
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • ATP
      ATP last edited by

      Hey Mozzers,

      I have several pages on my website that are for user search purposes only. They sort some products by range and answer some direct search queries users type into the site. They are basically just product collections that are else ware grouped in different ways.

      As such I didn't wants SERPS getting their hands on them so blocked them in robots so I could add then worry free. However, they automatically get pulled into the sitemap by Magento.

      This has made Webmaster tools give me a warning that 21 urls in the sitemaps are blocked by robots.

      Is this terrible SEO wise?

      Should I have opted to NOINDEX these URLS instead? I was concerned about thin content so really didnt want google crawling them.

      1 Reply Last reply Reply Quote 0
      • Andy.Drinkwater
        Andy.Drinkwater last edited by

        Hi,

        Is this terrible SEO wise?

        Not really - it just means that Google can see that there is a page they can't access so are informing you of this. There is no negative penalty that is going to come from this. If there were old pages that are now 404's then it would be a different story.

        I just want to be sure of something - were the pages previously open to Google? Are they currently indexed?

        -Andy

        1 Reply Last reply Reply Quote 1
        • ATP
          ATP last edited by

          Hi Andy,

          I just checked and yes they were previously index'd and some of them still are.

          Andy.Drinkwater 1 Reply Last reply Reply Quote 0
          • CleverPhD
            CleverPhD last edited by

            I would recommend that you try and get those pages out of your sitemap. If you look through the Google sitemap best practices, it states that the sitemap should be for pages that Googlebot can access.

            http://googlewebmastercentral.blogspot.com/2014/10/best-practices-for-xml-sitemaps-rssatom.html

            URLs

            URLs in XML sitemaps and RSS/Atom feeds should adhere to the following guidelines:

            • Only include URLs that can be fetched by Googlebot. **A common mistake is **including URLs disallowed by robots.txt — which cannot be fetched by Googlebot, or including URLs of pages that don't exist.
            1 Reply Last reply Reply Quote 2
            • Andy.Drinkwater
              Andy.Drinkwater @ATP last edited by

              OK so first because some are indexed, if you block access, they will never be removed.

              What you will need to do is add a noindex tag to the pages but don't block access to them so that Google can honour the noindex. Remove the pages via Search Console and once you have confirmed these are all removed from the index, you will be good to then block access via robots.txt.

              As CleverPhD said, ideally you don't want pages in the index that can't be crawled, but it isn't likely to cause a penalty of any sort (I have a client with about 70-80 blocked - long story - no issues in 12 months) if you are stuck because of Megento - Perhaps research to see how others have got around this?

              -Andy

              1 Reply Last reply Reply Quote 1
              • ATP
                ATP last edited by

                Thanks for the latest responses guys

                I have researched it into the grave and it the way Magento generates the sitemap makes it impossible for me to exclude these URLS.

                I will just unblock them from robots, and make them all noindex. This seems to solve all problems, i will then block them when im 100% sure they are unindexed.

                Thanks Again chaps.

                Big help as always.

                1 Reply Last reply Reply Quote 1
                • 1 / 1
                • First post
                  Last post
                • Google Search console says 'sitemap is blocked by robots?
                  BlueprintMarketing
                  BlueprintMarketing
                  1
                  10
                  1.2k

                • Sitemaps:
                  bridget.randolph
                  bridget.randolph
                  0
                  5
                  50

                • Sitemap
                  Martijn_Scheijbeler
                  Martijn_Scheijbeler
                  0
                  3
                  69

                • "Url blocked by robots.txt." on my Video Sitemap
                  sergeystefoglo
                  sergeystefoglo
                  0
                  3
                  521

                • Google webmaster… Zopim Live chat blocking the resources
                  RyanPurkey
                  RyanPurkey
                  0
                  2
                  1.4k

                • Why is robots.txt blocking URL's in sitemap?
                  PurpleGriffon
                  PurpleGriffon
                  0
                  3
                  346

                • How could i create sitemap with 1000 page and should i update sitemap frequently?
                  magician
                  magician
                  0
                  4
                  841

                • Summarize your question.Sitemap blocking or not blocking that is the question?
                  Nobody1560986989723
                  Nobody1560986989723
                  0
                  2
                  353

                Get started with Moz Pro!

                Unlock the power of advanced SEO tools and data-driven insights.

                Start my free trial
                Products
                • Moz Pro
                • Moz Local
                • Moz API
                • Moz Data
                • STAT
                • Product Updates
                Moz Solutions
                • SMB Solutions
                • Agency Solutions
                • Enterprise Solutions
                • Digital Marketers
                Free SEO Tools
                • Domain Authority Checker
                • Link Explorer
                • Keyword Explorer
                • Competitive Research
                • Brand Authority Checker
                • Local Citation Checker
                • MozBar Extension
                • MozCast
                Resources
                • Blog
                • SEO Learning Center
                • Help Hub
                • Beginner's Guide to SEO
                • How-to Guides
                • Moz Academy
                • API Docs
                About Moz
                • About
                • Team
                • Careers
                • Contact
                Why Moz
                • Case Studies
                • Testimonials
                Get Involved
                • Become an Affiliate
                • MozCon
                • Webinars
                • Practical Marketer Series
                • MozPod
                Connect with us

                Contact the Help team

                Join our newsletter
                Moz logo
                © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
                • Accessibility
                • Terms of Use
                • Privacy