The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Technical SEO Issues
    4. Google Webmaster Tools is saying "Sitemap contains urls which are blocked by robots.txt" after Https move...

    Google Webmaster Tools is saying "Sitemap contains urls which are blocked by robots.txt" after Https move...

    Technical SEO Issues
    5 2 11.2k
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • vetofunk
      vetofunk last edited by

      Hi Everyone,

      I really don't see anything wrong with our robots.txt file after our https move that just happened, but Google says all URLs are blocked. The only change I know we need to make is changing the sitemap url to https. Anything you all see wrong with this robots.txt file?

      robots.txt

      This file is to prevent the crawling and indexing of certain parts

      of your site by web crawlers and spiders run by sites like Yahoo!

      and Google. By telling these "robots" where not to go on your site,

      you save bandwidth and server resources.

      This file will be ignored unless it is at the root of your host:

      Used:    http://example.com/robots.txt

      Ignored: http://example.com/site/robots.txt

      For more information about the robots.txt standard, see:

      http://www.robotstxt.org/wc/robots.html

      For syntax checking, see:

      http://www.sxw.org.uk/computing/robots/check.html

      Website Sitemap

      Sitemap: http://www.bestpricenutrition.com/sitemap.xml

      Crawlers Setup

      User-agent: *

      Allowable Index

      Allow: /*?p=
      Allow: /index.php/blog/
      Allow: /catalog/seo_sitemap/category/

      Directories

      Disallow: /404/
      Disallow: /app/
      Disallow: /cgi-bin/
      Disallow: /downloader/
      Disallow: /includes/
      Disallow: /lib/
      Disallow: /magento/
      Disallow: /pkginfo/
      Disallow: /report/
      Disallow: /stats/
      Disallow: /var/

      Paths (clean URLs)

      Disallow: /index.php/
      Disallow: /catalog/product_compare/
      Disallow: /catalog/category/view/
      Disallow: /catalog/product/view/
      Disallow: /catalogsearch/
      Disallow: /checkout/
      Disallow: /control/
      Disallow: /contacts/
      Disallow: /customer/
      Disallow: /customize/
      Disallow: /newsletter/
      Disallow: /poll/
      Disallow: /review/
      Disallow: /sendfriend/
      Disallow: /tag/
      Disallow: /wishlist/
      Disallow: /aitmanufacturers/index/view/
      Disallow: /blog/tag/
      Disallow: /advancedreviews/abuse/reportajax/
      Disallow: /advancedreviews/ajaxproduct/
      Disallow: /advancedreviews/proscons/checkbyproscons/
      Disallow: /catalog/product/gallery/
      Disallow: /productquestions/index/ajaxform/

      Files

      Disallow: /cron.php
      Disallow: /cron.sh
      Disallow: /error_log
      Disallow: /install.php
      Disallow: /LICENSE.html
      Disallow: /LICENSE.txt
      Disallow: /LICENSE_AFL.txt
      Disallow: /STATUS.txt

      Paths (no clean URLs)

      Disallow: /.php$
      Disallow: /
      ?SID=
      disallow: /?cat=
      disallow: /
      ?price=
      disallow: /?flavor=
      disallow: /
      ?dir=
      disallow: /?mode=
      disallow: /
      ?list=
      disallow: /?limit=5
      disallow: /
      ?limit=10
      disallow: /?limit=15
      disallow: /
      ?limit=20
      disallow: /*?limit=25

      1 Reply Last reply Reply Quote 0
      • GastonRiera
        GastonRiera last edited by

        Hello Jeff,

        Just some routine questions to establish a base line:

        1. Have you checked that the sitemap doesnt include any of the disallowed URLs?
        2. You said that there was a movement to HTTPS, have you created a new account for the new domain?
        3. Im seing that the robots.txt has the old URL for the sitemap, without the HTTPS correction.

        Let me know.

        1 Reply Last reply Reply Quote 0
        • vetofunk
          vetofunk last edited by

          Thanks for the quick response.

          1. Yes...Google Webmaster Tools is giving examples...and they are basically all the product pages.

          2. Did the Add Site under Google Webmaster Tools yes...this is from that new 'account'.

          3. Yes...we are fixing that.

          You see anything in that robots.text above that would indicate we are blocking https product pages?

          GastonRiera 1 Reply Last reply Reply Quote 0
          • GastonRiera
            GastonRiera @vetofunk last edited by

            Jeff,

            I was only able to find only ONE URL in the sitemap that is blocked by the robots.txt that you've posted in this question.
            Check the image attached.
            The URL is: https://www.bestpricenutrition.com/catalog/product/view/id/15650.html

            What did I do? A manual search of all the disallowed terms in the sitemap.

            Also, you might want to take a comprehensive read at this article about robots.txt. It helped me to find that mistake.
            The complete guide to Robots.txt - Portent.com

            Best Luck.
            GR.

            22901473c0a7ba7fc6d7dbad6b3ab319

            1 Reply Last reply Reply Quote 1
            • vetofunk
              vetofunk last edited by

              Thanks again for the response. Looks like it just took a little more time for Google to resolve the issue. No more errors. Didn't do anything but resubmit Sitemap and Robots.txt.

              Thanks for the tips as well. I am going to post one more question in another thread.

              1 Reply Last reply Reply Quote 0
              • 1 / 1
              • First post
                Last post
              • Google Search console says 'sitemap is blocked by robots?
                BlueprintMarketing
                BlueprintMarketing
                1
                10
                1.2k

              • "Url blocked by robots.txt." on my Video Sitemap
                sergeystefoglo
                sergeystefoglo
                0
                3
                521

              • Can I Block https URLs using Host directive in robots.txt?
                LoganRay
                LoganRay
                0
                4
                760

              • How do I get my pages to go from "Submitted" to "Indexed" in Google Webmaster Tools?
                Nate_D
                Nate_D
                0
                7
                1.6k

              • "Links to your site" in google webmaster tools not showing any data
                TimKelsey
                TimKelsey
                0
                2
                3.3k

              • Google webmaster tool doestn allow me to send 'URL and all linked pages"
                RobertFisher
                RobertFisher
                0
                2
                219

              • Should we block URL param in Webmaster tools after URL migration?
                gmk1567
                gmk1567
                0
                2
                594

              Get started with Moz Pro!

              Unlock the power of advanced SEO tools and data-driven insights.

              Start my free trial
              Products
              • Moz Pro
              • Moz Local
              • Moz API
              • Moz Data
              • STAT
              • Product Updates
              Moz Solutions
              • SMB Solutions
              • Agency Solutions
              • Enterprise Solutions
              • Digital Marketers
              Free SEO Tools
              • Domain Authority Checker
              • Link Explorer
              • Keyword Explorer
              • Competitive Research
              • Brand Authority Checker
              • Local Citation Checker
              • MozBar Extension
              • MozCast
              Resources
              • Blog
              • SEO Learning Center
              • Help Hub
              • Beginner's Guide to SEO
              • How-to Guides
              • Moz Academy
              • API Docs
              About Moz
              • About
              • Team
              • Careers
              • Contact
              Why Moz
              • Case Studies
              • Testimonials
              Get Involved
              • Become an Affiliate
              • MozCon
              • Webinars
              • Practical Marketer Series
              • MozPod
              Connect with us

              Contact the Help team

              Join our newsletter
              Moz logo
              © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
              • Accessibility
              • Terms of Use
              • Privacy