The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Intermediate & Advanced SEO
    4. I have two sitemaps which partly duplicate - one is blocked by robots.txt but can't figure out why!

    I have two sitemaps which partly duplicate - one is blocked by robots.txt but can't figure out why!

    Intermediate & Advanced SEO
    5 2 114
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • McTaggart
      McTaggart last edited by

      Hi, I've just found two sitemaps - one of them is .php and represents part of the site structure on the website. The second is a .txt file which lists every page on the website. The .txt file is blocked via robots exclusion protocol (which doesn't appear to be very logical as it's the only full sitemap). Any ideas why a developer might have done that?

      1 Reply Last reply Reply Quote 0
      • Chris.Menke
        Chris.Menke last edited by

        Luke,

        The .php one would have been created as a navigation tool to help users find what they're looking for faster, as well as to  provide html links to search engine spiders to help them reach all pages on the site.  On small sites, such sitemaps often include all pages of the site, on large ones, it might just be high level pages.  The .txt file is non html and exists to provide search engines with a full list of urls on the site for the sole purpose of helping search engines index all the site's pages.

        The robots.txt file can also be used to specify the location of the sitemap.txt file such as

        sitemap: http://www.example.com/sitemap_location.txt

        Are you sure the sitemap is being blocked by the robots.txt file or is the robots.txt file just listing the location of the sitemap.txt?

        McTaggart 2 Replies Last reply Reply Quote 0
        • McTaggart
          McTaggart @Chris.Menke last edited by

          Thanks for the useful feedback Chris - much appreciated - Is it good practice to use both - I guess it's a good idea if onsite version only includes top-level pages? PS. Just checking nature of block!

          1 Reply Last reply Reply Quote 0
          • McTaggart
            McTaggart @Chris.Menke last edited by

            yes, sitemap.txt is blocked for some strange reason. I know SEOs do this sometimes for various reasons, but in this case it just doesn't make sense - not to me, anyway.

            Chris.Menke 1 Reply Last reply Reply Quote 0
            • Chris.Menke
              Chris.Menke @McTaggart last edited by

              There are standards for the sitemaps .txt and .xml sitemaps, where there are no standards for html varieties.  Neither guarantees the listed pages will be crawled, though.  HTML has some advantage of potentially passing pagerank, where .txt and .xml varieties don't.

              These days, xml sitemaps may be more common  than .txt sitemaps but both perform the same function.

              1 Reply Last reply Reply Quote 1
              • 1 / 1
              • First post
                Last post
              • Can't support IE 7,8,9, 10\. Can we redirect them to another page that's optimized for those browsers so that we can have our site work on modern browers while still providing a destination of IE browsers?
                0
                1
                18

              • My crawl can't find ANY product pages. The links to product pages aren't links, they're script. :(
                Joe.Robison
                Joe.Robison
                0
                8
                247

              • Weird rankings on my website, can't figure it out
                evolvingSEO
                evolvingSEO
                0
                4
                117

              • What can you do when Google can't decide which of two pages is the better search result
                David-Kley
                David-Kley
                0
                3
                85

              • Can't find X-Robots tag!
                Martijn_Scheijbeler
                Martijn_Scheijbeler
                0
                3
                417

              • Can URLs blocked with robots.txt hurt your site?
                workzentre
                workzentre
                0
                4
                302

              • Googlebot Can't Access My Sites After I Repair My Robots File
                Igal_Zeifman
                Igal_Zeifman
                1
                4
                2.6k

              Get started with Moz Pro!

              Unlock the power of advanced SEO tools and data-driven insights.

              Start my free trial
              Products
              • Moz Pro
              • Moz Local
              • Moz API
              • Moz Data
              • STAT
              • Product Updates
              Moz Solutions
              • SMB Solutions
              • Agency Solutions
              • Enterprise Solutions
              • Digital Marketers
              Free SEO Tools
              • Domain Authority Checker
              • Link Explorer
              • Keyword Explorer
              • Competitive Research
              • Brand Authority Checker
              • Local Citation Checker
              • MozBar Extension
              • MozCast
              Resources
              • Blog
              • SEO Learning Center
              • Help Hub
              • Beginner's Guide to SEO
              • How-to Guides
              • Moz Academy
              • API Docs
              About Moz
              • About
              • Team
              • Careers
              • Contact
              Why Moz
              • Case Studies
              • Testimonials
              Get Involved
              • Become an Affiliate
              • MozCon
              • Webinars
              • Practical Marketer Series
              • MozPod
              Connect with us

              Contact the Help team

              Join our newsletter
              Moz logo
              © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
              • Accessibility
              • Terms of Use
              • Privacy