The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Technical SEO Issues
    4. Sitemap.xml - autogenerated by CMS is full of crud

    Sitemap.xml - autogenerated by CMS is full of crud

    Technical SEO Issues
    4 2 1.2k
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • k3nn3dy3
      k3nn3dy3 last edited by

      Hi all,

      hope you can help.

      the Magento ecommerce system I'm working with autogenerates sitemap.xml - it's well formed with priority and frequency parameters.

      However, it has generated lots of URLs that are pointing to broken pages returning fatal erros, duplicate URLs (not canonicals), 404s etc

      I'm thinking of hand creating sitemap.xml - the site has around 50 main pages including products and categories, and I can get the main page URLs listed by screaming frog or xenu.

      Then I'll have to get into the hand editing the crud pages with noindex, and useful duplicates with canonicals.

      Is this the way to go or is there another solution

      thanks in advance for any advice

      1 Reply Last reply Reply Quote 0
      • KaneJamison
        KaneJamison last edited by

        If it's returning 404 pages, that sounds like a dated sitemap. Have you activated the cron service?

        See the "Refreshing Sitemaps at Regular Intervals" section of this page if not:

        Magento can be set up to automatically refresh Google Sitemaps at regular intervals. This function is configured in Admin > System > Configuration > Google Sitemap.

        To use Magento’s automatic generation of Google Sitemaps, you must activate the Magento Cron service.

        If you do have that setup, and you're certain it's working correctly, then I would turn to the forums at MagentoCommerce.com - you're going to get a lot faster answer there since everyone is familiar with that exact platform.

        1 Reply Last reply Reply Quote 0
        • k3nn3dy3
          k3nn3dy3 last edited by

          Hi Kane,

          the sitemap is new - it's just that Magento create lots of duplicate files on the fly & it's not putting the canonical URLs in the sitemap etc.

          I just wondered whether its worth hand creating a sitemap.xml containing the content pages (60 or 70 of them) for this relatively small site, or not worry too much about the sitemap, the site is pretty well indexed by google already

          I'll head over to the Magento forums again to see if I can find more info

          many thanks for you help

          KaneJamison 1 Reply Last reply Reply Quote 0
          • KaneJamison
            KaneJamison @k3nn3dy3 last edited by

            If the cron is working then I would personally turn to the other forum to see if anyone knows a way to rope those messy URLs in and get them under control. I try to avoid manually generating and updating sitemaps whenever I can, because it's a hassle on a small site, not to mention the trouble on an ecommerce site.

            If your site is going to stay that small, then a manual sitemap might be less of a headache for you than customizing Magento.

            I would worry about keeping a clean sitemap. If the search engines learn that you keep a messy sitemap, they will rely on it less and less. 404 & 500 codes especially, but also redirects and perhaps duplicate content.

            For Further Reading:

            Google Sitemaps Ask For Clean URLs - http://www.johnfdoherty.com/google-sitemaps-ask-for-clean-urls/

            1 Reply Last reply Reply Quote 0
            • 1 / 1
            • First post
              Last post
            • If I'm using a compressed sitemap (sitemap.xml.gz) that's the URL that gets submitted to webmaster tools, correct?
              ThompsonPaul
              ThompsonPaul
              0
              6
              1.8k

            • Automate XML Sitemaps
              DmitriiK
              DmitriiK
              0
              2
              142

            • Generating a xml sitemap?
              JVRudnick
              JVRudnick
              1
              4
              128

            • XML Sitemap Generators
              GPainter
              GPainter
              0
              2
              62

            • XML Sitemap Creation
              Jeff_Lucas
              Jeff_Lucas
              0
              4
              145

            • XML Sitemap Issue or not?
              Tay1986
              Tay1986
              0
              6
              368

            • Do I need an XML sitemap?
              pugh
              pugh
              0
              6
              9.2k

            • How to generate a visual sitemap using sitemap.xml
              Churchill1
              Churchill1
              0
              3
              8.4k

            Get started with Moz Pro!

            Unlock the power of advanced SEO tools and data-driven insights.

            Start my free trial
            Products
            • Moz Pro
            • Moz Local
            • Moz API
            • Moz Data
            • STAT
            • Product Updates
            Moz Solutions
            • SMB Solutions
            • Agency Solutions
            • Enterprise Solutions
            • Digital Marketers
            Free SEO Tools
            • Domain Authority Checker
            • Link Explorer
            • Keyword Explorer
            • Competitive Research
            • Brand Authority Checker
            • Local Citation Checker
            • MozBar Extension
            • MozCast
            Resources
            • Blog
            • SEO Learning Center
            • Help Hub
            • Beginner's Guide to SEO
            • How-to Guides
            • Moz Academy
            • API Docs
            About Moz
            • About
            • Team
            • Careers
            • Contact
            Why Moz
            • Case Studies
            • Testimonials
            Get Involved
            • Become an Affiliate
            • MozCon
            • Webinars
            • Practical Marketer Series
            • MozPod
            Connect with us

            Contact the Help team

            Join our newsletter
            Moz logo
            © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
            • Accessibility
            • Terms of Use
            • Privacy