The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Technical SEO Issues
    4. How to create site map for large site (ecommerce type) that has 1000's if not 100,000 of pages.

    How to create site map for large site (ecommerce type) that has 1000's if not 100,000 of pages.

    Technical SEO Issues
    10 5 9.5k
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • BestRide
      BestRide last edited by

      I know this is kind of a newbie question but I am having an amazing amount of trouble creating a sitemap for our site Bestride.com.  We just did a complete redesign (look and feel, functionality, the works) and now I am trying to create a site map.  Most of the generators I have used "break" after reaching some number of pages.  I am at a loss as to how to create the sitemap.  Any help would be greatly appreciated!

      Thanks

      1 Reply Last reply Reply Quote 0
      • Chris.Menke
        Chris.Menke last edited by

        You can use screamingfrog to create your sitemap.  You just need to license it for crawl more than 500 URI.

        Chris.Menke 1 Reply Last reply Reply Quote 0
        • Red_educativa
          Red_educativa last edited by

          Hi Kristin,

          Each sitemap.xml can support maximum 50.000 URLs. So, If you have a site with more than 100K, It'd be better to create 2 or 3 o 4 etc sitemaps.xml in order to contain all URLs. Hope it is useful.

          Kind regards!

          Francesca

          1 Reply Last reply Reply Quote 1
          • Chris.Menke
            Chris.Menke @Chris.Menke last edited by

            Of course, you can also use the moz's crawl test report at http://pro.moz.com/tools/crawl-test

            1 Reply Last reply Reply Quote 0
            • LesleyPaone
              LesleyPaone last edited by

              Are you using a custom platform or an off the shelf e-commerce package? Most off the shelf packages actually have a module that can create a site map and a lot have it where you can cron it too.

              1 Reply Last reply Reply Quote 0
              • BestRide
                BestRide last edited by

                Thanks for the feedback!

                I will look into screamingfrog for sure.

                @Lesley - we are using a custom platform (in house) so we don't have that functionality.  The issue is that we have a lot of inventory (millions) of cars.  We have built (and are releasing new functionality today) to provide internal links so that Google can crawl all the inventory easily (users can too :).  My question about sitemaps has boiled down to this: Do we need to build the sitemap to include every single page (all the inventory) or do we provide a "map" so that google can find the top pages and then crawl the inventory from there.  Again the site is bestride.com.  If anyone wants to take a look at the site, that would be fantastic!

                Thanks

                LesleyPaone 1 Reply Last reply Reply Quote 0
                • Chris.Menke
                  Chris.Menke last edited by

                  Typically, a sitemap is going to include every page on the site. As Francesca said, each sitemap can be up to 50K urls and if you need multiple sitemaps then you create a sitemap index that points to the rest of the sitemaps.

                  https://support.google.com/webmasters/answer/183668?hl=en

                  1 Reply Last reply Reply Quote 2
                  • BestRide
                    BestRide last edited by

                    That's a great help Chris, thank you!  And thanks to all for your help!

                    1 Reply Last reply Reply Quote 0
                    • LesleyPaone
                      LesleyPaone @BestRide last edited by

                      The easiest thing i can think of is to write a script that works with your dispatcher to create a site map. The format I would use is add the page and all of the "product images" on the page to the map and move to the next. At the same time I would use an auto increment variable to keep track of how many lines you have written. When you get around 50k, write out the name of the next site map file that the program will create and have them chained together this way.

                      1 Reply Last reply Reply Quote 1
                      • Robin_Jennings
                        Robin_Jennings last edited by

                        I agree with Chris. With such large websites it would be advisable having a sitemap index and then splitting the index into various individual indexes such as Pages, Products, Categories, images, media, tags etc.

                        1 Reply Last reply Reply Quote 0
                        • 1 / 1
                        • First post
                          Last post
                        • Why are only PDFs on my client's site being indexed, and not actual pages?
                          mfrgolfgti
                          mfrgolfgti
                          0
                          5
                          133

                        • Google how deal with licensed content when this placed on vendor & client's website too. Will Google penalize the client's site for this ?
                          katemorris
                          katemorris
                          1
                          4
                          94

                        • Anything new if determining how many of a sites pages are in Google's supplemental index vs the main index?
                          SEMPassion
                          SEMPassion
                          0
                          4
                          390

                        • My beta site (beta.website.com) has been inadvertently indexed. Its cached pages are taking traffic away from our real website (website.com). Should I just "NO INDEX" the entire beta site and if so, what's the best way to do this? Please advise.
                          Vuly
                          Vuly
                          0
                          5
                          1.6k

                        • Ecommerce website: Product page setup & SKU's
                          Keszi
                          Keszi
                          0
                          9
                          1.3k

                        • Product landing page URL's for e-commerce sites - best practices?
                          BenRWoodard
                          BenRWoodard
                          0
                          2
                          2.3k

                        • Can I format my H1 to be smaller than H2's and H3's on the same page?
                          theLotter
                          theLotter
                          0
                          5
                          2.8k

                        • Finding the site's most relevant page
                          RyanKent
                          RyanKent
                          0
                          6
                          650

                        Get started with Moz Pro!

                        Unlock the power of advanced SEO tools and data-driven insights.

                        Start my free trial
                        Products
                        • Moz Pro
                        • Moz Local
                        • Moz API
                        • Moz Data
                        • STAT
                        • Product Updates
                        Moz Solutions
                        • SMB Solutions
                        • Agency Solutions
                        • Enterprise Solutions
                        • Digital Marketers
                        Free SEO Tools
                        • Domain Authority Checker
                        • Link Explorer
                        • Keyword Explorer
                        • Competitive Research
                        • Brand Authority Checker
                        • Local Citation Checker
                        • MozBar Extension
                        • MozCast
                        Resources
                        • Blog
                        • SEO Learning Center
                        • Help Hub
                        • Beginner's Guide to SEO
                        • How-to Guides
                        • Moz Academy
                        • API Docs
                        About Moz
                        • About
                        • Team
                        • Careers
                        • Contact
                        Why Moz
                        • Case Studies
                        • Testimonials
                        Get Involved
                        • Become an Affiliate
                        • MozCon
                        • Webinars
                        • Practical Marketer Series
                        • MozPod
                        Connect with us

                        Contact the Help team

                        Join our newsletter
                        Moz logo
                        © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
                        • Accessibility
                        • Terms of Use
                        • Privacy