The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Technical SEO Issues
    4. Sitemap generator partially finding list of website URLs

    Sitemap generator partially finding list of website URLs

    Technical SEO Issues
    6 3 65
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • Taysir
      Taysir last edited by

      Hi everyone,

      When creating my XML sitemap here it is only able to detect a portion of the website. I am missing at least 20 URLs (blog pages + newly created resource pages). I have checked those missing URLs and all of them are index and they're not blocked by the robots.txt.

      Any idea why this is happening? I need to make sure all wanted URLs to be generated in an XML sitemap.

      Thanks!

      1 Reply Last reply Reply Quote 0
      • GastonRiera
        GastonRiera last edited by

        Hi taysir!

        Have you tried any other crawler to check whether those pages can be finded?
        I'd strongly suggest you Screaming Frog spider, the free version allows you up to 500 URLs. Also, it has a feature to create sitemaps from the crawled URLs. Even though dont know if that available in the free version.
        Here some info about that feature: XML sitemap genetator - Screaming Frog

        Usual issues in not being findable are:

        • Poor internal linking
        • Not having a sitemap (this is why you find out)
        • Blocked resources in robots.txt
        • Blocked pages with robots meta tag

        That being said, its completely normal that Google has indexed pages that you cant find in a AdHoc crawl, that is because GoogleBot could have found those pages from external linking.
        Also keep in mind that having pages blocked with Robots.txt or robots meta tag will not prevent that page from being indexed nor will make them deindex if you add some rules to block them.

        Hope it helps.
        Best luck
        GR

        Taysir 1 Reply Last reply Reply Quote 1
        • TucsonAZWebDesign
          TucsonAZWebDesign last edited by

          Google not only provides a basic template you could do the sitemap manually if you wished, and this link has Google listing several dozen open source sitemap generators.

          If Google Webmaster's can't read the one you generated fully, then clearly an alternate generator should definitely fix that for you. Good luck!

          1 Reply Last reply Reply Quote 0
          • Taysir
            Taysir @GastonRiera last edited by

            Thanks for your response Gaston. These pages are definitely not blocked by the robots.txt file. I think that it is an internal linking problem. I actually subscribed to pro-sitemap.com and was wondering if I should use this section and add remaining sitemap URLs that are missing: https://cl.ly/0k0t093f0Y1T

            Do you think this would do the trick?

            GastonRiera 1 Reply Last reply Reply Quote 0
            • GastonRiera
              GastonRiera @Taysir last edited by

              Hi Taysir,

              I´ve never used that service. I suspect that the section you refer to should do the trick.
              I believe that you do know how many URLs there are in the whole site, so you can compare how much pro-sitemaps.com finds to your numbers.

              Best luck!
              GR

              Taysir 1 Reply Last reply Reply Quote 0
              • Taysir
                Taysir @GastonRiera last edited by

                Gaston,

                Interestingly enough by default the generator only located only half of the URLs. I hope that one of those 2 fields will do the trick.

                1 Reply Last reply Reply Quote 0
                • 1 / 1
                • First post
                  Last post
                • Can you help by advising how to stop a URL from referring to another URL on my website please?
                  Levanniko253
                  Levanniko253
                  0
                  2
                  31

                • What is the best tool for getting a SiteMap url for a website with over 4k pages?
                  effectdigital
                  effectdigital
                  2
                  4
                  58

                • If I'm using a compressed sitemap (sitemap.xml.gz) that's the URL that gets submitted to webmaster tools, correct?
                  ThompsonPaul
                  ThompsonPaul
                  0
                  6
                  1.8k

                • Canonical sitemap URL different to website URL architecture
                  StephanSolomonidis
                  StephanSolomonidis
                  0
                  3
                  105

                • How to change URL for this website
                  Kizinko
                  Kizinko
                  0
                  4
                  93

                • Does anyone know a sitemap generation tool that updates your sitemap based on changes on your website?
                  Lifequotes
                  Lifequotes
                  0
                  4
                  383

                • How can I best find out which URLs from large sitemaps aren't indexed?
                  Audiohype
                  Audiohype
                  0
                  4
                  271

                • How to find original URLS after Hosting Company added canonical URLs, URL rewrites and duplicate content.
                  Nobody1560986989723
                  Nobody1560986989723
                  0
                  2
                  366

                Get started with Moz Pro!

                Unlock the power of advanced SEO tools and data-driven insights.

                Start my free trial
                Products
                • Moz Pro
                • Moz Local
                • Moz API
                • Moz Data
                • STAT
                • Product Updates
                Moz Solutions
                • SMB Solutions
                • Agency Solutions
                • Enterprise Solutions
                • Digital Marketers
                Free SEO Tools
                • Domain Authority Checker
                • Link Explorer
                • Keyword Explorer
                • Competitive Research
                • Brand Authority Checker
                • Local Citation Checker
                • MozBar Extension
                • MozCast
                Resources
                • Blog
                • SEO Learning Center
                • Help Hub
                • Beginner's Guide to SEO
                • How-to Guides
                • Moz Academy
                • API Docs
                About Moz
                • About
                • Team
                • Careers
                • Contact
                Why Moz
                • Case Studies
                • Testimonials
                Get Involved
                • Become an Affiliate
                • MozCon
                • Webinars
                • Practical Marketer Series
                • MozPod
                Connect with us

                Contact the Help team

                Join our newsletter
                Moz logo
                © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
                • Accessibility
                • Terms of Use
                • Privacy