The Moz Q&A Forum


    When I try creating a sitemap, it doesn't crawl my entire site.

    Intermediate & Advanced SEO
    • TheSquareFoot
      TheSquareFoot last edited by

      We just launched a new Ruby app (the site used to be a WordPress blog):

      http://www.thesquarefoot.com

      We have not had time to build an auto-generated sitemap, so I tried a few websites that offer free sitemap generation tools. Most of them index up to 100 or 500 URLs. Our site has over 1,000 individual listings and 3 landing pages, so when I put our URL into a sitemap creator, it should find all of these pages. That is not happening, however: the crawlers seem to see only 4 pages.

      http://www.thesquarefoot.com/
      http://www.thesquarefoot.com/users/sign_in
      http://www.thesquarefoot.com/search
      http://www.thesquarefoot.com/renters/sign_up

      I am worried that when Google comes to crawl our site, these are the only pages it will see as well. Our robots.txt is blank, so there should be nothing stopping the crawlers from going through the entire site. Here is an example of one of the 1,000s of pages not being crawled:

      http://www.thesquarefoot.com/listings/Houston/TX/77098/Central_Houston/3910_Kirby_Dr/Suite_204

      Any help would be much appreciated!
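One way to confirm that a blank robots.txt really is not the blocker is to run the file's contents through Python's standard `urllib.robotparser`. This is only a minimal sketch: the helper name `is_allowed` is made up for illustration, and the second rule set in the usage note is a hypothetical counter-example, not something taken from the site.

```python
from urllib.robotparser import RobotFileParser


def is_allowed(robots_txt: str, url: str, user_agent: str = "*") -> bool:
    """Return True if the given robots.txt text lets user_agent fetch url."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines; an empty file yields no rules at all.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# The listing URL quoted in the question.
LISTING_URL = (
    "http://www.thesquarefoot.com/listings/Houston/TX/77098/"
    "Central_Houston/3910_Kirby_Dr/Suite_204"
)

# A blank robots.txt, as described in the post.
BLANK_ROBOTS = ""

# Hypothetical rules, shown only for contrast with the blank file.
BLOCKING_ROBOTS = "User-agent: *\nDisallow: /listings/"
```

Calling `is_allowed(BLANK_ROBOTS, LISTING_URL)` returns `True`, while `is_allowed(BLOCKING_ROBOTS, LISTING_URL)` returns `False`. So a blank file permits the listing pages, and the reason they are not being found has to lie elsewhere.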

      • matbennett
        matbennett last edited by

        I'd worry less about the sitemaps and more about your internal linking structure. The problem you are having with crawlers is a symptom of the linking problem.

        Most of your content seems to be on the other side of a search form. When crawlers, including those from search engines, explore your site, they are looking for href links to follow. They will not submit forms.

        If you want the other content to be indexed, you need to provide a crawl path to it. Could you add links to each neighbourhood somewhere on a page so that there is a path to follow? That might lead on to further questions about your URL structure and use of AJAX too.

        The general principle is that you should link to content you want to rank. Many will argue that a sitemap removes that necessity, but links provide more information than a list of URLs, and I certainly wouldn't rely on sitemaps alone to get content indexed, let alone ranked.
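To illustrate the point about href links versus forms, here is a rough sketch of what a simple crawler does with a page, using only Python's standard library. The markup in `HOME_PAGE` and `HOME_PAGE_WITH_CRAWL_PATH` is invented for the example (it is not the site's real HTML), and the neighbourhood URL is a hypothetical path modelled on the listing URL in the question.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkExtractor(HTMLParser):
    """Collect the targets of <a href="..."> tags and ignore everything else."""

    def __init__(self, base_url: str):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        # Forms, inputs and buttons are skipped: the crawler never submits them.
        if tag != "a":
            return
        href = dict(attrs).get("href")
        if href:
            self.links.append(urljoin(self.base_url, href))


def discover_links(html: str, base_url: str) -> list:
    """Return the absolute URLs a link-following crawler would queue."""
    extractor = LinkExtractor(base_url)
    extractor.feed(html)
    return extractor.links


BASE_URL = "http://www.thesquarefoot.com/"

# Invented markup: listings are reachable only through the search form.
HOME_PAGE = """
<a href="/users/sign_in">Sign in</a>
<a href="/search">Search</a>
<a href="/renters/sign_up">Sign up</a>
<form action="/search" method="get">
  <input name="neighbourhood"><button>Find space</button>
</form>
"""

# The same page with one plain link added (hypothetical neighbourhood URL).
HOME_PAGE_WITH_CRAWL_PATH = HOME_PAGE + (
    '<a href="/listings/Houston/TX/77098/Central_Houston">Central Houston</a>'
)
```

`discover_links(HOME_PAGE, BASE_URL)` returns only the sign-in, search and sign-up URLs, so nothing behind the form is ever queued. With `HOME_PAGE_WITH_CRAWL_PATH` the neighbourhood page appears in the list too, which gives a crawler somewhere to go next.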

        • TheSquareFoot
          TheSquareFoot last edited by

          Thanks for your help. Can I ask one more question?

          We just submitted a new sitemap to Google for our new Rails app:

          http://www.thesquarefoot.com/sitemap.xml

          It lists over 1,300 pages, but Google is only seeing 114. About 1,025 of the pages are in the listings folder, 250 are blog posts, and 15 are landing pages.
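A per-section breakdown like the one above can be checked against the sitemap file itself. The sketch below assumes the file is a standard sitemaps.org `urlset`. The helper name `count_by_section` and the three-entry `SAMPLE_SITEMAP` are illustrative only, and the `/blog/example-post` path is a guess at how blog URLs might look, not something confirmed in this thread.

```python
from collections import Counter
from urllib.parse import urlparse
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def count_by_section(sitemap_xml: str) -> Counter:
    """Count sitemap URLs by the first segment of their path."""
    root = ET.fromstring(sitemap_xml)
    counts = Counter()
    for loc in root.findall("sm:url/sm:loc", SITEMAP_NS):
        path = urlparse((loc.text or "").strip()).path
        first_segment = path.strip("/").split("/")[0]
        # The home page has an empty path, so give it a readable label.
        counts[first_segment or "(root)"] += 1
    return counts


# Tiny stand-in for the real file; the blog URL is hypothetical.
SAMPLE_SITEMAP = """
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>http://www.thesquarefoot.com/</loc></url>
  <url><loc>http://www.thesquarefoot.com/listings/Houston/TX/77098/Central_Houston/3910_Kirby_Dr/Suite_204</loc></url>
  <url><loc>http://www.thesquarefoot.com/blog/example-post</loc></url>
</urlset>
"""

section_counts = count_by_section(SAMPLE_SITEMAP)
```

For the sample, `section_counts` holds one URL each under `(root)`, `listings` and `blog`. Running the same function on the full sitemap text would show whether the file really contains the expected number of listing URLs, which is worth ruling out before looking at how many of them Google has picked up.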

          Any help would be appreciated!

          Aron

          [Attached screenshot: sitemap.png]
