The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Intermediate & Advanced SEO
    4. Can anyone help me diagnose an indexing/sitemap issue on a large e-commerce site?

    Can anyone help me diagnose an indexing/sitemap issue on a large e-commerce site?

    Intermediate & Advanced SEO
    4 2 80
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • webrocket
      webrocket last edited by

      Hey guys. Wondering if someone can help diagnose a problem for me.

      Here's our site: https://www.flagandbanner.com/

      We have a fairly large e-commerce site--roughly 23,000 urls according to crawls using both Moz and Screaming Frog. I have created an XML sitemap (using SF) and uploading to Webmaster Tools. WMT is only showing about 2,500 urls indexed. Further, WMT is showing that Google is indexing only about 1/2 (approx. 11,000) of the urls. Finally (to add even more confusion), when doing a site search on Google (site:) it's only showing about 5,400 urls found. The numbers are all over the place!

      Here's the robots.txt file:

      User-agent: *
      Allow: /
      Disallow: /aspnet_client/
      Disallow: /httperrors/
      Disallow: /HTTPErrors/
      Disallow: /temp/
      Disallow: /test/

      Disallow: /i_i_email_friend_request
      Disallow: /i_i_narrow_your_search
      Disallow: /shopping_cart
      Disallow: /add_product_to_favorites
      Disallow: /email_friend_request
      Disallow: /searchformaction
      Disallow: /search_keyword
      Disallow: /page=
      Disallow: /hid=
      Disallow: /fab/*

      Sitemap: https://www.flagandbanner.com/images/sitemap.xml

      Anyone have any thoughts as to what our problems are??

      Mike

      1 Reply Last reply Reply Quote 0
      • rjonesx. 0
        rjonesx. 0 last edited by

        Thanks for the question!

        First, it is very common to get inconsistent answers from GSC, site:, sitemap and crawl results. Don't worry too much about that.

        Your goal is to get as many of your pages indexed and that is a function of links pointing to your site and internal link structure. While it is an imperfect analogy, we often refer to this as "crawl budget".  There are essentially 2 solutions to this...

        1. Get more/better backlinks to a diversity of pages on your site.

        2. Improve your internal link architecture so that Googlebot finds your pages more quickly.

        I think the problem in your case is that the site inundates bots with generic navigational links. For example, this page...

        http://www.flagandbanner.com/products/chrome-air-force-lt-general-flag-kit.asp

        has 1400 internal links! That is crazy!

        This page has 1500!

        https://www.flagandbanner.com/products/citizenship-gifts.asp

        You need to reel this back in dramatically. Your navigation should like to top level categories or maybe a handful of subcategories. Once in a category, you can reveal deeper categories. This will increase the likelihood that the related and "also" buy links that you find on product pages will get found and followed by Googlebot.

        Finally, on a different note, you need to make sure you standardize the casing of URLs (ie: /Products/ or /products/) I noticed that you have links both internal and external that do not take this into account, causing unnecessary duplicate content.

        webrocket 1 Reply Last reply Reply Quote 1
        • webrocket
          webrocket @rjonesx. 0 last edited by

          Thanks so much for your response, Russ.

          You're confirming one of the many issues we have identified (too many internal links) but I had not connected it to indexing or site speed. When I use the Google Page Speed Tool, many of our pages are not even registering. It seems like it's taking too long to load them so it times out. Could the crazy amount of links have to do with this, too?

          Moreover, our mobile speed is especially poor. This could be an even bigger problem in mobile, no?

          Are you familiar with .asp sites, in particular, having indexing issues...or is that a false assumption?

          Mike

          1 Reply Last reply Reply Quote 0
          • rjonesx. 0
            rjonesx. 0 last edited by

            A site running ASP should be perfectly fine. I bet you will see substantial increases in a lot of positive metrics by just pairing down that navigation.

            1 Reply Last reply Reply Quote 1
            • 1 / 1
            • First post
              Last post
            • I have a metadata issue. My site crawl is coming back with missing descriptions, but all of the pages look like site tags (i.e. /blog/?_sft_tag=call-routing)
              Rajesh.Prajapati
              Rajesh.Prajapati
              0
              2
              43

            • How do we decide which pages to index/de-index? Help for a 250k page site
              julie-getonthemap
              julie-getonthemap
              0
              2
              63

            • Client wants to remove mobile URLs from their sitemap to avoid indexing issues. However this will require SEVERAL billing hours. Is having both mobile/desktop URLs in a sitemap really that detrimental to search indexing?
              RosemaryB
              RosemaryB
              0
              7
              89

            • Do image sitemaps provide value for non e-commerce sites?
              EGOL
              EGOL
              1
              4
              314

            • Huge Google index on E-commerce site
              ssiebn7
              ssiebn7
              0
              5
              757

            • Indexing an e-commerce site
              jenga11
              jenga11
              0
              9
              594

            • What on-page/site optimization techniques can I utilize to improve this site (http://www.paradisus.com/)?
              RyanKent
              RyanKent
              0
              2
              626

            • Can a XML sitemap index point to other sitemaps indexes?
              KeriMorgret
              KeriMorgret
              0
              3
              1.9k

            Get started with Moz Pro!

            Unlock the power of advanced SEO tools and data-driven insights.

            Start my free trial
            Products
            • Moz Pro
            • Moz Local
            • Moz API
            • Moz Data
            • STAT
            • Product Updates
            Moz Solutions
            • SMB Solutions
            • Agency Solutions
            • Enterprise Solutions
            • Digital Marketers
            Free SEO Tools
            • Domain Authority Checker
            • Link Explorer
            • Keyword Explorer
            • Competitive Research
            • Brand Authority Checker
            • Local Citation Checker
            • MozBar Extension
            • MozCast
            Resources
            • Blog
            • SEO Learning Center
            • Help Hub
            • Beginner's Guide to SEO
            • How-to Guides
            • Moz Academy
            • API Docs
            About Moz
            • About
            • Team
            • Careers
            • Contact
            Why Moz
            • Case Studies
            • Testimonials
            Get Involved
            • Become an Affiliate
            • MozCon
            • Webinars
            • Practical Marketer Series
            • MozPod
            Connect with us

            Contact the Help team

            Join our newsletter
            Moz logo
            © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
            • Accessibility
            • Terms of Use
            • Privacy