The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Technical SEO Issues
    4. Sitemap url's not being indexed

    Sitemap url's not being indexed

    Technical SEO Issues
    7 4 158
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • GreenStone
      GreenStone last edited by

      There is an issue on one of our sites regarding many of the sitemap url's not being indexed.  (at least 70% is not being indexed)

      The url's in the sitemap are normal url's without any strange characters attached to them, but after looking into it, it seems a lot of the url's get a #. + a number sequence attached to them once you actually go to that url. We are not sure if the "addthis" bookmark could cause this, or if it's another script doing it.

      For example

      Url in the sitemap: http://example.com/example-category/0246

      Url once you actually go to that link: http://example.com/example-category/0246#.VR5a

      Just for further information, the XML file does not have any style information associated with it and is in it's most basic form.

      Has anyone had similar issues with their sitemap not being indexed properly ?...Could this be the cause of many of these url's not being indexed ?

      Thanks all for your help.

      1 Reply Last reply Reply Quote 0
      • PatrickDelehanty
        PatrickDelehanty last edited by

        Hi there

        Could you provide you website's URL? It would help the community take a deeper look - thanks!

        Good luck!

        GreenStone 1 Reply Last reply Reply Quote 2
        • LesleyPaone
          LesleyPaone last edited by

          Yes, add this is doing this to your url. I hate it, that is one reason why I do not use them.

          Here is an article on how to remove them, http://support.addthis.com/customer/portal/articles/1013558-removing-all-hashtags-anchors-weird-codes-from-your-urls

          GreenStone 1 Reply Last reply Reply Quote 4
          • GreenStone
            GreenStone @PatrickDelehanty last edited by

            Patrick,

            We'd prefer to keep the actual url's private, however I can provide further information to help hopefully allow the community to dissect this further:

            • It's an E-commerce website, meaning many facets, filters, and possible duplicate content angles
            • It seems many of the static pages (/products main page, /contact,etc) are indexed, however it seems the individual products are mostly not being indexed through the sitemap
            • While the url's found in webmaster tools under "index" has also steadily been going down, it definitely doesn't correspond with the lack of pages indexed vs submitted within the sitemap
            • We have checked robots.txt, and it is not blocking any important pages. (I also had them allow robots to crawl css and js so google could have full access)
            • The individual product pages all have the "addthis" feature, meaning they all have a #. + number sequence added to the url's. However one would think this wouldn't be the cause of this lack of indexation ?

            Thanks for your help.

            1 Reply Last reply Reply Quote 0
            • GreenStone
              GreenStone @LesleyPaone last edited by

              Lesley,

              Thanks for the confirmation on that one and the article. Since it doesn't seem like a lot of people on the site are using that address share function, I do not think it would do any harm to remove it.

              At least we know the root cause of why it's doing it to the url's. Now the real question is...could it be getting in the way of indexing those url's ?...one would think not, as from what I've read, google would simply ignore what comes after the #.

              Thoughts ?

              Appreciate the help.

              AndersS 1 Reply Last reply Reply Quote 0
              • AndersS
                AndersS @GreenStone last edited by

                I agree - i probably would ignore everything after the "#".

                But have you tried added a <link rel="canonical" href="http://example.com/page-url" /> to your pages and see if this will update it? Also: Add the sitemap to your robots.txt if not allready done.

                Regarding indexing pages - have you restricted crawl frequency in Google Search Console, or is it set to be determined by GoogleBot? Any other warnings or messages in Search Console?

                Best regards,
                Anders

                GreenStone 1 Reply Last reply Reply Quote 1
                • GreenStone
                  GreenStone @AndersS last edited by

                  Anders,

                  Thanks for the reply. I definitely agree a self referring canonical might just be a good extra addition on these product pages, so I'm definitely adding that to our list of to do's if it does not improve.

                  In terms of indexing pages - We have not restricted crawl frequency, we have it set to "allow google to determine the optimal crawl rate".  No other warnings found within the search console either.

                  Thanks for your help.

                  1 Reply Last reply Reply Quote 0
                  • 1 / 1
                  • First post
                    Last post
                  • Clean URL vs. Parameter URL and Using Canonical URL...That's a Mouthfull!
                    Roman-Delcarmen
                    Roman-Delcarmen
                    0
                    4
                    170

                  • Strange URL's for client's site
                    everestagency
                    everestagency
                    0
                    3
                    456

                  • Sitemap issue? 404's & 500's are regenerating?
                    jeff-rackaid.com
                    jeff-rackaid.com
                    0
                    5
                    263

                  • What's the best way to handle Overly Dynamic Url's?
                    GKLA
                    GKLA
                    0
                    2
                    309

                  • Best Practices for adding Dynamic URL's to XML Sitemap
                    DeanAndrews
                    DeanAndrews
                    0
                    3
                    7.4k

                  • How can I best find out which URLs from large sitemaps aren't indexed?
                    Audiohype
                    Audiohype
                    0
                    4
                    271

                  • I'm redesigning a website which will have a new URL format. What's the best way to redirect all the old URLs to the new ones? Is there an automated, fast way to do this?
                    GregFindley.co.uk
                    GregFindley.co.uk
                    0
                    2
                    303

                  • Google indexing less url's then containded in my sitemap.xml
                    Juist
                    Juist
                    0
                    7
                    3.4k

                  Get started with Moz Pro!

                  Unlock the power of advanced SEO tools and data-driven insights.

                  Start my free trial
                  Products
                  • Moz Pro
                  • Moz Local
                  • Moz API
                  • Moz Data
                  • STAT
                  • Product Updates
                  Moz Solutions
                  • SMB Solutions
                  • Agency Solutions
                  • Enterprise Solutions
                  • Digital Marketers
                  Free SEO Tools
                  • Domain Authority Checker
                  • Link Explorer
                  • Keyword Explorer
                  • Competitive Research
                  • Brand Authority Checker
                  • Local Citation Checker
                  • MozBar Extension
                  • MozCast
                  Resources
                  • Blog
                  • SEO Learning Center
                  • Help Hub
                  • Beginner's Guide to SEO
                  • How-to Guides
                  • Moz Academy
                  • API Docs
                  About Moz
                  • About
                  • Team
                  • Careers
                  • Contact
                  Why Moz
                  • Case Studies
                  • Testimonials
                  Get Involved
                  • Become an Affiliate
                  • MozCon
                  • Webinars
                  • Practical Marketer Series
                  • MozPod
                  Connect with us

                  Contact the Help team

                  Join our newsletter
                  Moz logo
                  © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
                  • Accessibility
                  • Terms of Use
                  • Privacy