The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. On-Page / Site Optimization
    4. Sitemap include all site links or just ones we want indexed?

    Sitemap include all site links or just ones we want indexed?

    On-Page / Site Optimization
    4 2 758
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • Whebb
      Whebb last edited by

      Got a quick sitemap question. We have a clients site built in opencart and are getting ready to submit the sitmap. The default sitemap setting generates urls right off of the root. For example site.com/product. These urls are also accessible through the site itself. We prefer to give the site some depth and have structured the products so the urls are site.com/category/product. All of the product pages have canonicals including the category so we should not have to worry about duplicate content on the /product page vs the /category/product page. My question is both types of product pages are included in the sitemap at the moment. Since we don't want google to index the /product urls should we leave them off of the sitemap even though they are readily accessible from the frontend(though not linked)? Or just leave them and let the canonical tag be used in directing google as to which urls to index. Thanks in advance.

      1 Reply Last reply Reply Quote 0
      • Ray-pp
        Ray-pp last edited by

        Hi JStrong,

        Great question to be asking and an important topic to be doing your due diligence on, especially when dealing with an eCommerce related website.

        Google uses a sitemap as a guideline for crawling your site. So, just because you put a URL in your sitemap, doesn't mean that they URL will actually be indexed. You can see those stats in your Google Webmaster Tools account, under the Sitemap area. It will display how many URLs are in the sitemap and how many out of those URLs are indexed.

        If you do not want certain pages to be indexed by Google, then you would need to adjust your robots.txt file to give Google those instructions.

        As long as you have the correct Canonical configurations, you should avoid any duplicate content issues from the URLs you've described above.

        Good luck!

        1 Reply Last reply Reply Quote 1
        • Whebb
          Whebb last edited by

          Hey Raymond,

          Thanks for the response, feel like I'm over thinking this a bit, as usually we just leave our opencart setups as is, other then a few minor tweaks. Lately I've really been scrutinizing opencart's SEO setup and how to improve it, since it seems there are a lot of gaps in he way it handles this.

          I thought the robots.txt would have been a good way to block the pages, but the issue is I would need to block every single product page as opencart automatically creates a page for every product that is site.com/product and since we are adding lots of products there should be a better way to handle this.  After I posted I came across this tidbit from a 6 year old google webmaster central blog post. Basically it states that 'While we can't guarantee that our algorithms will display that particular URL in search results, it's still helpful for you to indicate your preference by including that URL in your Sitemap. '. I think going this route along with the canonical should do the trick.

          1 Reply Last reply Reply Quote 0
          • Ray-pp
            Ray-pp last edited by

            Hi again JS,

            I think it's great that you continue to evaluate your platform from all perspectives and evaluate its strengths/weaknesses. Many times, a platform can do a lot of the basics well, but fall short on the details that differentiate us from our competition. For example, opencart may do the basic SEO requirements well, but not include ecommerce microdata (schema.org) which have a high impact on our search listings.

            You can do a lot of harm/good with the robots.txt file - like deindex entire website (probably not a good thing) or block certain directories (your /product issue). I would gain some deeper knowledge about what you can do with the robots.txt file and how you need it to perform for your business.

            1 Reply Last reply Reply Quote 0
            • 1 / 1
            • First post
              Last post
            • What to do to index all my links of my website?
              paints-n-design
              paints-n-design
              0
              2
              68

            • Linking Out To External Sites
              artdivision
              artdivision
              0
              9
              131

            • One site, one location, multiple languages - best approach?
              gfiorelli1
              gfiorelli1
              0
              7
              531

            • What is the best way to optimise a site that wants to target one service in multiple locations?
              chris.kent
              chris.kent
              1
              4
              186

            • One Page Website vs. Multipage Site, if you want to target one specific Keyword only.
              Valarlf
              Valarlf
              0
              2
              318

            • One site with one product or multi product website
              hith234
              hith234
              0
              8
              2.4k

            • ON SITE SEARCH INDEXED BY GOOGLE - no follow or no index
              ShaMenz
              ShaMenz
              0
              4
              770

            • On my site, www.myagingfolks.com, only a small number of my pages appear to be indexed by google or yahoo. Is that due to not having an XML sitemap, keywords, or some other problem?
              Jordanrg
              Jordanrg
              0
              3
              687

            Get started with Moz Pro!

            Unlock the power of advanced SEO tools and data-driven insights.

            Start my free trial
            Products
            • Moz Pro
            • Moz Local
            • Moz API
            • Moz Data
            • STAT
            • Product Updates
            Moz Solutions
            • SMB Solutions
            • Agency Solutions
            • Enterprise Solutions
            • Digital Marketers
            Free SEO Tools
            • Domain Authority Checker
            • Link Explorer
            • Keyword Explorer
            • Competitive Research
            • Brand Authority Checker
            • Local Citation Checker
            • MozBar Extension
            • MozCast
            Resources
            • Blog
            • SEO Learning Center
            • Help Hub
            • Beginner's Guide to SEO
            • How-to Guides
            • Moz Academy
            • API Docs
            About Moz
            • About
            • Team
            • Careers
            • Contact
            Why Moz
            • Case Studies
            • Testimonials
            Get Involved
            • Become an Affiliate
            • MozCon
            • Webinars
            • Practical Marketer Series
            • MozPod
            Connect with us

            Contact the Help team

            Join our newsletter
            Moz logo
            © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
            • Accessibility
            • Terms of Use
            • Privacy