The Moz Q&A Forum


    Crawl diagnostic errors due to query string

    Feature Requests
    • jmorehouse
      jmorehouse last edited by

      I'm seeing a large number of duplicate page titles, duplicate content, missing meta descriptions, etc. in my Crawl Diagnostics Report because of query strings in URLs. These pages already have canonical tags, but I know canonical tags aren't considered in Moz's Crawl Diagnostics reports and therefore won't reduce the number of reported errors. Is there any way to configure Moz so that it doesn't treat query string variants as unique URLs? It's difficult to find a legitimate error among hundreds of these non-errors.

      • PatrickDelehanty
        PatrickDelehanty last edited by

        Hi there

        Check out Google's duplicate content resources - they provide help on how to categorize your parameters and URL strings.

        You can also handle this via your robots.txt. Make sure that you have a canonical tag on that page as well.

        Hope this helps! Good luck!

        • jmorehouse
          jmorehouse @PatrickDelehanty last edited by

          Hi Patrick,

          Thanks for the quick reply as always. As far as Google is concerned, these pages are set up correctly with canonical tags and URL strings. Moz actually reports far more duplicate content than Webmaster Tools does.

          My issue is just with the number of errors reported in Moz. You mentioned that I can handle this via the robots.txt file. Is there a way to disallow only Rogerbot from crawling URLs with query strings, or URLs that contain a certain phrase such as "item_id=" or "cat_id="?

          • PatrickDelehanty
            PatrickDelehanty @jmorehouse last edited by

            Hi there

            My bad! Yeah - you could just do this:

            User-agent: Rogerbot
            Disallow: (check out this resource on how to block specific query strings)

            Hope this helps! Good luck!

            • AGMContainerControls
              AGMContainerControls @jmorehouse last edited by

              Is your traffic lower than expected?

              I was having an issue like this where Moz was showing a lot more duplicate content than Webmaster Tools was (Webmaster Tools actually showed none), but I was being penalized. I realized this when I added a rule to robots.txt to exclude any query strings on my site. After I did this, I saw my rankings shoot through the roof.

              Not saying that this is happening to you but I just like to err on the side of caution.

              • jmorehouse
                jmorehouse @AGMContainerControls last edited by

                This is very interesting! Strange that Webmaster Tools wouldn't display duplicate content, but Google would still penalize you. I'd like to try this on my site, but I'm a little wary because I think some pages rank with the query string version of the URL, despite a canonical being specified.

                • kevin.loesken
                  kevin.loesken last edited by

                  Hi there!

                  Our tool has a 90% tolerance for duplicate content, which means it flags pages that share 90% of the same code. This includes all the source code on the page and not just the viewable text. You can run your own tests using this tool: http://www.webconfs.com/similar-page-checker.php. In the case of http://www.optp.com/SMARTROLLER?cat_id=205#.VZreQhNVhBc and http://www.optp.com/SMARTROLLER?cat_id=54#.VZrdJhNVhBc, these pages are 100% similar, which is why they're being flagged.
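
To get a rough feel for what a 90% threshold means, here is a minimal Python sketch that compares the full source of two pages. It is only an illustration: it uses the standard library's difflib as a stand-in, which is an assumption and not how Moz's crawler is known to measure similarity. The names DUPLICATE_THRESHOLD, similarity and is_duplicate are made up for this example.

```python
from difflib import SequenceMatcher

# Assumed cutoff, taken from the "90% tolerance" described in this thread.
DUPLICATE_THRESHOLD = 0.90


def similarity(source_a: str, source_b: str) -> float:
    """Return a ratio between 0.0 and 1.0 for two page sources (markup included)."""
    # autojunk=False stops difflib from ignoring very frequent characters on long inputs.
    return SequenceMatcher(None, source_a, source_b, autojunk=False).ratio()


def is_duplicate(source_a: str, source_b: str,
                 threshold: float = DUPLICATE_THRESHOLD) -> bool:
    """Flag two pages as duplicates when their similarity reaches the threshold."""
    return similarity(source_a, source_b) >= threshold
```

Feeding it the raw HTML of two query string variants of the same page should give a ratio at or near 1.0, which matches the behaviour described here: the markup counts, not just the visible text.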

                  I hope this helps! If you need any more help with your crawl, feel free to contact our Help Team at help@moz.com.

                  Thanks!

                  Kevin
                  Help Team

                  • jmorehouse
                    jmorehouse @kevin.loesken last edited by

                    Hi Kevin,

                    I understand how Moz's duplicate content system works. It would just be nice if it could take canonical URLs into consideration for Crawl Diagnostics reports, or give you the option of not counting URLs with appended parameters as unique pages.
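
As an illustration of what "not counting URLs with parameters as unique pages" could look like when post-processing a crawl export yourself, here is a small Python sketch. It is not a Moz feature; strip_query and group_by_base_url are hypothetical helper names, and the sketch assumes that every query string variant really is the same page.

```python
from collections import defaultdict
from urllib.parse import urlsplit, urlunsplit


def strip_query(url: str) -> str:
    """Drop the query string and fragment, keeping scheme, host and path."""
    parts = urlsplit(url)
    return urlunsplit((parts.scheme, parts.netloc, parts.path, "", ""))


def group_by_base_url(urls):
    """Map each base URL to the list of crawled variants that collapse onto it."""
    groups = defaultdict(list)
    for url in urls:
        groups[strip_query(url)].append(url)
    return dict(groups)
```

Run over the two SMARTROLLER URLs mentioned earlier in the thread, both variants land under http://www.optp.com/SMARTROLLER, so they could be counted as one page instead of two.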

                    Patrick was able to help me figure out that I can do the latter in robots.txt by adding a wildcard rule for Rogerbot: "Disallow: *?" (the rule itself has no trailing period).
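
For anyone who wants to check which URLs a wildcard rule would catch before deploying it, here is a rough Python sketch of the commonly used wildcard matching, where "*" matches any run of characters and a trailing "$" anchors the end. It assumes the crawler follows that convention, ignores Allow rules and user-agent groups, and is not Rogerbot's actual parser. The narrower rules "/*item_id=" and "/*cat_id=" are hypothetical examples built from the parameter names mentioned above. Some parsers may expect rules to start with a slash, so "/*?" may be the safer spelling; that is worth confirming in the crawler's documentation.

```python
import re


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate one Disallow value into a regex anchored at the start of the path."""
    anchored = rule.endswith("$")
    if anchored:
        rule = rule[:-1]
    # Escape the literal pieces, then rejoin them with ".*" where "*" appeared.
    body = ".*".join(re.escape(part) for part in rule.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_blocked(path_and_query: str, disallow_rules) -> bool:
    """Return True if any non-empty Disallow rule matches the path plus query string."""
    return any(
        rule_to_regex(rule).search(path_and_query)
        for rule in disallow_rules
        if rule  # an empty Disallow value blocks nothing
    )


# The rule from this thread, plus hypothetical narrower alternatives.
ALL_QUERY_STRINGS = ["*?"]
SPECIFIC_PARAMETERS = ["/*item_id=", "/*cat_id="]
```

With ALL_QUERY_STRINGS, "/SMARTROLLER?cat_id=205" is blocked and "/SMARTROLLER" is not. With SPECIFIC_PARAMETERS, only URLs containing "item_id=" or "cat_id=" are blocked, which leaves other query strings crawlable.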

                    • kevin.loesken
                      kevin.loesken last edited by

                      I'm glad to hear you got this figured out - thank you Patrick for your help! 🙂

                      Kevin
                      Help Team
