The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Technical SEO Issues
    4. Disallowing URL Parameters vs. Canonicalizing

    Disallowing URL Parameters vs. Canonicalizing

    Technical SEO Issues
    5 3 58
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • Alces
      Alces last edited by

      Hi all,

      I have a client that has a unique search setup. So they have Region pages (/state/city). We want these indexed and are using self-referential canonicals.

      They also have a search function that emulates the look of the Region pages. When you search for, say, Los Angeles, the URL changes to _/search/los+angeles  _and looks exactly like /ca/los-angeles.

      These search URLs can also have parameters (/search/los+angeles?age=over-2&time[]=part-time), which we obviously don't want indexed.

      Right now my concern is how best to ensure the /search pages don't get indexed and we don't get hit with duplicate content penalties. The options are this:

      1. Self-referential canonicals for the Region pages, and disallow everything after the second slash in /search/ (so the main search page is indexed)

      2. Self-referential canonicals for the Region pages, and write a rule that automatically canonicalizes all other search pages to /search.

      Potential Concern: /search/ URLs are created even with misspellings.

      Thanks!

      1 Reply Last reply Reply Quote 1
      • jasongmcmahon
        jasongmcmahon last edited by

        Hi there

        Canonical tags prevent problems caused by identical or "duplicate" content across multiple URLs.  So in this instance implement the disallow rule on al of the  URLs containing /search/

        1 Reply Last reply Reply Quote 0
        • effectdigital
          effectdigital last edited by

          Robots.txt controls crawling, not indexation. Google will still sometimes index pages they cannot crawl. Canonical tags are for duplicate content consolidation, but are not a hard signal and Google frequently ignores them. Meta no-index tags (or X-robots no-index through the HTTP header, if you cannot apply Meta no-index in the HTML) is a harder signal and is meant to help you control indexation

          To be honest if the pages are identical why not just 301 redirect the relevant searches (the top-line ones, which result in pages exactly the same as your regional ones) to the regional URLs? If the pages really are the same, it won't be any different for users except for a small delay during the redirect (which won't really be felt, especially if you are using Nginx redirects)

          If you can't do that, you're really left with the Meta no-index tag and the canonical tag. Canonical tags avoid content duplication penalties but are a softer signal and they don't consolidate link equity like 301 redirects do (so in many way, there's not actually that much different between Meta no-index and canonicals, except canonical tags are more complex to set up in the first place as they require a destination field)

          I'd probably just Meta no-index all the search URLs. Once Google had swallowed that, I would then (after 2-3 weeks) apply the relevant robots.txt rules

          If you apply them both at the same time, Google won't be able to crawl the search URLs (since your robots.txt rule will block them) and therefore they will be blind to your canonical / Meta no index directive(s). So you have to handle de-indexation first, and THEN after that block the crawling to save your crawl allowance a bit

          But don't do it all at once or you'll get in an unholy mess!

          Alces 1 Reply Last reply Reply Quote 0
          • Alces
            Alces @effectdigital last edited by

            Yeah, I know the difference between the two, I've just been in a situation where canonicals were recommended as a means of controlling the preferred page _within an indexation context. _If that makes sense.

            My biggest concern is with the creation of URLs from misspellings, which still return search results if it's close enough. The redirects could work. Honestly that wasn't something we considered.

            I'm liking the noindex approach. They'd have to write a rule that applies it to every page created with a search parameter, which I think they should be able to do.

            If it helps, almost the entire site is run by Javascript. Like...everything.

            Thanks for the advice. Much appreciated.

            -Brad

            effectdigital 1 Reply Last reply Reply Quote 1
            • effectdigital
              effectdigital @Alces last edited by

              Just so you know Meta no-index can be applied through the HTML but also through the HTTP header which might make it easier to implement on such a highly generated website

              1 Reply Last reply Reply Quote 0
              • 1 / 1
              • First post
                Last post
              • SEO Elements for Canonicalized URLs?
                R0bin_L0rd
                R0bin_L0rd
                0
                5
                57

              • URL Parameters to Ignore
                Martijn_Scheijbeler
                Martijn_Scheijbeler
                1
                6
                630

              • URL Parameters as pagination
                LoganRay
                LoganRay
                0
                2
                239

              • URL Parameters
                DirkC
                DirkC
                1
                2
                120

              • How do I deindex url parameters
                BT2009
                BT2009
                0
                5
                793

              • GWT, URL Parameters, and Magento
                Marketing.SCG
                Marketing.SCG
                0
                4
                2.6k

              • Dynamic Parameters in URL
                sesertin
                sesertin
                0
                2
                513

              • Ignore url parameters without the 'parameter=' ?
                Hakkasan
                Hakkasan
                0
                3
                950

              Get started with Moz Pro!

              Unlock the power of advanced SEO tools and data-driven insights.

              Start my free trial
              Products
              • Moz Pro
              • Moz Local
              • Moz API
              • Moz Data
              • STAT
              • Product Updates
              Moz Solutions
              • SMB Solutions
              • Agency Solutions
              • Enterprise Solutions
              • Digital Marketers
              Free SEO Tools
              • Domain Authority Checker
              • Link Explorer
              • Keyword Explorer
              • Competitive Research
              • Brand Authority Checker
              • Local Citation Checker
              • MozBar Extension
              • MozCast
              Resources
              • Blog
              • SEO Learning Center
              • Help Hub
              • Beginner's Guide to SEO
              • How-to Guides
              • Moz Academy
              • API Docs
              About Moz
              • About
              • Team
              • Careers
              • Contact
              Why Moz
              • Case Studies
              • Testimonials
              Get Involved
              • Become an Affiliate
              • MozCon
              • Webinars
              • Practical Marketer Series
              • MozPod
              Connect with us

              Contact the Help team

              Join our newsletter
              Moz logo
              © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
              • Accessibility
              • Terms of Use
              • Privacy