The Moz Q&A Forum

    Robots.txt file question? Never seen this command before

    Technical SEO Issues
    • RobMay
      RobMay last edited by

      Hey Everyone!

      Perhaps someone can help me. I came across this command in the robots.txt file of our Canadian corporate domain. I looked around online but couldn't find a definitive answer (only slightly relevant results).

      The line is as follows:

      Disallow: /*?*
      

      I'm guessing this might have something to do with blocking PHP string searches on the site. It might also have something to do with blocking subdomains, but the "?" puzzles me 😞

      Any help would be greatly appreciated!

      Thanks, Rob

      • AdoptionHelp
        AdoptionHelp last edited by

        It's preventing spiders from crawling pages with parameters in the URL. For example, when you search on Google you'll see a URL like this:

        http://www.google.com/search?q=seo

        This passes the parameter q with a value of 'seo' to the page at google.com for it to work its magic with. Blocking URLs like this is almost definitely a good thing, unless the only way to access some content on your site is via URL parameters.
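
To make the matching concrete, here is a minimal Python sketch of how a wildcard-aware crawler might test a URL against this rule. It assumes the pattern syntax that Google documents and that RFC 9309 later standardised, where * matches any run of characters, a trailing $ anchors the end, and ? is an ordinary character. The names rule_to_regex and is_blocked are made up for illustration, and real crawlers may differ in details such as percent-encoding.

```python
import re
from urllib.parse import urlsplit


def rule_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    Assumed syntax: '*' matches any run of characters, a trailing '$'
    anchors the end, and every other character (including '?') is literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' where '*' appeared.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_blocked(url: str, pattern: str = "/*?*") -> bool:
    """Return True if the URL's path plus query matches the Disallow pattern."""
    parts = urlsplit(url)
    target = parts.path or "/"
    if parts.query:
        target += "?" + parts.query
    # Rules are matched from the start of the path, so use match(), not search().
    return rule_to_regex(pattern).match(target) is not None
```

With the default rule, is_blocked("http://www.google.com/search?q=seo") is true, while a URL with no query string, such as http://www.yourdomain.com/products.php, is not blocked.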

        • RobMay
          RobMay @AdoptionHelp last edited by

          So should I keep this line in the robots.txt file?

          • rhutchings
            rhutchings last edited by

            It's not a bad idea to have it in robots.txt, but unless you are 100% confident that you won't block something you really want crawled, I would consider handling unwanted parameters and pages through the new Google Webmaster Tools URL handling toolset. That way you have more control over which ones do and don't get blocked.

            • RobMay
              RobMay @rhutchings last edited by

              Thanks Ryan and Ryan! I'm just unfamiliar with this command set in the robots file, and I'm still getting settled into the company (5 weeks), so I am still learning the site's structure and architecture. With it all being new to me, and with the limitations I am seeing on the CMS side, I was wondering if this might have been causing crawl issues for Bing and/or Yahoo. I'm trying to gauge where we might be experiencing problems with how the site gets crawled.

              • AdoptionHelp
                AdoptionHelp @RobMay last edited by

                It depends on how your site is structured.

                For example, if you have a page at

                http://www.yourdomain.com/products.php

                and this shows different things based on the parameter, like:

                http://www.yourdomain.com/products.php?type=widgets

                You will want to get rid of this line in your robots.txt.

                However, if the parameters don't change the content on the page, you can leave it in.
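
If some parameter URLs need to stay crawlable while the rest stay blocked, one option (not mentioned by the posters here, so treat it as an assumption) is to put a more specific Allow line alongside the wildcard Disallow. The sketch below models the precedence that Google documents and RFC 9309 describes: the longest matching pattern wins, and a tie goes to Allow. The helper names and the RULES list are hypothetical; the type parameter comes from the example URL in this reply.

```python
import re


def _to_regex(pattern: str) -> re.Pattern:
    """'*' is a wildcard, a trailing '$' anchors the end, the rest is literal."""
    anchored = pattern.endswith("$")
    core = pattern[:-1] if anchored else pattern
    body = ".*".join(re.escape(piece) for piece in core.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path: str, rules: list[tuple[str, str]]) -> bool:
    """Apply longest-match precedence to (directive, pattern) pairs.

    No matching rule means the path is allowed; equal-length matches
    are resolved in favour of 'allow'.
    """
    best_len, allowed = -1, True
    for directive, pattern in rules:
        if _to_regex(pattern).match(path):
            is_allow = directive.lower() == "allow"
            longer = len(pattern) > best_len
            tie_for_allow = len(pattern) == best_len and is_allow
            if longer or tie_for_allow:
                best_len, allowed = len(pattern), is_allow
    return allowed


# Hypothetical rule set: keep product-type pages crawlable, block other parameters.
RULES = [
    ("allow", "/products.php?type="),
    ("disallow", "/*?*"),
]
```

Under these assumed rules, /products.php?type=widgets stays crawlable, /products.php?sessionid=abc is blocked, and /products.php (no parameters) is unaffected. Whether a given crawler honours Allow and wildcards this way is worth checking against that crawler's own documentation.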

                • omnea
                  omnea @AdoptionHelp last edited by

                  I don't think this is correct.

                  /*?* is an attempt at using a RegEx in the robots file, which I don't think works.

                  Further, if it was a properly formed regex, it would be ?

                  * is a special character for the User-agent line, meaning all robots. For the Disallow line, I believe you have to use a specific directory or page.

                  http://www.robotstxt.org/robotstxt.html

                  I could be wrong, but the info on this site has been my understanding from the past too.
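
On the regex question, a quick check may help separate the two syntaxes. The original specification at robotstxt.org has no wildcards, but Google's documented syntax (later standardised in RFC 9309) treats * as a plain wildcard and ? as a literal character, which is not regex at all. The Python sketch below shows that /*?* is indeed malformed when read as a regular expression, and gives one regex with roughly the same intent. The names try_as_regex and EQUIVALENT_REGEX are illustrative only.

```python
import re

RULE_PATH = "/*?*"  # the path pattern from the Disallow line


def try_as_regex(pattern: str):
    """Return the regex compile error message, or None if the pattern compiles."""
    try:
        re.compile(pattern)
    except re.error as exc:
        return str(exc)
    return None


# A regex with roughly the same intent as the wildcard rule:
# a slash, anything, a literal question mark, then anything.
EQUIVALENT_REGEX = re.compile(r"/.*\?.*")
```

Here try_as_regex(RULE_PATH) returns an error message rather than None, because the second * tries to repeat an already quantified item. Read as a wildcard pattern instead, the same rule simply means "any path that contains a question mark".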
