The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Intermediate & Advanced SEO
    4. Disallow: /jobs/? is this stopping the SERPs from indexing job posts

    Disallow: /jobs/? is this stopping the SERPs from indexing job posts

    Intermediate & Advanced SEO
    9 4 299
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • JamesHancocks1
      JamesHancocks1 last edited by

      Hi,
      I was wondering what this would be used for as it's in the Robots.exe of a recruitment agency website that posts jobs. Should it be removed?

      Disallow: /jobs/?
      Disallow: /jobs/page/*/

      Thanks in advance.
      James

      1 Reply Last reply Reply Quote 0
      • MattJanaway
        MattJanaway last edited by

        I'd guess that the jobs get pulled from a job board. If this is the case, then the content ( job description, title etc.) will just be a duplication of the content that can be found in many other locations. If a plugin is used, they sometimes automatically add a disallow into the robots.txt file as to not hurt the parent version of the job page by creating thousands of duplicate content issues.

        I'd recommend creating some really high-quality hub pages based on job type, or location and pulling the relevant jobs into that page, instead of trying to index and rank the actual job pages.

        1 Reply Last reply Reply Quote 2
        • Keszi
          Keszi last edited by

          Hi James,

          Regarding the robots.txt syntax:

          Disallow: /jobs/? which basically blocks every single URL that contains /jobs/**? **

          For example: domain.com**/jobs/?**sort-by=... will be blocked

          If you want to disallow query parameters from URL, the correct implementation would be Disallow: /jobs/*? or even specify which query parameter you want to block. For example Disallow: /jobs/*?page=

          My question to you, if these jobs are linked from any other page and/or sitemap? Or only from the listing page, which has it's pagination, sorting, etc. is blocked by robots.txt? If they are not linked, it could be a simple case of orphan pages, where basically the crawler cannot access the job posting pages, because there is no actual link to it. I know it is an old rule, but it is still true: Crawl > Index > Rank.

          BTW. I don't know why you would block your pagination. There are other optimal implementations.

          And there is always the scenario, that was already described by Matt. But I believe in that case you would have at least some of the pages indexed even if they are not going to get ranked well.

          Also, make sure other technical implementations are not stopping your job posting pages from being indexed.

          1 Reply Last reply Reply Quote 1
          • Dezzign
            Dezzign last edited by

            I don't think it should be blocked by robots.txt at all. It's stopping Google from crawling the site fully. And they may even treat it negatively as they've been really clamping down on blocking folders with robots.txt lately. I've seen sites with warning in search console for: Disallow: /wp-admin

            You may want to consider just using a noindex tag on those pages instead. And then also use a canonical tag that points back to the main job category page. That way Google can crawl the pages and perhaps pass all the juice back to the main job category page via the canonical. Then just make sure those junk job pages aren't in the sitemap either.

            Keszi 1 Reply Last reply Reply Quote 1
            • Keszi
              Keszi @Dezzign last edited by

              Sorry Richard, but using noindex with canonical link is not quite a good practice.

              It's an old entry, but still true: https://www.seroundtable.com/noindex-canonical-google-18274.html

              Dezzign 1 Reply Last reply Reply Quote 1
              • Dezzign
                Dezzign @Keszi last edited by

                Ah yes when it's pointed out like that, it's a conflicting signal isn't It. Makes sense in theory, but if you're setting it to noindex and then passing that on via a canonical it's probably not the best is it.

                They're was link out in that thread to a discussion of people who still do that with success, but after reading that I would just use noindex only as you said. (Still prefer the no index on the robots block though)

                Keszi 1 Reply Last reply Reply Quote 1
                • Keszi
                  Keszi @Dezzign last edited by

                  The idea is (which we both highlighted), that blocking your listing page from robots.txt is wrong, for pagination you have several methods to deal with (how you deal with it, it really depends on the technical possibilities that you have on the project).

                  Regarding James' original question, my feeling is, that he is somehow blocking their posting pages. Cutting the access to these pages makes it really hard for Google, or any other search engine to index it. But without a URL in front of us, we cannot really answer his question, we can only create theories that he can test 🙂

                  JamesHancocks1 1 Reply Last reply Reply Quote 1
                  • JamesHancocks1
                    JamesHancocks1 @Keszi last edited by

                    Hi Istvan,

                    Sorry I've been away for a while. Thanks for all of your advice guys.

                    Here is the url if that helps?

                    https://www.pkeducation.co.uk/jobs/

                    Cheers,

                    James

                    Keszi 1 Reply Last reply Reply Quote 0
                    • Keszi
                      Keszi @JamesHancocks1 last edited by

                      Hi James,

                      So far as I can see you have the following architecture:

                      • job posting: https://www.pkeducation.co.uk/job/post-name/
                      • jobs listing page: https://www.pkeducation.co.uk/jobs/

                      Since from the robots.txt the listing page pagination is blocked, the crawler can access only the first 15 job postings are available to crawl via a normal crawl.

                      I would say, you should remove the blocking from the robots.txt and focus on implementing a correct pagination. *which method you choose is your decision, but allow the crawler to access all of your job posts. Check https://yoast.com/pagination-seo-best-practices/

                      Another thing I would change is to make the job post title an anchor text for the job posting. (every single job is linked with "Find out more").

                      Also if possible, create a separate sitemap.xml for your job posts and submit it in Search Console, this way you can keep track of any anomaly with indexation.

                      Last, and not least, focus on the quality of your content (just as Matt proposed in the first answer).

                      Good luck!

                      1 Reply Last reply Reply Quote 1
                      • 1 / 1
                      • First post
                        Last post
                      • What does Disallow: /french-wines/?* actually do - robots.txt
                        LoganRay
                        LoganRay
                        0
                        8
                        558

                      • Redirect wordpress from /%post_id%/%postname%/ to /blog/%postname%/
                        Taiger
                        Taiger
                        0
                        3
                        446

                      • Links / Metadata around Recent Posts etc in Wordpress / Blog - Good SEO Practice?
                        Mark_Ginsberg
                        Mark_Ginsberg
                        0
                        2
                        97

                      • Does anyone know how to appear with snippet that says something like: Jobs 1-10 of 80 in the beginning of the description on Google? e.g. like on: https://www.google.co.za/#q=pickers+and+packers
                        Everett
                        Everett
                        0
                        4
                        151

                      • How to remove "/magento/" and "/index.php/" showing in internal links and dup pages in GWT
                        GarGar
                        GarGar
                        0
                        6
                        6.4k

                      • How Long Does it Take for Rel Canonical to De-Index / Re-Index a Page?
                        Travis-W
                        Travis-W
                        0
                        3
                        906

                      • Indexed non existent pages, problem appeared after we 301d the url/index to the url.
                        ThompsonPaul
                        ThompsonPaul
                        0
                        4
                        331

                      • Sitemaps / Google Indexing / Submitted
                        Copstead
                        Copstead
                        0
                        3
                        360

                      Get started with Moz Pro!

                      Unlock the power of advanced SEO tools and data-driven insights.

                      Start my free trial
                      Products
                      • Moz Pro
                      • Moz Local
                      • Moz API
                      • Moz Data
                      • STAT
                      • Product Updates
                      Moz Solutions
                      • SMB Solutions
                      • Agency Solutions
                      • Enterprise Solutions
                      • Digital Marketers
                      Free SEO Tools
                      • Domain Authority Checker
                      • Link Explorer
                      • Keyword Explorer
                      • Competitive Research
                      • Brand Authority Checker
                      • Local Citation Checker
                      • MozBar Extension
                      • MozCast
                      Resources
                      • Blog
                      • SEO Learning Center
                      • Help Hub
                      • Beginner's Guide to SEO
                      • How-to Guides
                      • Moz Academy
                      • API Docs
                      About Moz
                      • About
                      • Team
                      • Careers
                      • Contact
                      Why Moz
                      • Case Studies
                      • Testimonials
                      Get Involved
                      • Become an Affiliate
                      • MozCon
                      • Webinars
                      • Practical Marketer Series
                      • MozPod
                      Connect with us

                      Contact the Help team

                      Join our newsletter
                      Moz logo
                      © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
                      • Accessibility
                      • Terms of Use
                      • Privacy