The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Technical SEO Issues
    4. No Index PDFs

    No Index PDFs

    Technical SEO Issues
    5 4 6.9k
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • MonicaOConnor
      MonicaOConnor last edited by

      Our products have about 4 PDFs a piece, which really inflates our indexed pages. I was wondering if I could add REL=No Index to the PDF's URL? All of the files are on a file server, so they are embedded with links on our product pages. I know I could add a No Follow attribute, but I was wondering if any one knew if the No Index would work the same or if that is even possible. Thanks!

      1 Reply Last reply Reply Quote 0
      • DirkC
        DirkC last edited by

        If the pdf's are in a separate folder on your site - you could mark that folder as noindex in robots.txt

        As far as I know, it's not possible to add a noindex to a link.

        rgds

        Dirk

        1 Reply Last reply Reply Quote 1
        • OlegKorneitchouk
          OlegKorneitchouk last edited by

          1. If you want to deindex all PDF files, I recommend using the x-robots-tag in .htaccess - https://yoast.com/x-robots-tag-play/

          2. If the PDFs are pdf versions of existing pages, I would set canonicals to point to the URL you do want indexed (#2 on http://moz.com/blog/htaccess-file-snippets-for-seos )

          1 Reply Last reply Reply Quote 1
          • Alick300
            Alick300 last edited by

            Hi Monica,

            I presume you already check all the options before posting this question. I have concluded this by seeing your others posts/reply in this community. 🙂

            Now here is my answer

            To prevent your PDF file (or any non HTML file) from being listed in search results, the only way is to use the HTTP X-Robots-Tag response header, e.g.:

            X-Robots-Tag: noindex

            robots.txt does not prevent your page from being listed in search results.

            What it does is stop the bot from crawling your page, but if a third party links to your PDF file from their website, your page will still be listed.

            If you stop the bot from crawling your page using robots.txt, it will not have the chance to see the X-Robots-Tag: noindex response tag. Therefore, never ever ever disallow a page in robots.txt if you employ the X-Robots-Tag header.

            I hope it helps but not very sure. 🙂

            Thanks

            MonicaOConnor 1 Reply Last reply Reply Quote 1
            • MonicaOConnor
              MonicaOConnor @Alick300 last edited by

              The files aren't duplicate. I am familiar with using the XRobots tag. I was really just curious if my theory would work.

              Thanks for all your input.

              1 Reply Last reply Reply Quote 0
              • 1 / 1
              • First post
                Last post
              • URLs dropping from index (Crawled, currently not indexed)
                DmitriiK
                DmitriiK
                0
                2
                38

              • Anything new if determining how many of a sites pages are in Google's supplemental index vs the main index?
                SEMPassion
                SEMPassion
                0
                4
                390

              • Google Webmaster tools Sitemap submitted vs indexed vs Index Status
                K-WINTER
                K-WINTER
                0
                5
                20.5k

              • Removing a staging area/dev area thats been indexed via GWT (since wasnt hidden) from the index
                Kingof5
                Kingof5
                0
                5
                124

              • Best way to handle indexed pages you don't want indexed
                NakulGoyal
                NakulGoyal
                0
                11
                786

              • Duplicate content issue index.html vs non index.html
                KaneJamison
                KaneJamison
                0
                10
                1.7k

              • Ensuring Assets (PDFs, PowerPoint Files, Word Docs, etc.) are Indexable on Site
                ChrisDyson
                ChrisDyson
                0
                4
                464

              • Https indexed - though a no index no follow tag has been added
                Theo-NL
                Theo-NL
                0
                3
                1.3k

              Get started with Moz Pro!

              Unlock the power of advanced SEO tools and data-driven insights.

              Start my free trial
              Products
              • Moz Pro
              • Moz Local
              • Moz API
              • Moz Data
              • STAT
              • Product Updates
              Moz Solutions
              • SMB Solutions
              • Agency Solutions
              • Enterprise Solutions
              • Digital Marketers
              Free SEO Tools
              • Domain Authority Checker
              • Link Explorer
              • Keyword Explorer
              • Competitive Research
              • Brand Authority Checker
              • Local Citation Checker
              • MozBar Extension
              • MozCast
              Resources
              • Blog
              • SEO Learning Center
              • Help Hub
              • Beginner's Guide to SEO
              • How-to Guides
              • Moz Academy
              • API Docs
              About Moz
              • About
              • Team
              • Careers
              • Contact
              Why Moz
              • Case Studies
              • Testimonials
              Get Involved
              • Become an Affiliate
              • MozCon
              • Webinars
              • Practical Marketer Series
              • MozPod
              Connect with us

              Contact the Help team

              Join our newsletter
              Moz logo
              © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
              • Accessibility
              • Terms of Use
              • Privacy