    PDF for link building - avoiding duplicate content

    Intermediate & Advanced SEO
    • DoRM
      DoRM last edited by

      Indicate the canonical version of a URL by responding with the Link rel="canonical" HTTP header. Adding rel="canonical" to the head section of a page is useful for HTML content, but it can't be used for PDFs and other file types indexed by Google Web Search. In these cases you can indicate a canonical URL by responding with the Link rel="canonical" HTTP header, like this (note that to use this option, you'll need to be able to configure your server):

      Link: <http://www.example.com/downloads/white-paper.pdf>; rel="canonical"

      Google currently supports these link header elements for Web Search only.

      You can read more here: http://support.google.com/webmasters/bin/answer.py?hl=en&answer=139394

      • BobGW
        BobGW @DoRM last edited by

        Thank you DoRM,

        I assume that the PDF is what I want to be the main version since that is what I'll be marketing, but I could be wrong? What if I get backlinks to both pages, will both sets of backlinks count?

        • DoRM
          DoRM @BobGW last edited by

          PDFs don't seem to rank as well as normal web pages. They still rank, don't get me wrong: we have over 100 PDFs that get traffic for us. Which version is the main one is really up to you. What do you want to show in the search results? I think it would be easier to rank a normal web page, though. If you use rel="canonical", it will pass most of the link juice, not all but most.

            • BobGW
              BobGW @BobGW last edited by

              How do I know if I can set an HTTP response header? I'm using shared hosting through HostGator.

              • BobGW
                BobGW @BobGW last edited by

                Can I just use htaccess?

                See here: http://www.seomoz.org/blog/how-to-advanced-relcanonical-http-headers

                We only have one pdf like this right now and we plan to have no more than five.

                Say the pdf is located at

                domain.com/pdfs/white-papers.pdf

                and the article that I want to rank is at

                domain.com/articles/article.pdf

                do I simply add this to my htaccess file?:

                Header add Link "<http://www.domain.com/articles/article.pdf>; rel=\"canonical\""

                • ThompsonPaul
                  ThompsonPaul @BobGW last edited by

                  You can insert the canonical header link using your site's .htaccess file, Bob. I'm sure HostGator provides access to the .htaccess file through FTP (sometimes you have to turn on "show hidden files") or through the file manager built into your cPanel.

                  Check tip #2 in this recent SEOMoz blog article for specifics:
                  seomoz.org/blog/htaccess-file-snippets-for-seos
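
As a rough sketch of what such a snippet could look like, the small Python helper below renders an .htaccess fragment that attaches the canonical Link header to a single file only. The function name is made up for illustration, the file name and URL are the placeholder ones from this thread, and it assumes your host has Apache's mod_headers enabled and that the .htaccess file sits in (or above) the folder holding the PDF.

```python
def render_canonical_htaccess(pdf_filename: str, canonical_url: str) -> str:
    """Return an .htaccess fragment that adds a canonical Link header to one file.

    Hypothetical helper for illustration; assumes Apache with mod_headers.
    """
    if '"' in pdf_filename or '"' in canonical_url or ">" in canonical_url:
        raise ValueError("file name and URL must not contain quotes or '>'")
    return (
        # <Files> limits the header to responses for this one file name.
        f'<Files "{pdf_filename}">\n'
        # The inner quotes around canonical are backslash-escaped for Apache.
        f'  Header add Link "<{canonical_url}>; rel=\\"canonical\\""\n'
        f'</Files>\n'
    )


snippet = render_canonical_htaccess(
    "white-papers.pdf", "http://www.domain.com/articles/article.html"
)
print(snippet)
```

Wrapping the directive in a Files block matters because a bare Header line in .htaccess would send the same canonical header with every response served from that folder, not just the PDF.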

                  Just remember too - you will want to do the same kind of on-page optimization for the PDF as you do for regular pages.

                  • Give it a good, descriptive, keyword-appropriate, dash-separated file name. (essential for usability as well, since it will become the title of the icon when saved to someone's desktop)
                  • Fill out the metadata for the PDF, especially the Title and Description. In Acrobat it's under File -> Properties -> Description tab (to get the meta-description itself, you'll need to click on the Additional Metadata button)

                  I'd be tempted to build the links to the HTML page as much as possible, as those will directly help ranking, unlike the PDF's inbound links, which will have to pass their link juice through the canonical, assuming you're using it. Plus, the visitor will get a preview of the PDF's content and context from the rest of your site, which may increase trust and engender further engagement.

                  Your comment about links in the PDF got kind of muddled, but you'll definitely want to make certain there are good links and calls to action back to your website within the PDF - preferably on each page. Otherwise there's no clear "next step" for users reading the PDF back to a purchase on your site. Make sure to put Analytics tracking tags on these links so you can assess the value of traffic generated back from the PDF - otherwise the traffic will just appear as Direct in your Analytics.
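
A minimal sketch of tagging those links, assuming "Analytics tracking tags" means Google Analytics campaign parameters (utm_source, utm_medium, utm_campaign). The function name and the tag values are hypothetical, and the URL is the placeholder one from this thread.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def add_campaign_tags(url, source, medium, campaign):
    """Return url with utm_* campaign parameters appended to its query string."""
    tags = {
        "utm_source": source,
        "utm_medium": medium,
        "utm_campaign": campaign,
    }
    parts = urlsplit(url)
    # Keep existing query parameters, dropping any old copies of the utm_* tags.
    pairs = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key not in tags
    ]
    pairs.extend(tags.items())
    return urlunsplit(parts._replace(query=urlencode(pairs)))


# Hypothetical tag values; pick names that make sense in your own reports.
tagged = add_campaign_tags(
    "http://www.domain.com/articles/article.html",
    source="white-paper",
    medium="pdf",
    campaign="seo-guide",
)
print(tagged)
```

You would then use the tagged URL, rather than the bare one, for each link and call to action inside the PDF.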

                  Hope that all helps;

                  Paul

                  • BobGW
                    BobGW @BobGW last edited by

                    Thanks ThompsonPaul,

                    Say the pdf is located at

                    domain.com/pdfs/white-papers.pdf

                    and the article that I want to rank is at

                    domain.com/articles/article.html

                    do I simply add this to my htaccess file?:

                    Header add Link "<http://www.domain.com/articles/article.html>; rel=\"canonical\""

                    • Marcus_Miller
                      Marcus_Miller last edited by

                      Hey Bob

                      There is a much easier way to do this: simply keep the PDFs that you don't want indexed in a folder that you block in robots.txt. This way you can just drop PDFs into articles and link to them, knowing full well these files will not be indexed.

                      Assuming you had a PDF called article.pdf in a folder called pdfs/ then the following would prevent indexation.

                      User-agent: *
                      Disallow: /pdfs/

                      Or to just block the file itself:

                      User-agent: *
                      Disallow: /pdfs/yourfile.pdf

                      Additionally, there is no reason not to add the canonical link as well. If you find people are linking directly to the PDF, having this would ensure that the equity associated with those links is correctly attributed to the parent page (always a good thing).

                      Header add Link '<http://www.url.co.uk/pdfs/article.html>; rel="canonical"'

                      Generally, there are better ways to block indexation than robots.txt, but PDFs make for such poor landing pages (no navigation) that we really don't want them indexed, and we certainly want to remove any competition or duplication between the page and the PDF. In this case, robots.txt makes for a quick, painless and suitable solution.
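
If you want to sanity-check the rules before uploading them, here is a small sketch using Python's standard-library robots.txt parser. The helper name is made up for illustration and the URLs are the placeholder ones used above.

```python
from urllib.robotparser import RobotFileParser

# The folder-level rule from this post.
ROBOTS_TXT = """\
User-agent: *
Disallow: /pdfs/
"""


def is_crawlable(robots_txt, url, user_agent="*"):
    """Return True if the given robots.txt text lets user_agent fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


print(is_crawlable(ROBOTS_TXT, "http://www.url.co.uk/pdfs/article.pdf"))       # False
print(is_crawlable(ROBOTS_TXT, "http://www.url.co.uk/articles/article.html"))  # True
```

One thing to keep in mind when combining the two approaches: a crawler that obeys the Disallow rule never requests the PDF, so it also never sees a Link header sent with it.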

                      Hope that helps!
                      Marcus

                      • BobGW
                        BobGW @Marcus_Miller last edited by

                        Thanks for the code Marcus.

                        Actually, the PDF is what people will be linking to. It's a guide for websites. I think the PDF will be much easier to promote than the article. I assume so anyway.

                        Is there a way to make sure my canonical code in htaccess is working after I insert the code?

                        Thanks again,

                        Bob

                        • Marcus_Miller
                          Marcus_Miller @BobGW last edited by

                          Yep, check the HTTP headers with webbug or there are a bunch of browser plugins that will let you see the headers for the document.
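
If you'd rather script the check, the sketch below pulls the canonical URL out of a Link header value using only the Python standard library. Both function names are made up for illustration, the parsing is deliberately simple rather than a full implementation of the header grammar, and the sample value is the placeholder URL from earlier in the thread.

```python
import re
import urllib.request


def canonical_from_link_header(link_header):
    """Return the URL whose rel includes "canonical", or None if there is none."""
    # Each link-value looks like: <URL>; param=value; ...
    for url, params in re.findall(r"<([^>]*)>([^<]*)", link_header or ""):
        rel = re.search(r'\brel\s*=\s*"?([^";,]*)"?', params, re.IGNORECASE)
        if rel and "canonical" in rel.group(1).lower().split():
            return url
    return None


def fetch_canonical(url, timeout=10):
    """Send a HEAD request and return the canonical URL from the response headers."""
    request = urllib.request.Request(url, method="HEAD")
    with urllib.request.urlopen(request, timeout=timeout) as response:
        link_values = response.headers.get_all("Link") or []
    return canonical_from_link_header(", ".join(link_values))


# Offline check against the header value you expect the server to send.
sample = '<http://www.domain.com/articles/article.html>; rel="canonical"'
print(canonical_from_link_header(sample))
```

Against the live site you would call fetch_canonical with the PDF's URL and compare the result with the article URL; None means no canonical Link header came back.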

                          That said, I would still push to drive the links to a page rather than to the document itself: just create a nice page that houses the document and make that the link target.

                          You could even make the PDF link available only by email once they have signed up, or some such. Canonical is only a hint, and you would still be better off getting those links flooding into a real page on the site.

                          You could even offer up some HTML that links to your main page, to make linking easier for folks. If you take a look at any infographics from savvy folks, you'll see they try to draw links to a page rather than to the image itself, for the very same reasons.

                          If you look at something like the Noobs Guide to Online Marketing from Unbounce then you will see something like this as the suggested linking code:

                          (infographic image, linked to http://unbounce.com/noob-guide-to-online-marketing-infographic/)

                          [The Noob Guide to Online Marketing - Infographic](http://unbounce.com/noob-guide-to-online-marketing-infographic/)

                          Unbounce – The DIY Landing Page Platform

                          So, the image is there but the link they are pimping is a standard page:

                          http://unbounce.com/noob-guide-to-online-marketing-infographic/

                          They also cheekily add an extra homepage link in as well with some keywords and the brand so if folks don't remove that they still get that benefit.

                          Ultimately, it means that when links flood into the site they benefit the whole site rather than just promote one PDF.

                          Just my tuppence! 
                          Marcus

                          • BobGW
                            BobGW @BobGW last edited by

                            I see, thanks! I think it's important not to have the ecommerce navigation on the page promoting the pdf. What would you say is ideal as far as the graphical and navigation components of the page with the PDF on it - what kind of navigation and graphical header should I have on it?

                            • Marcus_Miller
                              Marcus_Miller @BobGW last edited by

                              Hey Bob

                              I think you should forget about any kind of perceived conventions and have whatever you think works best for your users and goals.

                              Again, look at unbounce, that is a custom landing page with a homepage link (to share the love) but not the general site navigation.

                              They also have a footer to do a bit more link love but really, do what works for you.

                              Forget conventions - do what works!

                              Hope that helps
                              Marcus
