The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Technical SEO Issues
    4. Dynamically-generated .PDF files, instead of normal pages, indexed by and ranking in Google

    Dynamically-generated .PDF files, instead of normal pages, indexed by and ranking in Google

    Technical SEO Issues
    3 2 1.0k
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • fugu
      fugu last edited by

      Hi,

      I come across a tough problem. I am working on an online-store website which contains the functionlaity of viewing products details in .PDF format (by the way, the website is built on Joomla CMS), now when I search my site's name in Google, the SERP simply displays my .PDF files in the first couple positions (shown in normal .PDF files format: [PDF]...)and I cannot find the normal pages there on SERP #1 unless I search the full site domain in Google. I really don't want this! Would you please tell me how to figure the problem out and solve it. I can actually remove the corresponding component (Virtuemart) that are in charge of generating the .PDF files. Now I am trying to redirect all the .PDF pages ranking in Google to a 404 page and remove the functionality, I plan to regenerate a sitemap of my site and submit it to Google, will it be working for me? I really appreciate that if you could help solve this problem. Thanks very much.

      Sincerely

      SEOmoz Pro Member

      1 Reply Last reply Reply Quote 0
      • TheEspresseo
        TheEspresseo last edited by

        I would consider either excluding the PDFs from the index with your robots.txt in conjunction with resubmitting your sitemap (which you're all over), or placing a text link at the bottom of each PDF pointing back to the HTML version of that page (which, all things being equal, should cause the HTML version of the page to rank instead). I am not sure about serving 404 headers to Google instead of the PDFs that are currently in the index. Why not 301 to the HTML version of each PDF? Obviously that can't be a permanent solution, as you will eventually want to restore the functionality to users, right? But it will tell Googlebot that the content of each PDF is to be found from here on out at the URL containing the HTML version. This is a case where it would be handy to serve one thing to the bots and another to the human viewers, but I am afraid that doing so could get you into trouble.

        I am interested in your case though—let us know what, if anything besides the 404s and sitemap resubmittal, you end up trying and what happens with it. I'm also curious to know what other mozzers suggest.

        1 Reply Last reply Reply Quote 0
        • TheEspresseo
          TheEspresseo last edited by

          Recently discovered this:

          Indicate the canonical version of a URL by responding with the Link rel="canonical" HTTP header. Addingrel="canonical" to the head section of a page is useful for HTML content, but it can't be used for PDFs and other file types indexed by Google Web Search. In these cases you can indicate a canonical URL by responding with the Link rel="canonical" HTTP header, like this (note that to use this option, you'll need to be able to configure your server).

          Link: <http: www.example.com="" downloads="" white-paper.pdf="">; rel="canonical"</http:>

          Google currently supports these link header elements for Web Search only.

          -http://support.google.com/webmasters/bin/answer.py?hl=en&answer=139394

          1 Reply Last reply Reply Quote 0
          • 1 / 1
          • First post
            Last post
          • Over 40+ pages have been removed from the indexed and this page has been selected as the google preferred canonical.
            willcritchlow
            willcritchlow
            0
            4
            69

          • Home Page Ranking Instead of Service Pages
            ChrisAshton
            ChrisAshton
            0
            7
            1.0k

          • Redesigned and Migrated Website - Lost Almost All Organic Traffic - Mobile Pages Indexing over Normal Pages
            DirkC
            DirkC
            0
            4
            245

          • Getting Google to index a large PDF file
            OlegKorneitchouk
            OlegKorneitchouk
            0
            2
            141

          • Why Google ranks a page with Meta Robots: NO INDEX, NO FOLLOW?
            BruceA
            BruceA
            0
            4
            651

          • Page disappeared from Google index. Google cache shows page is being redirected.
            shop.nordstrom
            shop.nordstrom
            0
            5
            761

          • Which carries more weight Google page rank or Alexa Rank?
            sherohass
            sherohass
            0
            3
            397

          • Pages not indexed by Google
            loopyal
            loopyal
            0
            6
            1.8k

          Get started with Moz Pro!

          Unlock the power of advanced SEO tools and data-driven insights.

          Start my free trial
          Products
          • Moz Pro
          • Moz Local
          • Moz API
          • Moz Data
          • STAT
          • Product Updates
          Moz Solutions
          • SMB Solutions
          • Agency Solutions
          • Enterprise Solutions
          • Digital Marketers
          Free SEO Tools
          • Domain Authority Checker
          • Link Explorer
          • Keyword Explorer
          • Competitive Research
          • Brand Authority Checker
          • Local Citation Checker
          • MozBar Extension
          • MozCast
          Resources
          • Blog
          • SEO Learning Center
          • Help Hub
          • Beginner's Guide to SEO
          • How-to Guides
          • Moz Academy
          • API Docs
          About Moz
          • About
          • Team
          • Careers
          • Contact
          Why Moz
          • Case Studies
          • Testimonials
          Get Involved
          • Become an Affiliate
          • MozCon
          • Webinars
          • Practical Marketer Series
          • MozPod
          Connect with us

          Contact the Help team

          Join our newsletter
          Moz logo
          © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
          • Accessibility
          • Terms of Use
          • Privacy