The Moz Q&A Forum

    Stop google indexing CDN pages

    • loopyal

      Just when I thought I'd seen it all, Google hits me with another nasty surprise!

      I have a CDN to deliver images, JS and CSS to visitors around the world. As far as I can tell, I have no links to static HTML pages on the site, but someone else may have, perhaps a scraper site?

      Google has decided the static pages they were able to access through the CDN have more value than my real pages, and they seem to be slowly replacing my pages in the index with the static pages.

      Anyone got an idea on how to stop that?

      Obviously, I have no access to the static area, because it is in the CDN, so there is no way I know of that I can have a robots file there.
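With an origin-pull CDN, the CDN's robots.txt is normally just a copy of whatever the origin returns, so one possible workaround is to have the origin answer differently depending on which host name was requested. The following is only a sketch under assumptions: it supposes the CDN passes its own host name through in the Host header (many CDNs make this configurable, but check yours), and the host names and asset directories are hypothetical placeholders.

```python
# Hypothetical host names; substitute your own origin and CDN host names.
ORIGIN_HOST = "www.example.com"
CDN_HOST = "cdn.example.com"

# Rules for the real site: crawl everything.
ALLOW_ALL = "User-agent: *\nDisallow:\n"

# Rules for the CDN host: allow only the asset directories (hypothetical
# paths) so crawlers can still fetch images, CSS and JS, but not HTML pages.
CDN_RULES = (
    "User-agent: *\n"
    "Allow: /images/\n"
    "Allow: /css/\n"
    "Allow: /js/\n"
    "Disallow: /\n"
)


def robots_txt_for_host(host_header: str) -> str:
    """Return the robots.txt body to serve for the requested host name."""
    # Drop an optional port and normalise case before comparing.
    host = host_header.split(":")[0].strip().lower()
    if host == CDN_HOST:
        return CDN_RULES
    return ALLOW_ALL
```

The function would be called from whatever handler serves /robots.txt on the origin. The CDN may cache the old robots.txt for a while, so a purge of that one file is probably needed after the change.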

      It could be that I have to trash the CDN and change it to only allow the image directory, and maybe set up a separate CDN subdomain for content that only contains the JS and CSS?

      Have you seen this problem and beaten it?

      (Of course the next thing is Roger might look at Google results and start crawling them too, LOL)

      P.S. I am not asking this in the Google forums because others have asked the same question there many times over the past 5 months. Nobody at Google has bothered to answer, and nobody who did try gave an answer that was remotely useful. So I'm not really hopeful of anyone here having a solution either, but I expect this is my best bet because you guys are always willing to try.

      • edwardlewis

        It sounds like you have set up your CDN slightly wrong.

        After setting up a few like you have, I realised that I was actually making a complete duplicate of the site rather than just the images or assets.

        I imagine you have your origin directory for the CDN in the public html folder.

        Create a subdomain and set that as the origin.

        E.g. I'm working on this site at the moment: http://looksfishy.co.uk/

        I have a subdomain called assets: http://assets.looksfishy.co.uk/

        The CDN content: http://cdn.looksfishy.co.uk/

        Files uploaded here:

        http://assets.looksfishy.co.uk/species/holder/pike.jpg

        Displayed here:

        http://cdn.looksfishy.co.uk/species/holder/pike.jpg

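        Check the IP address on each of them: the assets subdomain should resolve to the origin server, and the cdn subdomain to the CDN.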
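One way to do that check from a script is sketched below. It uses only the standard library; the `resolver` parameter is an addition for illustration so the lookup can be swapped out, and the result you get for the two host names depends on live DNS, so no expected output is shown.

```python
import socket
from typing import Callable, Dict, Iterable


def resolve_hosts(
    hosts: Iterable[str],
    resolver: Callable[[str], str] = socket.gethostbyname,
) -> Dict[str, str]:
    """Map each host name to its IPv4 address, or to an error note."""
    results: Dict[str, str] = {}
    for host in hosts:
        try:
            results[host] = resolver(host)
        except OSError as exc:  # socket.gaierror is a subclass of OSError
            results[host] = f"unresolved ({exc})"
    return results


def served_from_same_ip(results: Dict[str, str], a: str, b: str) -> bool:
    """True when both host names resolved to the same address."""
    return results[a] == results[b]


# Example call (performs real DNS lookups, so results will vary):
# ips = resolve_hosts(["assets.looksfishy.co.uk", "cdn.looksfishy.co.uk"])
# print(ips, served_from_same_ip(ips, *ips))
```

If the two names come back with the same address, the cdn host name is probably not pointing at the CDN at all.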

        It does make uploading images by FTP a bit of a faff, but it does make your site better.

        • loopyal

          Thank you Edward.

          I don't have quite that problem, but I think you are right too.

          My CDN is set up as Origin Pull.

          That means there is no need to FTP: the system just fetches content from the origin as it is requested. You should check that out if you have to FTP everything.

          But what you said that helped me is this: I should have had one CNAME for images and another CNAME for content, with the content CNAME limited to a folder called content. I can put the CSS and JS files in that folder, and that way the plain HTML pages at the root level will never be served through the CDN.
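The split described above could look roughly like this on the page-generation side. This is a sketch only: the two CDN host names and the list of image extensions are hypothetical, and the real restriction to the content folder has to be set in the CDN's own configuration, not just in the URLs the site emits.

```python
from posixpath import splitext

# Hypothetical CNAMEs; one pulls images, the other pulls only /content/.
IMAGE_CDN = "https://images.example.com"
CONTENT_CDN = "https://content.example.com"

IMAGE_EXTS = {".jpg", ".jpeg", ".png", ".gif"}
CONTENT_EXTS = {".css", ".js"}


def cdn_url(path: str) -> str:
    """Rewrite a site-relative path to a CDN URL where one applies."""
    ext = splitext(path)[1].lower()
    if ext in IMAGE_EXTS:
        return IMAGE_CDN + path
    if path.startswith("/content/") and ext in CONTENT_EXTS:
        return CONTENT_CDN + path
    # HTML pages and anything else stay on the main site.
    return path
```

With this arrangement no HTML path is ever handed out under a CDN host name, which is the point of keeping CSS and JS inside their own folder.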

          I also realized, while checking the system, that I wasn't using a canonical tag in the intermediate pages, as I was in the story pages. So I just added code to add canonical tags for all the intermediate pages and the front page.
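A canonical tag of that kind can be generated from the requested URL, always pointing at the main host name whichever host served the page. The sketch below assumes a hypothetical canonical origin and assumes that query strings should be dropped from the canonical URL, which may not suit every page type.

```python
from html import escape
from urllib.parse import urlsplit, urlunsplit

# Hypothetical main site address; replace with the real one.
CANONICAL_ORIGIN = "https://www.example.com"


def canonical_link(request_url: str) -> str:
    """Build a <link rel="canonical"> tag pointing at the main site."""
    requested = urlsplit(request_url)
    origin = urlsplit(CANONICAL_ORIGIN)
    # Keep only the path; the scheme and host come from the canonical origin.
    # Dropping the query string and fragment is an assumption of this sketch.
    canonical = urlunsplit(
        (origin.scheme, origin.netloc, requested.path or "/", "", "")
    )
    return f'<link rel="canonical" href="{escape(canonical, quote=True)}">'
```

Because the host is fixed, a copy of the page fetched through the CDN host name carries a tag that points back to the real page.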

          I do have a few other types of pages, so I will handle the code for them next.

          I think adding the canonical tag might fix the problem, but I will also work on reconfiguring the CDN and switch over when the site is not too busy, in case the change takes a while to propagate.
