The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Intermediate & Advanced SEO
    4. Google Indexing Of Pages As HTTPS vs HTTP

    Google Indexing Of Pages As HTTPS vs HTTP

    Intermediate & Advanced SEO
    13 4 1.4k
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • GastonRiera
      GastonRiera last edited by

      Hello vikasnwu,

      As what you are looking for is to remove from index the pages, follow this steps:

      1. Allow the whole website to be crawable in the robots.txt
      2. add the robots meta tag with "noindex,follow" parametres
      3. wait several weeks, 6 to 8 weeks is a fairly good time. Or just do a followup on those pages
      4. when you got the results (all your desired pages to be de-indexed) re-block with robots.txt those pages
      5. DO NOT erase the meta robots tag.

      Remember that http://site.com andhttps://site.com are different websites to google.
      When your client's website is fixed with https, follow these steps:

      1. Allow the whole website (or the parts wanted to be indexed) to be crawable in robots.txt
      2. Remove the robots meta tag
      3. Redirect 301 http to https
      4. Sit and wait.

      Information about the redirection to HTTPS and a cool checklist:
      The Big List of SEO Tips and Tricks for Using HTTPS on Your Website - Moz Blog
      The HTTP to HTTPs Migration Checklist in Google Docs to Share, Copy & Download - AleydaSolis
      Google SEO HTTPS Migration Checklist - SERoundtable

      Hope I'm helpful.
      Best luck.
      GR.

      vikasnwu ThompsonPaul AgenciaSEO.eu 3 Replies Last reply Reply Quote 0
      • vikasnwu
        vikasnwu @GastonRiera last edited by

        GR - thanks for the response.

        Given our site is just 65 pages, would it make sense to just put all of the site's "https" URLs in the robots.txt file as "noindex" now rather than waiting for all the pages to get indexed as "https" and then remove them?

        And then upload a sitemap to webmaster tools with the URLS as "http://"?

        VW

        GastonRiera 1 Reply Last reply Reply Quote 0
        • GastonRiera
          GastonRiera @vikasnwu last edited by

          Noup, Robots.txt works on a website level. This means that there has to be a file for the http and another for the https website.
          And, there is no need for waiting until the whole site is indexed.

          Just to clarify, robots.txt itself does not remove pages already indexed. It just blocks bots from crawling a website and/or specific pages with in it.

          1 Reply Last reply Reply Quote 0
          • ThompsonPaul
            ThompsonPaul last edited by

            There's a very simple solution to this issue - and no, you absolutely do NOT want to artificially force removal of those HTTPS pages from the index.

            You need to make sure the SSL certificate is still in place, then re-add the 301-redirect in the site's htaccess file, but this time redirecting all HTTPS URLs back their HTTP equivalents.

            You don't want to forcibly "remove" those URLs from the SERPs, because they are what Google now understands to be the correct pages. If you remove them, you'll have to wait however long it takes for Google and other search engines to completely re-understand the conflicting signals you've sent them about your site. And traffic will inevitably suffer in that process. Instead, you need to provide standard directives that the search engines don't have to interpret and can't ignore. Once the search engines have seen the new redirects for long enough, they'll start reverting the SERP listings back to the HTTP URLs naturally.

            The key here is the SSL cert must stay in place. As it stands now, a visitor clicking a page in the search engine is trying to make an HTTPS connection to your site. If there is no certificate in place, they will get the harmful security warning. BUT! You can't just put in a 301-redirect in that case. The reason for this is that the initial connection from the SERP is coming in over the "secure channel". That connection must be negotiated securely first, before the redirect can even be read. If that first connection isn't secure, the browser will return the security warning without ever trying to read the redirect.

            Having the SSL cert in place even though you're not running all pages under HTTPS means that first connection can still be made securely, then the redirect can be read back to the HTTP URL, and the visitor will get to the page they expect in a seamless manner. And search engines will be able to understand and apply authority without misunderstandings/confusion.

            Hope that all makes sense?

            Paul

            1 Reply Last reply Reply Quote 3
            • ThompsonPaul
              ThompsonPaul @GastonRiera last edited by

              You can't noindex a URL by protocol, Gaston - adding no-index would eliminate the page from being returned as a search result regardless of whether HTTP or HTTPS, essentially making those important pages invisible and wasting whatever link equity they may have. (You also can't block in robots.txt by protocol either, in my experience.)

              GastonRiera 1 Reply Last reply Reply Quote 0
              • vikasnwu
                vikasnwu last edited by

                Paul that does make sense - i'll add the SSL certificate back, and then redirect from https to http via the htaccess file.

                ThompsonPaul 1 Reply Last reply Reply Quote 0
                • ThompsonPaul
                  ThompsonPaul @vikasnwu last edited by

                  Great! I'd really like to hear how it goes when you get the switch back in.

                  P.

                  vikasnwu 1 Reply Last reply Reply Quote 0
                  • vikasnwu
                    vikasnwu @ThompsonPaul last edited by

                    i'm first going to see what happens if I just upload a sitemap with http URLs since there wasn't a sitemap in webmaster tools from before.  Will give you the update then.

                    AgenciaSEO.eu ThompsonPaul 2 Replies Last reply Reply Quote 0
                    • GastonRiera
                      GastonRiera @ThompsonPaul last edited by

                      Paul, 
                      I just provided the solution to de-index the https version. I understood that what's wanted, as they need their client to fix their end.

                      And of course that there is no way to noindex by protocol. I do agree what you are saying.

                      Thanks a lot for explaining further and prividing other ways to help solvinf the issue, im inspired by used like you to help others and make a great community.

                      GR.

                      1 Reply Last reply Reply Quote 0
                      • AgenciaSEO.eu
                        AgenciaSEO.eu @GastonRiera last edited by

                        Nice answer!

                        But you forgot to mention:

                        1. Updating the sitemap files with the good URLs
                        2. Upload them to Google Search Console
                        3. You can even force the indexing at Google Search Console

                        Thanks,

                        Roberto

                        1 Reply Last reply Reply Quote 0
                        • AgenciaSEO.eu
                          AgenciaSEO.eu @vikasnwu last edited by

                          Remember you can force indexing using Google Search Console 😉

                          1 Reply Last reply Reply Quote 0
                          • ThompsonPaul
                            ThompsonPaul @vikasnwu last edited by

                            That's not going to solve your problem, vikasnwu. Your immediate issue is that you have URLs in the index that are HTTPS and will cause searchers who click on them not to reach your site due to the security error warnings. The only way to fix that quickly is to get the SSL certificate and redirect to HTTP in place.

                            You've sent the search engines a number of very conflicting signals. Waiting while they try to work out what URLs they're supposed to use and then waiting while they reindex them is likely to cause significant traffic issues and ongoing ranking harm before the SEs figure it out for themselves. The whole point of what I recommended is it doesn't depend on the SEs figuring anything out - you will have provided directives that force them to do what you need.

                            Paul

                            1 Reply Last reply Reply Quote 1
                            • 1 / 1
                            • First post
                              Last post
                            • How will canonicalizing an https page affect the SERP-ranked http version of that page?
                              BlueprintMarketing
                              BlueprintMarketing
                              0
                              4
                              85

                            • HTTP Pages Indexed as HTTPS
                              friendoffood
                              friendoffood
                              0
                              5
                              95

                            • What is the tool to check if a page (ex. a dynamic page) will properly be indexed by Google?
                              ThompsonPaul
                              ThompsonPaul
                              0
                              4
                              78

                            • Does Google still don't index Hashtag Links ? No chance to get a Search Result that leads directly to a section of a page? or to one of numeras Hashtag Pages in a single HTML page?
                              Muhammad_Jabali
                              Muhammad_Jabali
                              0
                              3
                              748

                            • Incorrect cached page indexing in Google while correct page indexes intermittently
                              MikeRoberts
                              MikeRoberts
                              0
                              2
                              298

                            • I have removed over 2000+ pages but Google still says i have 3000+ pages indexed
                              CleverPhD
                              CleverPhD
                              0
                              6
                              148

                            • Why is Google displaying inside pages for our sites rather than the index pages?
                              aloley
                              aloley
                              0
                              7
                              430

                            • Push for site-wide https, but all pages in index are http. Should I fight the tide?
                              G-Force
                              G-Force
                              0
                              3
                              1.2k

                            Get started with Moz Pro!

                            Unlock the power of advanced SEO tools and data-driven insights.

                            Start my free trial
                            Products
                            • Moz Pro
                            • Moz Local
                            • Moz API
                            • Moz Data
                            • STAT
                            • Product Updates
                            Moz Solutions
                            • SMB Solutions
                            • Agency Solutions
                            • Enterprise Solutions
                            • Digital Marketers
                            Free SEO Tools
                            • Domain Authority Checker
                            • Link Explorer
                            • Keyword Explorer
                            • Competitive Research
                            • Brand Authority Checker
                            • Local Citation Checker
                            • MozBar Extension
                            • MozCast
                            Resources
                            • Blog
                            • SEO Learning Center
                            • Help Hub
                            • Beginner's Guide to SEO
                            • How-to Guides
                            • Moz Academy
                            • API Docs
                            About Moz
                            • About
                            • Team
                            • Careers
                            • Contact
                            Why Moz
                            • Case Studies
                            • Testimonials
                            Get Involved
                            • Become an Affiliate
                            • MozCon
                            • Webinars
                            • Practical Marketer Series
                            • MozPod
                            Connect with us

                            Contact the Help team

                            Join our newsletter
                            Moz logo
                            © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
                            • Accessibility
                            • Terms of Use
                            • Privacy