The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Technical SEO Issues
    4. Using the Google Remove URL Tool to remove https pages

    Using the Google Remove URL Tool to remove https pages

    Technical SEO Issues
    3 2 5.9k
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • sparrowdog
      sparrowdog last edited by

      I have found a way to get a list of 'some' of my 180,000+ garbage URLs now, and I'm going through the tedious task of using the URL removal tool to put them in one at a time. Between that and my robots.txt file and the URL Parameters, I'm hoping to see some change each week.

      I have noticed when I put URL's starting with https:// in to the removal tool, it adds the http:// main URL at the front.

      For example, I add to the removal tool:-

      https://www.mydomain.com/blah.html?search_garbage_url_addition

      On the confirmation page, the URL actually shows as:-

      http://www.mydomain.com/https://www.mydomain.com/blah.html?search_garbage_url_addition

      I don't want to accidentally remove my main URL or cause problems. Is this the right way this should look?

      AND PART 2 OF MY QUESTION

      If you see the search description in Google for a page you want removed that says the following in the SERP results, should I still go to the trouble of putting in the removal request?

      www.domain.com/url.html?xsearch_...

      A description for this result is not available because of this site's robots.txt – learn more.

      1 Reply Last reply Reply Quote 1
      • TomRayner
        TomRayner last edited by

        Hi there

        I'll start with question 2 first as it's a bit easier to answer.  Robots.txt blocks the crawling of a page, but not necessarily indexing.  Of course, if the page cannot be crawled it will be deindexed eventually anyway, but if you're getting that description for one of your URLs, Google has not been able to access it and will stop trying to.  So that is usually enough, although if you want to remove it as well, you can by all means.

        For question 1 - GWT is a bit awkward in the sense that it treats http and https versions of your site as different webmaster properties.  Furthermore, if you want to remove a URL on your site, it will always prefix it with the http/https version of your site, no matter how you enter it.

        If you added another WMT property that was https://www.yourdomain.com - you would be able to manage that domain as well and thus you would be able to remove any URLs under that prefix.

        Incidentally, if you want to block all HTTPS pages from being accessed, you can do that with a special instruction in your htaccess file and robots txt.  You can instruct the Googlebot and other bots to read a specific robots.txt file if they visit an HTTPS URL.  To do that, you would first add this to your htaccess file:

        RewriteCond %{HTTPS} ^on$
        RewriteCond %{REQUEST_URI} ^/robots.txt$
        RewriteRule ^(.*)$ /robots_ssl.txt [L]

        This command basically says "if the URL has https, read the robots_ssl.txt file".  You then upload a file called robots_ssl.txt to your root domain.  In the txt file you just add:

        User-agent: *
        Disallow: /

        So now, if a bot reaches an https URL, it has to read the robots_ssl.txt file and upon reading that, they are denied access.  That would prevent all of your https URLs from being indexed.

        That might be useful to you, but if you go ahead and use it please take care to backup all your files in case anything goes wrong - your htaccess file is very important!

        sparrowdog 1 Reply Last reply Reply Quote 1
        • sparrowdog
          sparrowdog @TomRayner last edited by

          Thanks so much for taking the time to respond.

          I think I will add the https to WMT and remove them that way.

          I will take a look through the .htaccess file and the creation of the ssl robots file. A while back, it seemed that Google was indexing a lot of my site as https and then the dropped it and went mainly back to http. I will get that sorted to make it clear.

          1 Reply Last reply Reply Quote 0
          • 1 / 1
          • First post
            Last post
          • Over 40+ pages have been removed from the indexed and this page has been selected as the google preferred canonical.
            willcritchlow
            willcritchlow
            0
            4
            69

          • After you remove a 301 redirect that Google has processed, will the new URL retain any of the link equity from the old URL?
            johnwalkersmith
            johnwalkersmith
            0
            3
            61

          • Use existing page with bad URL or brand new URL?
            XLMarketing
            XLMarketing
            0
            5
            47

          • Google Appending Blog URL inbetween my homepage and product page is it issue with base url?
            amu123
            amu123
            0
            11
            68

          • Why google does not remove my page?
            Joe_Stoffel
            Joe_Stoffel
            0
            3
            61

          • Should you use google url remover if older indexed pages are still being kept?
            Deacyde
            Deacyde
            0
            5
            96

          • Google webmaster tool doestn allow me to send 'URL and all linked pages"
            RobertFisher
            RobertFisher
            0
            2
            219

          • Our Development team is planning to make our website nearly 100% AJAX and JavaScript. My concern is crawlability or lack thereof. Their contention is that Google can read the pages using the new #! URL string. What do you recommend?
            john4math
            john4math
            0
            2
            651

          Get started with Moz Pro!

          Unlock the power of advanced SEO tools and data-driven insights.

          Start my free trial
          Products
          • Moz Pro
          • Moz Local
          • Moz API
          • Moz Data
          • STAT
          • Product Updates
          Moz Solutions
          • SMB Solutions
          • Agency Solutions
          • Enterprise Solutions
          • Digital Marketers
          Free SEO Tools
          • Domain Authority Checker
          • Link Explorer
          • Keyword Explorer
          • Competitive Research
          • Brand Authority Checker
          • Local Citation Checker
          • MozBar Extension
          • MozCast
          Resources
          • Blog
          • SEO Learning Center
          • Help Hub
          • Beginner's Guide to SEO
          • How-to Guides
          • Moz Academy
          • API Docs
          About Moz
          • About
          • Team
          • Careers
          • Contact
          Why Moz
          • Case Studies
          • Testimonials
          Get Involved
          • Become an Affiliate
          • MozCon
          • Webinars
          • Practical Marketer Series
          • MozPod
          Connect with us

          Contact the Help team

          Join our newsletter
          Moz logo
          © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
          • Accessibility
          • Terms of Use
          • Privacy