The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Intermediate & Advanced SEO
    4. Lately I have noticed Google indexing many files on the site without the .html extension

    Lately I have noticed Google indexing many files on the site without the .html extension

    Intermediate & Advanced SEO
    4 2 106
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • gheh2013
      gheh2013 last edited by

      Hello,

      Our site, while we convert, remains in HTML 4.0.

      Fle names such as http://www.sample.com/samples/index.shtml are being picked up in the SERPS as http://www.sample.com/samples/ even when I use the "rel="canonical" tag and specify the full file name therein as recommended. The link to the truncated URL (http://www.sample.com/samples/) results in what MOZ shows as fewer incoming links than the full file name is shown as having incoming.

      I am not sure if this is causing a loss in placement (the MOZ stats are showing a decline of late), which I have seen recently (of course, I am aware of other possible reasons, such as not being in HTML5 yet).

      Any help with this would be great.

      Thank you in advance

      1 Reply Last reply Reply Quote 0
      • KristinaKledzik
        KristinaKledzik last edited by

        Hmm, that doesn't seem good. It's hard to say whether this is causing the decline in your rankings, but either way, you want to make sure that you're not splitting your link equity between your / and .shtml pages. Here's what I'd do:

        1. If you can, 301 redirect / pages to .shtml pages. Obviously, it'd be easier if the canonical worked, but it sounds like it doesn't.
        2. Use ScreamingFrog or DeepCrawl to look through internal pages on your site to see if you're ever linking to the / version of pages rather than the .shtml pages. When Google chooses a different version of a URL over the canonical one, it's often because that's how it sees internal links pointing to the page. Make sure that you only have links to the .shtml version of the page.
        3. Use a tool like Moz or Ahrefs to find all internal links to your site. For any links that you built or have a partnership with the owners, make sure that they're linking to the .shtml version of the page. I could especially see your ad partners using / because it's a cleaner before parameters than .shtml.

        After that, wait and see if Google fixes the problem.

        Also worth noting: have you thought about changing your default to /? That's more common today, so you're probably getting a lot of external links with / instead of .shtml, and you'll never be able to fix that problem. If that's a possible solution, you may want to explore it.

        Good luck!

        Kristina

        gheh2013 1 Reply Last reply Reply Quote 1
        • gheh2013
          gheh2013 @KristinaKledzik last edited by

          Many thanks for taking the time to respond Kristina.

          1. I don't like to do redirects, as so many have warned of the consequences in terms of link juice

          2. No, I don't link to the pages in question using "/" rather than the ".shtml" version of the page indexed.

          3. A few external sources use the "/" version (recent linkers) I have found, but they likely only did so as they saw it displayed as such in the SERPs previously. No commercial or other affiliate  sites do.

          The reason I was really confused is that some pages are indexed using the "/", while others are not -- with no apparent reason I could locate. The "/" version for pages still remains on the first page for keywords, even with far less domain authorities and pages linking to them (for now!). We will be moving to another platform with a different default extension, so I wonder how that will be handled. Endless mysteries.

          Thank you again for your time and suggestions,

          Greg

          KristinaKledzik 1 Reply Last reply Reply Quote 0
          • KristinaKledzik
            KristinaKledzik @gheh2013 last edited by

            Can you clarify what you're concerned about for 301 redirects in terms of link juice?

            301 redirects don't carry as much link juice as a direct link, but it doesn't impact correct links, just the links that, otherwise, wouldn't get link juice to your end destination at all. (Though, if your canonical is working correctly, it'll pass the same amount of link juice as a 301 redirect.)

            Dr. Pete goes into this a bit more over here: https://moz.com/community/q/do-canonical-tags-pass-all-of-the-link-juice-onto-the-url-they-point-to

            1 Reply Last reply Reply Quote 0
            • 1 / 1
            • First post
              Last post
            • Google Indexed Site A's Content On Site B, Site C etc
              Paddy_Moogan
              Paddy_Moogan
              1
              7
              70

            • Google Is Indexing my 301 Redirects to Other sites
              Keszi
              Keszi
              0
              4
              573

            • Moving html site to wordpress and 301 redirect from index.htm to index.php or just www.example.com
              zeehj
              zeehj
              0
              4
              1.3k

            • I'm noticing that URL that were once indexed by Google are suddenly getting dropped without any error messages in Webmasters Tools, has anyone seen issues like this before?
              nystromandy
              nystromandy
              0
              7
              67

            • Why isn't Google indexing this site?
              GastonRiera
              GastonRiera
              0
              8
              124

            • Why is this site not indexed by Google?
              PaddyDisplays
              PaddyDisplays
              0
              2
              113

            • Why is a site no longer being indexed by Google after HTTPS switch?
              JaneCopland
              JaneCopland
              0
              3
              114

            • How can we get a site reconsidered for Google indexing?
              d25kart
              d25kart
              0
              3
              301

            Get started with Moz Pro!

            Unlock the power of advanced SEO tools and data-driven insights.

            Start my free trial
            Products
            • Moz Pro
            • Moz Local
            • Moz API
            • Moz Data
            • STAT
            • Product Updates
            Moz Solutions
            • SMB Solutions
            • Agency Solutions
            • Enterprise Solutions
            • Digital Marketers
            Free SEO Tools
            • Domain Authority Checker
            • Link Explorer
            • Keyword Explorer
            • Competitive Research
            • Brand Authority Checker
            • Local Citation Checker
            • MozBar Extension
            • MozCast
            Resources
            • Blog
            • SEO Learning Center
            • Help Hub
            • Beginner's Guide to SEO
            • How-to Guides
            • Moz Academy
            • API Docs
            About Moz
            • About
            • Team
            • Careers
            • Contact
            Why Moz
            • Case Studies
            • Testimonials
            Get Involved
            • Become an Affiliate
            • MozCon
            • Webinars
            • Practical Marketer Series
            • MozPod
            Connect with us

            Contact the Help team

            Join our newsletter
            Moz logo
            © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
            • Accessibility
            • Terms of Use
            • Privacy