The Moz Q&A Forum

    Page not being indexed or crawled and no idea why!

    White Hat / Black Hat SEO
    • CSawatzky

      Hi everyone,

      There are a few pages on our website that aren't being indexed right now on Google and I'm not quite sure why. A little background:

      We are an IT training and management training company, and we have locations/classrooms around the US. To improve our search rankings and overall visibility, we made some changes to the on-page content, URL structure, etc. Let's take our Washington DC location as an example. The old address was:

      http://www2.learningtree.com/htfu/location.aspx?id=uswd44

      And the new one is:

      http://www2.learningtree.com/htfu/uswd44/reston/it-and-management-training

      Not all of the SEO changes are live yet, so bear with me. My question is why the first URL is still being crawled, indexed, and shown in the search results while the second one (which we want to show) is not. The changes have been live for around a month now, which should be plenty of time for the new URL to at least be indexed.

      In fact, we don't want the first URL to show anymore; we'd like the second URL type to show across the board. Also, when I search Google for site:http://www2.learningtree.com/htfu/uswd44/reston/it-and-management-training, I get a message that Google can't read the page because of the robots.txt file. But we have no robots.txt file. Our web guys have told me that the two pages are exactly the same, and that we've put in an order to have all the old links 301 redirected to the new ones. Still, I'm perplexed as to why these pages are not being indexed or crawled, even after I manually submitted one in Webmaster Tools.

      So, why is Google still recognizing the old URLs and why are they still showing in the index/search results?

      And, why is Google saying "A description for this result is not available because of this site's robots.txt"

      Thanks in advance!

      • Pedram
      • MikeRoberts

        Your robots.txt (which can be found at http://www2.learningtree.com/robots.txt) does in fact contain Disallow: /htfu/, which blocks http://www2.learningtree.com/htfu/uswd44/reston/it-and-management-training from being crawled. While your old page is also technically blocked, it has been around longer and would already have been indexed, so it still appears in the SERPs. The bots just won't be able to see changes made to it because they can't crawl it.
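To make the effect of that rule concrete, here is a minimal sketch using Python's standard-library `urllib.robotparser`. It assumes the only relevant rule is the `Disallow: /htfu/` line quoted above (the rest of the live file is not reproduced here), and the `"Googlebot"` user-agent string is just an illustrative choice.

```python
from urllib.robotparser import RobotFileParser

# Assumption: only the rule mentioned in this thread is modeled here;
# the rest of the live robots.txt is not reproduced.
ROBOTS_TXT = """\
User-agent: *
Disallow: /htfu/
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

new_url = "http://www2.learningtree.com/htfu/uswd44/reston/it-and-management-training"
old_url = "http://www2.learningtree.com/htfu/location.aspx?id=uswd44"

# Both URLs live under /htfu/, so both are blocked from crawling.
new_allowed = parser.can_fetch("Googlebot", new_url)
old_allowed = parser.can_fetch("Googlebot", old_url)
print(new_allowed, old_allowed)  # prints: False False
```

Both URLs come back as blocked, which matches the point above: the old page only shows in the results because it was picked up before the block, not because it can still be crawled.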

        You need to fix the disallow so the bots can crawl your site correctly and you should 301 your old page to the new one.
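As a rough illustration of the old-to-new mapping that the 301s need to cover, the sketch below derives the new-style URL from an old `location.aspx?id=...` URL. The function name `redirect_target` and the `CITY_SLUGS` table are hypothetical; only the two id/city pairs that appear in this thread are filled in, and it assumes the Alexandria page had an old URL of the same `location.aspx` form. The actual 301 would be issued by the web server, not by a script like this.

```python
from urllib.parse import urlsplit, parse_qs

# Hypothetical lookup table: location id -> city slug.
# Only these two pairs appear in the thread; a real table would
# come from the site's own location data.
CITY_SLUGS = {"uswd44": "reston", "uswd74": "alexandria"}


def redirect_target(old_url):
    """Return the new-style URL for an old location.aspx URL, or None."""
    parts = urlsplit(old_url)
    if not parts.path.lower().endswith("/htfu/location.aspx"):
        return None  # not an old-style location URL
    ids = parse_qs(parts.query).get("id", [])
    if not ids:
        return None  # no id parameter to map from
    location_id = ids[0].lower()
    slug = CITY_SLUGS.get(location_id)
    if slug is None:
        return None  # unknown location: nothing to redirect to
    return (
        f"{parts.scheme}://{parts.netloc}/htfu/"
        f"{location_id}/{slug}/it-and-management-training"
    )


print(redirect_target("http://www2.learningtree.com/htfu/location.aspx?id=uswd44"))
```

A table like this is also handy for spot-checking afterwards that every old URL has a target and none were missed.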

        • CSawatzky

          Thanks, Mike. That was incredibly helpful. I did click the link on the SERP when I ran the "site:" search on Google, but I thought it was a mistake. Are you able to see a robots disallow in the page's source code as well?

          • MikeRoberts

            The pages in question don't have any Meta Robots Tags on them. So once the Disallow in Robots.txt is gone and you do a fetch request in Webmaster Tools, the page should get crawled and indexed fine. If you don't have a Meta Robots Tag, the spiders consider it Index,Follow. Personally I prefer to include the index, follow tag anyway even if it isn't 100% necessary.
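If you want to check a page yourself, a small sketch along these lines would do it, using only Python's standard-library `html.parser`. The class and function names (`MetaRobotsParser`, `effective_robots`) are made up for this example, and it only looks at the generic `robots` meta tag, not bot-specific tags or HTTP headers.

```python
from html.parser import HTMLParser


class MetaRobotsParser(HTMLParser):
    """Collects the directives of a <meta name="robots"> tag, if present."""

    def __init__(self):
        super().__init__()
        self.directives = None  # None means no meta robots tag was found

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = dict(attrs)
        if (attr.get("name") or "").lower() == "robots":
            content = attr.get("content") or ""
            self.directives = [
                d.strip().lower() for d in content.split(",") if d.strip()
            ]


def effective_robots(html):
    """Return the page's robots directives, defaulting to index, follow."""
    parser = MetaRobotsParser()
    parser.feed(html)
    if parser.directives is None:
        # No tag present: spiders treat the page as index, follow.
        return ["index", "follow"]
    return parser.directives


print(effective_robots("<html><head><title>Training</title></head></html>"))
print(effective_robots('<head><meta name="robots" content="noindex, nofollow"></head>'))
```

The first call has no tag and falls back to the default; the second reports the explicit directives.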

            • CSawatzky

              Hi Mike,

              As a follow-up, I forwarded your suggestions to our webmasters. They adjusted the robots.txt, and it now reads as follows. I think it still might cause issues, and I'm not 100% sure why it's set up this way:

              User-agent: *
              Allow: /htfu/
              Disallow: /htfu/app_data/
              Disallow: /htfu/bin/
              Disallow: /htfu/PrecompiledApp.config
              Disallow: /htfu/web.config
              Disallow: /
              
              Now, this page is being indexed: http://www2.learningtree.com/htfu/uswd74/alexandria/it-and-management-training
              
              But a more niche page still isn't being indexed: http://www2.learningtree.com/htfu/usny27/new-york/sharepoint-training
              
              Suggestions?
              
              • MikeRoberts

                It possibly just hasn't been long enough for the spiders to re-crawl everything yet. Have you done a fetch request in Webmaster Tools for the page and/or site to see if you can jumpstart things a little? It's also possible that the spiders haven't found a path to it yet. Do you have enough (or any) pages linking to that second page, the one that isn't being indexed yet?
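For what it's worth, the revised robots.txt quoted earlier in this thread can be sanity-checked with a small sketch of the precedence rule Google documents: the longest matching path wins, and on a tie the less restrictive (Allow) rule wins. This is a simplified model written for illustration, not Google's parser; it ignores wildcards and user-agent groups, and the `/htfu/bin/example.dll` and `/about` paths are hypothetical test inputs.

```python
# Rules copied from the revised robots.txt quoted in this thread.
RULES = [
    ("allow", "/htfu/"),
    ("disallow", "/htfu/app_data/"),
    ("disallow", "/htfu/bin/"),
    ("disallow", "/htfu/PrecompiledApp.config"),
    ("disallow", "/htfu/web.config"),
    ("disallow", "/"),
]


def is_allowed(path, rules=RULES):
    """Longest matching prefix wins; on a tie, allow wins. No wildcards."""
    best_len = -1
    best_allow = True  # no matching rule at all means the path is allowed
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        allow = kind == "allow"
        if len(prefix) > best_len or (len(prefix) == best_len and allow):
            best_len, best_allow = len(prefix), allow
    return best_allow


print(is_allowed("/htfu/usny27/new-york/sharepoint-training"))
print(is_allowed("/htfu/bin/example.dll"))  # hypothetical path
print(is_allowed("/about"))                 # hypothetical path
```

Under this model the New York page is crawlable, so the robots.txt itself should no longer be what holds it back. Note, though, that `Disallow: /` still blocks everything outside `/htfu/`, which is worth confirming is intentional.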

                • CSawatzky

                  Hi Mike,

                  Thanks for the reply. I'm out of the country right now, so my replies might be somewhat slow.

                  Yes, we have links to the pages on our sitemaps and I have done fetch requests. I did a check now and it seems that the niched "New York" page is being crawled now. Might have been a time issue as you suggested. But, our DC page still isn't being crawled. I'll check up on it periodically and see the progress. I really appreciate your suggestions - it's already helping. Thank you!
