The Moz Q&A Forum

    How did the Google crawler find and cache orphan pages and a directory?

    • darshit21
      darshit21 last edited by

      I have a website, www.test.com.

      I made some changes to the live website and uploaded them to a recently created "demo" directory for client approval.

      So my demo link is www.test.com/demo/

      I am not doing any link building or any other activity that would pass a referral link to www.test.com/demo/

      So how did the Google crawler find it and cache some pages, or even the entire directory?

      Thanks

      • DanDeceuster
        DanDeceuster last edited by

        Did you block the /demo/ directory in your robots.txt file? That is step number one in trying to ensure it doesn't get crawled. Also, are you using WordPress? If so, WordPress automatically pings search engines when you add a post, and if you use the common sitemap plugin, it submits the sitemap to Google automatically when it creates it, so that's another way Google could have found it.
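For anyone who wants to test a rule before relying on it, here is a minimal sketch using Python's standard `urllib.robotparser`. The robots.txt contents and the URLs are assumptions made up for this thread's www.test.com example, not the poster's real file. Keep in mind that a Disallow rule only asks compliant crawlers not to fetch the pages; it is not access control.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt for www.test.com (an assumption for illustration).
# The Disallow rule asks compliant crawlers to stay out of /demo/.
ROBOTS_TXT = """\
User-agent: *
Disallow: /demo/
"""


def is_crawl_allowed(robots_txt: str, url: str, user_agent: str = "Googlebot") -> bool:
    """Return True if the given robots.txt text permits user_agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for url in (
        "http://www.test.com/demo/index.html",
        "http://www.test.com/index.html",
    ):
        verdict = "allowed" if is_crawl_allowed(ROBOTS_TXT, url) else "blocked"
        print(url, "->", verdict)
```

Pasting your own robots.txt text into `ROBOTS_TXT` is a quick way to confirm the rule covers the directory you meant it to cover.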

        • Theo-NL
          Theo-NL last edited by

          Did this actually happen, or are we talking about a hypothetical situation? Could there be a link to the demo directory that you've overlooked? Has the /demo folder perhaps been used in the past, with old links still pointing to it?

          As a meta-solution to this problem: keep crawlers and nosy people away from the content by adding a .htpasswd login to the area used for client approval.
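A .htpasswd login works through HTTP Basic authentication: a request without valid credentials gets a 401 response, so a crawler never sees the content. The sketch below only illustrates that check in Python; it is not how Apache implements it, and the user name, password, and function name are all made up for the example.

```python
import base64
import hmac

# Hypothetical credentials for the client-approval area (illustration only).
DEMO_USERS = {"client": "s3cret-demo-pass"}


def status_for_request(authorization_header, users=DEMO_USERS):
    """Return the status a Basic-auth-protected area would send: 200 or 401."""
    prefix = "Basic "
    if not authorization_header or not authorization_header.startswith(prefix):
        return 401  # no credentials: crawlers and casual visitors stop here
    try:
        raw = base64.b64decode(authorization_header[len(prefix):], validate=True)
        decoded = raw.decode("utf-8")
    except ValueError:
        # Covers malformed base64 and undecodable bytes.
        return 401
    username, sep, password = decoded.partition(":")
    expected = users.get(username)
    if not sep or expected is None:
        return 401
    # compare_digest avoids leaking information through comparison timing.
    if hmac.compare_digest(password.encode("utf-8"), expected.encode("utf-8")):
        return 200
    return 401
```

A crawler sends no `Authorization` header at all, so it always lands in the first branch and gets nothing to cache.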

          • Guest
            Guest last edited by

            This post is deleted!
            • darshit21
              darshit21 @DanDeceuster last edited by

              Hi Dan,

              No, I have not excluded the "demo" directory in robots.txt for any search engine.

              I am not using WordPress; it's a simple static HTML website (no CMS of any kind).

              • darshit21
                darshit21 @Theo-NL last edited by

                Hi Thetjo,

                I know about it.

                My question is: how did Google crawl it without any referral link?

                Thanks.

                • darshit21
                  darshit21 @Guest last edited by

                  Hi JoelHit,

                  No, there is not a single referral link to the "demo" directory from anywhere on the website or from third-party websites.

                  I am aware of how Google's crawling and indexing systems work.

                  Thanks.

                  • StalkerB
                    StalkerB last edited by

                    <conspiracy hat>

                    Did either you or your client use Gmail when you sent him the demo link?

                    Regardless, Dan's advice to noindex the directory and block it from spiders is the way to go for development work in future.
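To check whether a demo page actually carries a noindex directive, something like the following rough Python sketch could be used. It looks for a robots meta tag in the HTML and for an `X-Robots-Tag` response header; the function and class names are made up for the example, and it ignores the user-agent-prefixed form of the header. One caveat: a crawler has to be able to fetch a page to see a noindex on it, so a robots.txt block and a noindex tag don't simply stack.

```python
from html.parser import HTMLParser


class _RobotsMetaParser(HTMLParser):
    """Collects directives from <meta name="robots" content="..."> tags."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = {key: (value or "") for key, value in attrs}
        if attr.get("name", "").lower() in ("robots", "googlebot"):
            content = attr.get("content", "")
            self.directives.update(d.strip().lower() for d in content.split(","))


def has_noindex(html, headers=None):
    """Return True if the page HTML or the response headers say noindex."""
    parser = _RobotsMetaParser()
    parser.feed(html)

    header_value = ""
    for name, value in (headers or {}).items():
        if name.lower() == "x-robots-tag":
            header_value = value
    header_directives = {d.strip().lower() for d in header_value.split(",")}

    return "noindex" in parser.directives or "noindex" in header_directives
```

For a static HTML site like the one in this thread, the meta tag is usually the easier route, since the header has to be set in the server configuration.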

                    • darshit21
                      darshit21 @StalkerB last edited by

                      Hi Barry,

                      Yes, we used Gmail for reporting.

                      Does that make a difference?

                      • ChrisMacNaughton
                        ChrisMacNaughton @darshit21 last edited by

                        The <conspiracy hat> side of things was him commenting that Google is sometimes accused of processing everything in Gmail and could possibly have pulled your link to the demo directory from that.

                        • StalkerB
                          StalkerB @darshit21 last edited by

                          Yup, correct.

                          I was certain I'd replied to this 😕

                          Anyway, you ever notice how the ads in Gmail are always relevant to the content of your emails? Google are totally reading them 😉

                          • darshit21
                            darshit21 @darshit21 last edited by

                            Is Google crawling our emails?

                            Is that possible?

                            • KeriMorgret
                              KeriMorgret last edited by

                              Try putting the URL into Google and see if you find any pages linking to it.

                              I knew a company that created a test site that was a copy of a live site (made with a specific hosted CMS). They didn't exclude the test site in robots.txt because "we all know we won't link to it, so it'll be OK". The site got indexed because a person at the company who was having problems with the implementation of the test site went to the help forum (which that person didn't think would be indexed) and posted the URL of the test site.

                              I found this just by putting the URL of the test site into Google, where I saw the help forum post. You might try the same to see if there is somehow a rogue link.
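A Google search only turns up links Google already knows about. For a static HTML site like the one in this thread, a local scan of your own pages is a complementary check. Here is a rough Python sketch, assuming the pages are supplied as a dict of site path to HTML text; the function name, the sample pages, and the /demo/ prefix are assumptions for illustration.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class _LinkCollector(HTMLParser):
    """Collects the href value of every <a> tag it sees."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def find_links_to(pages, base_url, target_prefix):
    """Return (page_path, href) pairs for same-host links under target_prefix.

    pages maps a site path such as "/index.html" to that page's HTML.
    Pages that live under target_prefix themselves are skipped.
    """
    base_host = urlparse(base_url).netloc
    hits = []
    for page_path, html in sorted(pages.items()):
        if page_path.startswith(target_prefix):
            continue  # links inside the demo area itself are expected
        collector = _LinkCollector()
        collector.feed(html)
        page_url = urljoin(base_url, page_path)
        for href in collector.hrefs:
            # Resolve relative links against the page they appear on.
            resolved = urlparse(urljoin(page_url, href))
            if resolved.netloc == base_host and resolved.path.startswith(target_prefix):
                hits.append((page_path, href))
    return hits
```

Building the `pages` dict from the files on disk (for example with `pathlib.Path.rglob("*.html")`) is left out to keep the sketch short. An empty result only rules out links from your own pages, not links from emails, forums, or other sites.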
