The Moz Q&A Forum

    Why isn't our new site being indexed?

    Technical SEO Issues
    • RobbieD91
      RobbieD91 @TimHolmes last edited by

      I noticed the robots.txt file returned a 404 and asked the developers to take a look and they said the content of it is fine.

      But yes, I'll double-check the WordPress settings now.

      • EGOL
        EGOL last edited by

        I noticed the robots.txt file returned a 404 and asked the developers to take a look and they said the content of it is fine.

        Sometimes developers say this stuff.  If you are getting a 404, demonstrate it to them.
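One way to demonstrate the 404 is to record the status code the server actually returns. The following is a minimal sketch using only the Python standard library; the helper names `fetch_status` and `describe` are made up for this illustration, and the result only reflects what the machine running it sees.

```python
import urllib.error
import urllib.request


def fetch_status(url, user_agent="Mozilla/5.0", timeout=10):
    """Return the HTTP status code for url, including 4xx/5xx codes."""
    request = urllib.request.Request(url, headers={"User-Agent": user_agent})
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as err:
        # urllib raises on 4xx/5xx responses; the code is still what we want.
        return err.code


def describe(status):
    """Turn a status code into a one-line summary for a bug report."""
    if status == 200:
        return "200: robots.txt is being served"
    if status == 404:
        return "404: robots.txt not found at this URL"
    return f"{status}: unexpected response, worth investigating"


# Example call (needs network access, so it is left commented out):
# print(describe(fetch_status("https://www.woofadvisor.com/robots.txt")))
```

Sending the developers the URL, the status code, and the time of the request usually makes the problem easier to reproduce on their side.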

        • RobbieD91
          RobbieD91 @EGOL last edited by

          Strangely, there are two pages indexed on Google Search.

          The homepage and one other.

          • DirkC
            DirkC last edited by

            Hi,

            I can't access your site from Belgium - I guess you are redirecting your users based on IP address. Visitors who, like me, are not located in your target country are 302 redirected to https://www.woofadvisor.com/holding-page.php, and only 1 page is indexed.

            Not sure which country you are actually targeting - but could it be that you're accidentally redirecting Googlebot as well?

            Check also this article from Google on IP-based targeting.
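A redirect like the one described above can be observed by requesting the homepage without following redirects and looking at the status code and `Location` header. This is a rough sketch with the Python standard library; `first_hop` and `is_geo_redirect` are hypothetical helper names. Because the redirect appears to depend on the visitor's IP address, the script only shows what the location it runs from is served; changing the user agent does not imitate Googlebot's IP.

```python
import http.client
from urllib.parse import urlsplit

REDIRECT_CODES = (301, 302, 303, 307, 308)


def first_hop(url, user_agent="Mozilla/5.0", timeout=10):
    """Return (status, Location header) for url without following redirects."""
    parts = urlsplit(url)
    if parts.scheme == "https":
        conn = http.client.HTTPSConnection(parts.netloc, timeout=timeout)
    else:
        conn = http.client.HTTPConnection(parts.netloc, timeout=timeout)
    path = parts.path or "/"
    if parts.query:
        path += "?" + parts.query
    try:
        conn.request("GET", path, headers={"User-Agent": user_agent})
        response = conn.getresponse()
        return response.status, response.getheader("Location")
    finally:
        conn.close()


def is_geo_redirect(status, location, holding_path="/holding-page.php"):
    """True if the response redirects to the holding page."""
    if status not in REDIRECT_CODES or location is None:
        return False
    return urlsplit(location).path == holding_path


# Example call (needs network access, so it is left commented out):
# status, location = first_hop("https://www.woofadvisor.com/")
# print(status, location, is_geo_redirect(status, location))
```

To see what other regions get, the same check has to be run from a machine in that region, or through a service such as webpagetest.org.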

            rgds

            Dirk

            • DirkC
              DirkC last edited by

              I just did a quick check on your site with Webpagetest.org using a California IP address (http://www.webpagetest.org/result/150907_G1_TE9/). As you can see there, these IPs also go to the holding page, which is logically the only page that can be indexed, as it's the only one Googlebot can access.

              rgds,

              Dirk

              • Tom-Anthony
                Tom-Anthony last edited by

                I'd be concerned about the 404ing robots.txt file.

                You should check in Search Console:

                1. What does Search Console show in the robots.txt section?

                2. What happens if you fetch a page that is not indexed (e.g. https://www.woofadvisor.com/travel-tips.php) with the 'Fetch as Googlebot' tool?

                I checked and do not see any obvious indicators of why the pages are not being indexed - we need more info.

                • DirkC
                  DirkC last edited by

                  To be very honest - I am quite surprised that this question is still marked as "Unanswered".

                  The owners of the site decided to block access for all non-UK/Ireland addresses. The main Googlebot uses a Californian IP address to visit the site. Hence the only page Googlebot can see is https://www.woofadvisor.com/holding-page.php, which has no links to the other parts of the site (this is confirmed by the webpagetest.org test with a Californian IP address).

                  As Google indicates, Googlebot can also use other IP addresses to crawl the site ("With geo-distributed crawling, Googlebot can now use IP addresses that appear to come from other countries, such as Australia.") - however, it is very likely that these bots do not crawl with the same frequency/depth as the main bot (the article clearly indicates "Google might not crawl, index, or rank all of your locale-adaptive content. This is because the default IP addresses of the Googlebot crawler appear to be based in the USA.").

                  This can easily be solved by adding a link on /holding-page.php to the Irish/UK version, which contains the full content and is accessible to all IP addresses. Googlebot can follow that link to index the full site (so only put the IP detection on the homepage, not on the other pages).
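The "homepage only" rule suggested above could look roughly like the sketch below. It is an illustration under assumptions, not the site's actual code: the site appears to run on PHP, the function name `redirect_target` is invented, and the two-letter country code is assumed to come from a GeoIP lookup that is not shown here.

```python
# Assumed values for illustration: the thread only says UK and Ireland
# are the target countries and names the holding page URL.
ALLOWED_COUNTRIES = {"GB", "IE"}
HOLDING_PAGE = "/holding-page.php"


def redirect_target(path, country_code):
    """Return the path to redirect to, or None to serve the page as-is."""
    if path != "/":
        # Deep pages are always served, so crawlers from any country
        # can reach them through the link on the holding page.
        return None
    if country_code in ALLOWED_COUNTRIES:
        return None
    return HOLDING_PAGE
```

With this rule a visitor from the USA asking for `/` is sent to the holding page, while the same visitor asking for `/travel-tips.php` gets the page itself.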

                  The fact that the robots.txt gives a 404 is not relevant: if no robots.txt is found Google assumes that the site can be indexed (check this link) - quote: "You only need a robots.txt file if your site includes content that you don't want Google or other search engines to index."
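The "no rules means everything may be crawled" behaviour can be illustrated with Python's standard `urllib.robotparser`. This is only an analogy: it shows how Python's parser treats an empty rule set, not how Googlebot itself is implemented. The helper name `allowed_with_rules` is made up for the example, and the empty list stands in for a missing robots.txt.

```python
from urllib.robotparser import RobotFileParser


def allowed_with_rules(lines, url, user_agent="Googlebot"):
    """Parse robots.txt lines and report whether user_agent may fetch url."""
    parser = RobotFileParser()
    parser.parse(lines)
    return parser.can_fetch(user_agent, url)


URL = "https://www.woofadvisor.com/travel-tips.php"

# No rules at all: nothing is disallowed.
no_rules = allowed_with_rules([], URL)

# A blanket Disallow is what would actually block crawling.
blocked = allowed_with_rules(["User-agent: *", "Disallow: /"], URL)

print(no_rules, blocked)  # prints: True False
```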

                  • Tom-Anthony
                    Tom-Anthony @DirkC last edited by

                    I am in California right now, and can access the website just fine, which is why I didn't mark the question as answered - I don't think we have enough info yet. I think 'Fetch as Googlebot' will help us resolve that.

                    You are correct that if there is no robots.txt then Google assumes the site is open, but my concern is that the developers on the team say that there IS a robots.txt file there and it has some contents. I have, on at least two occasions, come across a team that was serving a robots.txt that was only accessible to search bots (once they were doing that 'for security', another time because they misunderstood how it worked). That is why I suggested checking Search Console to see what shows up for robots.txt.
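A quick first check for a robots.txt that is served only to search bots is to request it with a browser user agent and with a Googlebot user agent and compare the results. The sketch below assumes the server decides on the `User-Agent` header alone; a server that verifies Googlebot by IP address or reverse DNS will not be fooled, so Search Console remains the authoritative source. The names `fetch` and `compare_by_user_agent` are hypothetical.

```python
import urllib.error
import urllib.request

USER_AGENTS = {
    "browser": "Mozilla/5.0 (Windows NT 10.0; Win64; x64)",
    "googlebot": (
        "Mozilla/5.0 (compatible; Googlebot/2.1; "
        "+http://www.google.com/bot.html)"
    ),
}


def fetch(url, user_agent, timeout=10):
    """Return (status, body) for url when requested with user_agent."""
    request = urllib.request.Request(url, headers={"User-Agent": user_agent})
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status, response.read()
    except urllib.error.HTTPError as err:
        # 4xx/5xx responses still carry a status and a body.
        return err.code, err.read()


def compare_by_user_agent(url, fetcher=fetch):
    """Fetch url once per user agent; report whether the responses differ."""
    results = {name: fetcher(url, ua) for name, ua in USER_AGENTS.items()}
    differs = len(set(results.values())) > 1
    return differs, results


# Example call (needs network access, so it is left commented out):
# differs, results = compare_by_user_agent(
#     "https://www.woofadvisor.com/robots.txt")
# print(differs, {name: status for name, (status, _) in results.items()})
```

The `fetcher` parameter exists so the comparison can be exercised with a stand-in function instead of a live request.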

                    • DirkC
                      DirkC @Tom-Anthony last edited by

                      Hi Tom,

                      I am not questioning your knowledge - I re-ran the test on webpagetest.org and I see that the site is now accessible from a Californian IP (http://www.webpagetest.org/result/150911_6V_14J6/), which wasn't the case a few days ago (check the result on http://www.webpagetest.org/result/150907_G1_TE9/) - so there has been a change to the IP redirection. I also checked from Belgium - the site is now also accessible from here.

                      I also notice that if I now do a site:woofadvisor.com search in Google I get 19 pages indexed rather than the 2 I got a few days ago.

                      Apparently removing the IP redirection solved (or is solving) the indexation issue - but still this question remains marked as "unanswered".

                      rgds,

                      Dirk

                      • Tom-Anthony
                        Tom-Anthony @DirkC last edited by

                        Hey Dirk,

                        No worries - I visited the question for the first time today and considered it unanswered as the site is perfectly accessible in California. I like to confirm what Search Console says as that is 'straight from the horse's mouth'.

                        Thanks for confirming that the IP redirect has changed, that is interesting. It is impossible for us to know when that happened - I would have expected things to get indexed quite fast once it changed.

                        With the extra info I'm happy to mark this as answered, but it would be good to hear from the OP.

                        Best,

                        -Tom
