The Moz Q&A Forum

    Googlebot and other spiders are searching for odd links in our website trying to understand why, and what to do about it.

    Technical SEO Issues
    • linkjuiced
      linkjuiced last edited by

      I recently began work on an existing WordPress website that was revamped about 3 months ago: https://thedoctorwithin.com. I'm a bit new to WordPress, so I thought I should reach out to some of the experts in the community.

      Checking ‘Not found’ Crawl Errors in Google Search Console, I notice many irrelevant links that are not present in the website or the database, as near as I can tell. When checking the source of these irrelevant links, I notice they’re all reported as linked from various pages in the site, as well as from pages that have never existed.

      For instance:

      • https://thedoctorwithin.com/category/seminars/newsletters/page/7/newsletters/page/3/feedback-and-testimonials/        allegedly linked from:
      • https://thedoctorwithin.com/category/seminars/newsletters/page/7/newsletters/page/3/ (doesn’t exist)
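One possible explanation for the pattern above, offered as an assumption and not a diagnosis: if a template or menu emits a relative href (no leading slash) and WordPress answers a non-existent nested URL with page content instead of a 404, a crawler resolves that href against the nested URL and produces an ever-deeper path. The sketch below only demonstrates the URL arithmetic; the `relative_href` value is hypothetical and may not match the site's actual markup.

```python
from urllib.parse import urljoin

# URL that Search Console reports as the (non-existent) linking page.
base = "https://thedoctorwithin.com/category/seminars/newsletters/page/7/newsletters/page/3/"

# Assumption: the page markup contains a relative href like this one.
relative_href = "feedback-and-testimonials/"
# The same link written root-relative, for comparison.
root_relative_href = "/feedback-and-testimonials/"

# A crawler resolves each href against the URL of the page it found it on.
resolved_relative = urljoin(base, relative_href)
resolved_root = urljoin(base, root_relative_href)

print(resolved_relative)  # nested path, matches the reported 404
print(resolved_root)      # flat path, independent of the base
```

If this is the cause, switching such links to root-relative or absolute URLs would stop the nesting at its source.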

      In other cases, these goofy URLs are even reported as linked from the sitemap, although every URL actually listed in the sitemap is valid.

      Currently, the site has a flat structure. Nearly all the content is merely URL/content/ without further breakdown (or subdirectories). Previous site versions had a more varied page organization, but what I'm seeing doesn't seem to reflect the current page organization, nor the previous page organization.

      Had a similar issue, due to use of Divi's search feature. Ended up with some pretty deep non-existent links branching off of /search/, such as:

      • https://thedoctorwithin.com/search/newsletters/page/2/feedback-and-testimonials/feedback-and-testimonials/online-continuing-education/consultations/  allegedly linked from:
      • https://thedoctorwithin.com/search/newsletters/page/2/feedback-and-testimonials/feedback-and-testimonials/online-continuing-education/ (doesn't exist).

      I blocked the /search/ branches via robots.txt. No real loss, since neither /search/ nor any of its subdirectories are valid.
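A quick way to sanity-check a robots.txt rule like the one described above is Python's standard `urllib.robotparser`. This is a minimal sketch: the `robots_txt` content is assumed (the post does not show the actual file), and the `allowed` URL is assumed to be a real page on the site.

```python
from urllib.robotparser import RobotFileParser

# Assumed robots.txt content; the actual file is not shown in the post.
robots_txt = """\
User-agent: *
Disallow: /search/
"""

parser = RobotFileParser()
parser.parse(robots_txt.splitlines())

# One of the spurious /search/ URLs quoted in the post.
blocked = (
    "https://thedoctorwithin.com/search/newsletters/page/2/"
    "feedback-and-testimonials/feedback-and-testimonials/"
    "online-continuing-education/consultations/"
)
# Assumed to be a legitimate flat URL on the site.
allowed = "https://thedoctorwithin.com/feedback-and-testimonials/"

search_is_crawlable = parser.can_fetch("Googlebot", blocked)
page_is_crawlable = parser.can_fetch("Googlebot", allowed)

print(search_is_crawlable, page_is_crawlable)
```

Note that a Disallow rule only stops compliant crawlers from fetching those paths; it does not by itself remove URLs that are already indexed.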

      There are numerous pre-existing categories and tags on the site. The categories and tags aren't used as pages. I suspect Google (and other engines) might be creating arbitrary paths from these. Looking through the site’s 404 errors, I’m seeing the same behavior from Bing, Moz and other spiders as well.

      I suppose I could use Search Console to remove URL/category/ and URL/tag/, and do the same with other legitimate spiders and search engines. Perhaps it would be better to use mod_rewrite to lead spiders to pages that actually do exist.
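Before writing rewrite rules, it can help to pin down the redirect decision itself. The sketch below is one possible policy, not the site's actual configuration: because the site is flat, a spurious nested URL whose last segment is a known page slug could be 301-redirected to that flat page, and everything else left as a 404. The `KNOWN_SLUGS` set and the `redirect_target` helper are hypothetical; the slugs are taken from the example URLs and assumed to be real pages.

```python
from typing import Optional
from urllib.parse import urlsplit

# Hypothetical list of slugs for pages that really exist; in practice this
# would come from the sitemap or the WordPress database.
KNOWN_SLUGS = {
    "feedback-and-testimonials",
    "online-continuing-education",
    "consultations",
}


def redirect_target(url: str) -> Optional[str]:
    """Return the flat URL to redirect to, or None to leave the request as a 404."""
    parts = urlsplit(url)
    segments = [s for s in parts.path.split("/") if s]
    # A single-segment path is already in flat form; nothing to redirect.
    if len(segments) < 2:
        return None
    last = segments[-1]
    if last in KNOWN_SLUGS:
        return f"{parts.scheme}://{parts.netloc}/{last}/"
    return None


spurious = (
    "https://thedoctorwithin.com/category/seminars/newsletters/"
    "page/7/newsletters/page/3/feedback-and-testimonials/"
)
print(redirect_target(spurious))
```

The same rule could then be expressed in .htaccess once the policy is settled; paths that end in something other than a known slug (such as a pagination number) stay 404s instead of being redirected blindly.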

      • Looking forward to suggestions about the best way to deal with these errant requests.
      • Also curious to learn why these are occurring.

      Thank you.

      • Vijay-Gaur
        Vijay-Gaur last edited by

        Hi There,

        Your website is built on WordPress, and it looks like there might be spurious entries in the DB, which might also not be getting deleted because of the WP Super Cache plugin. You could try emptying your cache and installing the 'All 404 Redirect' plugin and a 301 management plugin.

        I hope this helps.

        Regards,

        Vijay

        • KevnJr
          KevnJr last edited by

          I have the same issue. I have stopped using tags because of all the irrelevant links they cause. Looking forward to reading the comments on this thread.

          KJr

          • linkjuiced
            linkjuiced @Vijay-Gaur last edited by

            Thanks, Vijay.

            Did a lot of work fixing links in the database.

            The issue was occurring even before implementation of WP Super Cache, and before the link fixing.

            Being new-ish to WP, it seems strange to me that it's so willing to provide access via directories that don't really exist: categories, tags, even search, if using a theme-provided site search.

            I'm getting better at .htaccess, so I'm able to handle a lot of the old incoming links fairly well. In the case of these weird 'in the mind of the spiders' links, I will try to address these as well.

            Thanks for your advice about 404 and 301 plugins. Time to look around and see what other useful tools are out there.

            • linkjuiced
              linkjuiced @KevnJr last edited by

              Thanks, Kevin.

              Glad I'm not the only one.

              Disabling tags and categories isn't an option in my case. Guess I need to look at more of the potential upside. It seems tags and categories, if handled correctly, could provide a new way to engage visitors and search engines.

              I've heard people refer to 'spidering budgets' or whatnot. I guess that's an entirely new topic of discussion: whether limiting the spurious requests from good spiders means those spiders will spend more time on the conventional pathways of a site.
