The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Technical SEO Issues
    4. How to identify orphan pages?

    How to identify orphan pages?

    Technical SEO Issues
    4 4 13.6k
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • MarieHaynes
      MarieHaynes last edited by

      I've read that you can use Screaming Frog to identify orphan pages on your site, but I can't figure out how to do it.  Can anyone help?

      I know that Xenu Link Sleuth works but I'm on a Mac so that's not an option for me.

      Or are there other ways to identify orphan pages?

      1 Reply Last reply Reply Quote 0
      • AgentsofValue
        AgentsofValue last edited by

        Well, because they are 'orphans', you probably can't find them using a spider tool!  I'd recommend the following process to find your orphan pages:

        1.  get a list of all the pages created by your CMS

        2.  get the list of all the pages found by Screaming Frog

        3.  add the two url lists into Excel and find the URLs in your CMS that are not in the Screaming Frog list.

        You could probably use an Excel trick like this one:

        http://superuser.com/questions/289650/how-to-compare-two-columns-and-find-differences-in-excel

        1 Reply Last reply Reply Quote 1
        • Cyrus-Shepard
          Cyrus-Shepard last edited by

          Hi Marie!

          Sadly, I don't use Xenu anymore either. Most of the solutions to find orphaned pages are either hit-and-miss manual methods (search OSE, search your server files). Or you could use a method like Agents of Value describes here.

          Couple of posts that may help:

          1. Find Orphaned Pages From Your Sitemap.xml File with Excel and IIS Toolkit

          Requires IIS toolkit, which unless your installing on an external machine, isn't mac friendly

          2. 4 Tips for Technical SEO

          Ian has some great tips here, including:

          • Search the server log files for every unique URL loaded over a 6-month period. Compare that to all unique URLs found in a site crawl. People have a funny way of stumbling into pages you’ve accidentally blocked or orphaned. Chances are, blocked pages will show up in your log file, even if they’re blocked.
          • Do a database export. If you’re using WordPress or another content management system, you can export a full list of every page/post on the site, as well as the URL generated. Then compare that to a site crawl.
          • Run two crawls of your site using your favorite crawler. Do the first one with the default settings. Then do a second with the crawler set to ignore robots.txt and nofollow. If the second crawl has more URLs than the first, and you want 100% of your site indexed, then check your robots.txt and look for meta ROBOTS issues.

          3. Supposedly, Webseo has an automated option to find orphaned files, but I haven't used it nor can I vouch for it:http://www.webseo.com/

          Hope this helps! Let us know what works. 🙂

          1 Reply Last reply Reply Quote 2
          • Fr3sh3gg
            Fr3sh3gg last edited by

            DeepCrawl.co.uk is another great resource here.  This tool gives a full list of URLs, including number of internal links to each page.  Filter this list by "No. links in" = 0, and this will give you a good list of orphaned pages.

            Cheers,
            Mike | Fresh Egg Australia

            1 Reply Last reply Reply Quote 0
            • 1 / 1
            • First post
              Last post
            • Very wierd pages. 2900 403 errors in page crawl for a site that only has 140 pages.
              H.M.N.
              H.M.N.
              0
              6
              64

            • Moving Some Content From Page A to Page B
              khi5
              khi5
              0
              10
              66

            • If the order of products on a page changes each time the page is loaded, does this have a negative effect on the SEO of those pages?
              Kurt_Steinbrueck
              Kurt_Steinbrueck
              2
              4
              119

            • I need help compiling solid documentation and data (if possible) that having tons of orphaned pages is bad for SEO - Can you help?
              danatanseo
              danatanseo
              0
              15
              726

            • 2 links on home page to each category page ..... is page rank being watered down?
              QubaSEO
              QubaSEO
              0
              6
              436

            • I am trying to correct error report of duplicate page content. However I am unable to find in over 100 blogs the page which contains similar content to the page SEOmoz reported as having similar content is my only option to just dlete the blog page?
              evolvingSEO
              evolvingSEO
              0
              4
              344

            • Page rank 2 for home page, 3 for service pages
              Alex-Harford
              Alex-Harford
              0
              8
              498

            • Discrepency between # of pages and # of pages indexed
              Dan-Petrovic
              Dan-Petrovic
              0
              14
              990

            Get started with Moz Pro!

            Unlock the power of advanced SEO tools and data-driven insights.

            Start my free trial
            Products
            • Moz Pro
            • Moz Local
            • Moz API
            • Moz Data
            • STAT
            • Product Updates
            Moz Solutions
            • SMB Solutions
            • Agency Solutions
            • Enterprise Solutions
            • Digital Marketers
            Free SEO Tools
            • Domain Authority Checker
            • Link Explorer
            • Keyword Explorer
            • Competitive Research
            • Brand Authority Checker
            • Local Citation Checker
            • MozBar Extension
            • MozCast
            Resources
            • Blog
            • SEO Learning Center
            • Help Hub
            • Beginner's Guide to SEO
            • How-to Guides
            • Moz Academy
            • API Docs
            About Moz
            • About
            • Team
            • Careers
            • Contact
            Why Moz
            • Case Studies
            • Testimonials
            Get Involved
            • Become an Affiliate
            • MozCon
            • Webinars
            • Practical Marketer Series
            • MozPod
            Connect with us

            Contact the Help team

            Join our newsletter
            Moz logo
            © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
            • Accessibility
            • Terms of Use
            • Privacy