The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Moz Tools
    4. Changing the way SEOmoz Detects Duplicate Content

    Changing the way SEOmoz Detects Duplicate Content

    Moz Tools
    2 2 277
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • KeriMorgret
      KeriMorgret last edited by

      Hey everyone,

      I wanted to highlight today's blog post in case you missed it. In short, we're using a different algorithm to detect duplicate pages. http://moz.com/blog/visualizing-duplicate-web-pages

      If you see a change in your crawl results and you haven't done anything, this is probably why. Here's more information taken directly from the post:

      1. Fewer duplicate page errors: a general decrease in the number of reported duplicate page errors. However, it bears pointing out that:

      • **We may still miss some near-duplicates. **Like the current heuristic, only a subset of the near-duplicate pages is reported.
      • **Completely identical pages will still be reported. **Two pages that are completely identical will have the same simhash value, and thus a difference of zero as measured by the simhash heuristic. So, all completely identical pages will still be reported.

      2. Speed, speed, speed: The simhash heuristic detects duplicates and near-duplicates approximately 30 times faster than the legacy fingerprints code. This means that soon, no crawl will spend more than a day working its way through post-crawl processing, which will facilitate significantly faster delivery of results for large crawls.

      1 Reply Last reply Reply Quote 2
      • William.Lau
        William.Lau last edited by

        That is good news. It will ease some minds that are going nuts over the duplicate content reporting. Thanks!

        1 Reply Last reply Reply Quote 0
        • 1 / 1
        • First post
          Last post
        • Why are there significant changes in the amount of duplicate content without any known action?
          allurez
          allurez
          0
          9
          159

        • Since July 1, we've had a HUGE jump in errors on our weekly crawl. We don't think anything has changed on our website. Has MOZ changed something that would account for a large leap in duplicate content and duplicate title errors?
          KristyFord
          KristyFord
          0
          3
          77

        • SEOMOZ Support: Domain Name Change in SEOMOZ
          DynoSaur
          DynoSaur
          0
          3
          627

        • How does SEOmoz pull its duplicate page title and content information?
          GManSEO
          GManSEO
          0
          2
          191

        • SEOmoz crawler and duplicate content
          jeffreytrull1
          jeffreytrull1
          0
          2
          535

        • SEOmoz indicating duplicate page content on one of my campaigns
          ckilgore
          ckilgore
          0
          5
          605

        • Seomoz & Duplicate Page Content Issue?
          stefanok
          stefanok
          0
          4
          540

        Get started with Moz Pro!

        Unlock the power of advanced SEO tools and data-driven insights.

        Start my free trial
        Products
        • Moz Pro
        • Moz Local
        • Moz API
        • Moz Data
        • STAT
        • Product Updates
        Moz Solutions
        • SMB Solutions
        • Agency Solutions
        • Enterprise Solutions
        • Digital Marketers
        Free SEO Tools
        • Domain Authority Checker
        • Link Explorer
        • Keyword Explorer
        • Competitive Research
        • Brand Authority Checker
        • Local Citation Checker
        • MozBar Extension
        • MozCast
        Resources
        • Blog
        • SEO Learning Center
        • Help Hub
        • Beginner's Guide to SEO
        • How-to Guides
        • Moz Academy
        • API Docs
        About Moz
        • About
        • Team
        • Careers
        • Contact
        Why Moz
        • Case Studies
        • Testimonials
        Get Involved
        • Become an Affiliate
        • MozCon
        • Webinars
        • Practical Marketer Series
        • MozPod
        Connect with us

        Contact the Help team

        Join our newsletter
        Moz logo
        © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
        • Accessibility
        • Terms of Use
        • Privacy