The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Moz Tools
    4. A suggestion to help with linkscape crawling and data processing

    A suggestion to help with linkscape crawling and data processing

    Moz Tools
    3 3 762
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • seanmccauley
      seanmccauley last edited by

      Since you guys are understandably struggling with crawling and processing the sheer number of URLs and links, I came up with this idea:

      In a similar way to how SETI@Home (is that still a thing? Google says yes: http://setiathome.ssl.berkeley.edu/) works, could SEOmoz use distributed computing amongst SEO moz users to help with the data processing? Would people be happy to offer up their idle processor time and (optionally) internet connections to get more accurate, broader data?

      Are there enough users of the data to make distributed computing worthwhile?

      Perhaps those who crunched the most data each month could receive moz points or a free month of Pro.

      I have submitted this as a suggestion here:
      http://seomoz.zendesk.com/entries/20458998-crowd-source-linkscape-data-processing-and-crawling-in-a-similar-way-to-seti-home

      1 Reply Last reply Reply Quote 1
      • randfish
        randfish last edited by

        Thanks a ton Sean! We have considered distributed computing as a way to help crawl, index, process, etc. It's so flattering and humbling to hear that you'd be willing to help out and that the community would, too 🙂

        For now, we believe we can get to the index size/quality/freshness using our hosted system, but the engineering team will certainly be encouraged to hear that folks in our community might contribute to this. Distributed systems present their own challenges, and we'd have to write that code from scratch, but if we find that we can't do what we want with our existing network, we might reach out.

        BTW - I wanted to let folks know that the team here does feel very confident that come December/January, we're going to be producing indices that reach exceptional quality bars. The problems we face are largely known, and we now have the team and the solutions to tackle it, so we're pretty excited.

        1 Reply Last reply Reply Quote 2
        • katemats
          katemats last edited by

          Sean - I share Rand' sentiments, thanks so much for the suggestion!

          We have considered distributed crawling in the past (or even distributed rank checking because then it would be in that user's locale) but there are a whole different set of challenges.  For example, you have to handle all the edge cases: what if a user's computer isn't on, or loses connectivity, what if we crawl too fast and the user gets blocked from a site, how do you write all that data securely?

          Of course all of these concerns can be overcome, but right now we feel like we have a good handle on the problems, and it will be much faster for us to just fix what we have 🙂

          Although, I know all of us are so appreciative of the ideas and support, and we will have something really great soon!

          1 Reply Last reply Reply Quote 2
          • 1 / 1
          • First post
            Last post
          • GOOGLE ANALYTIC SKEWED DATA BECAUSE OF GHOST REFERRAL SPAM ND CRAWL BOTS
            solvid
            solvid
            0
            5
            389

          • How do I retrieve crawl and ranking data about a site from the past?
            benjaminspak
            benjaminspak
            0
            3
            242

          • SEOmoz showing crawl errors but webmastertools says no errors, need help!
            bobsnowzell
            bobsnowzell
            0
            5
            399

          • Why does Linkscap API request hang while extracting data ?
            Ravi_Pathak
            Ravi_Pathak
            0
            5
            519

          • Help Understanding Crawl results on this site
            KeriMorgret
            KeriMorgret
            0
            2
            488

          • Crawl Diagnostics finding pages that dont exist. Will Rel Canon Help?
            CompleteOffice
            CompleteOffice
            0
            4
            838

          • What happened to OSE/Linkscape data?
            KeriMorgret
            KeriMorgret
            0
            3
            664

          • Can you help me get started using the crawl diagnostics report?
            KeriMorgret
            KeriMorgret
            0
            4
            1.1k

          Get started with Moz Pro!

          Unlock the power of advanced SEO tools and data-driven insights.

          Start my free trial
          Products
          • Moz Pro
          • Moz Local
          • Moz API
          • Moz Data
          • STAT
          • Product Updates
          Moz Solutions
          • SMB Solutions
          • Agency Solutions
          • Enterprise Solutions
          • Digital Marketers
          Free SEO Tools
          • Domain Authority Checker
          • Link Explorer
          • Keyword Explorer
          • Competitive Research
          • Brand Authority Checker
          • Local Citation Checker
          • MozBar Extension
          • MozCast
          Resources
          • Blog
          • SEO Learning Center
          • Help Hub
          • Beginner's Guide to SEO
          • How-to Guides
          • Moz Academy
          • API Docs
          About Moz
          • About
          • Team
          • Careers
          • Contact
          Why Moz
          • Case Studies
          • Testimonials
          Get Involved
          • Become an Affiliate
          • MozCon
          • Webinars
          • Practical Marketer Series
          • MozPod
          Connect with us

          Contact the Help team

          Join our newsletter
          Moz logo
          © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
          • Accessibility
          • Terms of Use
          • Privacy