Which crawler/tool gets the tf–idf for a given number of URLs?
-
I read a really nice blog post this weekend about tf-idf. I wonder if there's a tool/crawler out there that can get me this data for any given number of URLs. Does anybody have any experience in this field?
-
I don't have an answer to your question (sorry), but do you mind posting the link to that tf-idf blog post? I'm sure I'm not the only one who'd like to read it. Thanks!
-
I'm not aware of a tool that will spit this out for a range of pages. Most people calculating this are programming at least a portion of it themselves. There appear to be a few GitHub resources, but again, I think you'll need to do some of the heavy lifting yourself.
As Ian hinted at in the post, it's simpler to just spit out the top words or 2-3 word phrases per page.
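For anyone who wants to try the do-it-yourself route, here is a minimal sketch of the calculation in Python using only the standard library. It assumes you have already fetched each URL and stripped the HTML down to plain text; the function names (`tokenize`, `tf_idf`, `top_terms`), the example.com URLs, and the sample text are all made up for illustration. It uses the plain textbook weighting (term frequency times log of inverse document frequency), which may differ from what any particular SEO tool does.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def tf_idf(pages):
    """Take a dict of URL -> plain text and return URL -> {term: score}."""
    counts = {url: Counter(tokenize(text)) for url, text in pages.items()}
    n_docs = len(counts)

    # Document frequency: in how many pages does each term appear?
    df = Counter()
    for page_counts in counts.values():
        df.update(page_counts.keys())

    scores = {}
    for url, page_counts in counts.items():
        total = sum(page_counts.values())
        scores[url] = {
            # tf = share of the page's words; idf = log(N / df)
            term: (n / total) * math.log(n_docs / df[term])
            for term, n in page_counts.items()
        }
    return scores


def top_terms(scores, url, k=5):
    """Return the k highest-scoring (term, score) pairs for one URL."""
    return sorted(scores[url].items(), key=lambda kv: kv[1], reverse=True)[:k]


# Hypothetical input: in practice this text would come from your own crawl.
example_pages = {
    "https://example.com/a": "apple pie recipe with fresh apple slices",
    "https://example.com/b": "banana bread recipe with ripe banana",
}
example_scores = tf_idf(example_pages)
print(top_terms(example_scores, "https://example.com/a", k=3))
```

Note that a term appearing on every page gets a score of zero with this weighting, so the scores only become meaningful once you compare a reasonable number of pages.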
-
There is a tool called SEOlyze (https://www.seolyze.com) which provides exactly what you are looking for.
The analyses in SEOlyze use the TF-IDF principle to crawl and analyze content.
You can run analyses based on the top Google results for your specific keyword, or enter your own URLs to be analyzed. For testing purposes, you can register for a 30-day free trial.
-
Hi Phil,
Welcome to Moz, and thanks for this comment! We do ask that you disclose any affiliation you have with products that you recommend. It appears that you are or were the CEO of SEOlyze, per http://www.pactas.com/assets/case-study/seolyze-cs-en.pdf.
Please feel free to reach out if you have any questions regarding Q&A or Moz in general. Thanks!
Keri