What's the best tool(s) to find/detect content scrapers?
-
We want to be prepared against content scrapers as we'll be launching webpages that will show content/data of our own.
-
I use Tynt. It doesn't stop content scraping per se but it adds a link back to my site for everything that is copied and pasted (if they don't remove it)
What is more useful though is I get a weekly report of everything that has been copied and that helps to identify which of my content is seen as valuable and helps me proactively approach people who may have copied it.
You may also want to look at your feed and disable or find some other way of protecting that. Using wordpress it is relatively easy to create a new site composed of content published in other people's RSS feeds
-
I know some people use http://www.copyscape.com/