How Does Google Treat Scapper Sites?
-
I have seen several sites that have data or some type of information about a website. Often the site appears to be trying show the value of a site. How does Google treat the links this scrapper type sites give? Should these types of links be ignored or should they be disavowed?
Here is an example: http://www.sitetracer.com/www.tourexperience.com
-
If you're in the process of disavowing URLs and you would come across this one I would definitely put them on the list to make sure it won't get you intro trouble later on. However if you're not having any issues I would definitely ignore the URL as I'm pretty sure that it won't do any damage as Google should be perfectly fine to figure out it's a terrible site.
-
Ok so John Mueller at Google has said not to worry about sites like this, Google bot is clever enough to figure these types of sites out. To add to that this site links to all external sites with a nofollow link, so disavowing would do nothing anyway.
However there instances where it might be worthwhile adding it to a disavow list and that would be if you believe this site might changes those links to dofollow at a later stage for some reason (highly unlikely).
Also if it were a similar site but links were dofollow, it would not hurt to add it considering you have found it and deem it to be of no value.
However as John has said in the past, if you did not create it don't worry about it unless you really need to disassociate yourself from it.
-
One should never rely on Googlebot for anything - we've all seen it change title tags, change meta descriptions, disobey rel=canonical, etc.
-
If you're worried about scraper sites--at Moz, there are a ton of sites scraping our blog content, for example--I strongly recommend implementing the rel=canonical tag on your site. Often scrapers grab the entire HTML of your page, and thus, they will grab rel=canonical, which will signal back to search engines that your content is the original content.