Here's a script from !Ask! that'll stop scrapers like bhomiyo.com from grabbing your content. http://forums.maccvs.org/phpld-scraper-blocker-t12019.html http://bhomiyo.com/en.xliterate/ask-dir.com If you have questions about the script, you can post them there as well.
Thanks for this script, CReed. This scraper should be dealt with accordingly; it is hurting all directories in the market.
Thanks for the script, CReed. You need to register in the forum to get it. Why not just block the IP of that proxy site?
Thanks for the heads up. Good work by !Ask! Will apply that in the directory. Man, that scraper bhomiyo is still at the top for the Aviva brand name keyword!
Am I right in thinking that this is a proxy, and not a scraper? If this is the case, it's important not to link to it at all.
Obelia, that is correct, however certain people are using this proxy caching as a way to hack Google. Please see this article for details.
I read that article, and this is what I want to highlight from it: As far as I can see, the proxies aren't doing any scraping or caching, which is why the page disappears immediately when you implement an .htaccess ban on their IP. If this doesn't happen, you know you have a scraper problem and not just a proxy. The practical implication is that banning bots won't make any difference to the proxy problem (although that shouldn't stop you from banning any unwanted bots which suck up your bandwidth). You just have to ban by IP. And linking to these proxy sites is going to mess up some people's SERPs.
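For anyone who wants to see what "ban by IP" amounts to, here is a rough Python sketch, not the !Ask! script and not anything taken from the article. It checks a visitor's address against a blocklist and prints the matching deny lines in the older Apache 2.2 .htaccess style. The names `BANNED_NETWORKS`, `is_banned` and `htaccess_rules` are made up for illustration, and the addresses are reserved documentation ranges, not the real IPs of any proxy site; you would have to pull the actual addresses from your own server logs.

```python
import ipaddress

# Hypothetical blocklist. 203.0.113.0/24 and 198.51.100.7 are reserved
# documentation addresses, NOT the real IPs of any proxy site.
BANNED_NETWORKS = ["203.0.113.0/24", "198.51.100.7"]


def is_banned(client_ip, banned=BANNED_NETWORKS):
    """Return True if client_ip falls inside any banned address or range."""
    addr = ipaddress.ip_address(client_ip)
    return any(addr in ipaddress.ip_network(net) for net in banned)


def htaccess_rules(banned=BANNED_NETWORKS):
    """Render the blocklist as Apache 2.2-style deny directives."""
    lines = ["Order Allow,Deny", "Allow from all"]
    lines += ["Deny from " + net for net in banned]
    return "\n".join(lines)


if __name__ == "__main__":
    # Check one address inside the banned range, then show the rules.
    print(is_banned("203.0.113.45"))
    print(htaccess_rules())
```

Note that the check looks only at the address and never at the user agent, which is the point made above: a bot ban by name does nothing against a proxy that fetches your page live. On Apache 2.4 the equivalent rules use `Require` directives instead of `Order`/`Deny`.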
Here's a relevant thread to help some people with the proxy problem: http://forums.digitalpoint.com/showthread.php?p=4698107