My site keeps getting pounded everynite by some kind of bot thats stealing my content ... How can i stop it? So far all i know is that the ip's (different every day) are all from Vietnam? I thought there was a way of blocking this via .htaccess??? please help
sorry - just found the following post that gives me my answers ... http://forums.digitalpoint.com/showthread.php?t=326179#post3082441
lol you can't do anything about it. They can use proxy to access your site or some proxy browser. So, blocking ips will not make good to it. Here is the ans. to your ques. Increase the ranking of your site, let google crawl your page before it crawls stealer's website. Once your articles gets index b4 the stealer's article, after few days google will remove their article which was stolen from your site and will consider it as a duplicate content.
Use robots.txt to prevent access, either by IP address or ban the bot completely. Use copyscape.com to check for duplicate content from your site. Then email the hosting company and they'll take the site down until the content is removed.
I mentioned. Let google crawl your site. Use sitemap and let google check your site map as soon as possible after to post an article.
You can't stop them from stealing your content but those content won't give any help to them. This will only put their site in supplemental result.
You can disable right mouse button but this is primitive and easy to breake. There are some content securing companies, try them
I don't think a scraper will care about your robots file. When you say the IP changes every day, does it change a lot or stay within a certain class c? If the first 2 or 3 blocks stay the same, block the entire IP range in your .htaccess e.g. deny from 234.32.21 will block this entire class c block from 234.32.21.0 to 234.32.21.255 Alternatively you can try to download GeoIP.dat from MaxMind (the free database) and redirect all traffic to check the Geolocation first. If the country comes up to be Vietnam, you can simply block them. Or you could analyze the referrer or user agent if they are unique to this scraper and block those easily in .htaccess. There are many things you can try. Disabling right mouseclick, robots.txt and Copyscape won't prevent bots from stealing your stuff though. Since traffic from Vietnam probably isn't converting in a lot of sales for you, I'd simply block the entire country. Or you could try even funnier things and redirect them somewhere else. If I can tie a bot to a certain site, I usually redirect them to themselves and let them scrape their own stuff. That can look funny on their site if they scrape, post then find the new post, scrape again and post again ..... You can also try to do something on the firewall level. I have a brute force script installed and if a bot gets too nasty it gets blocked at the firewall level thus saving my apache resources.
Try using copyscape.com . Add that banner to your site, use their search, see who is copying your content, and sue them (or threaten to do so). If you have a Copyright mark at your footer of your site I am sure a nice threat will stop them.
Yes, a Bot from china (or the guy who wrote it) are certainly going to care about US copyright laws. Makes perfect sense