I've recently banned two robots for failing to respect my robots.txt file, and both were Chinese. Here are my PF rules to ban the bots:

    # Block the sogou spider
    block in on bge0 from 220.181.19.184 to any
    # Block the qihoo bot
    block in on bge0 from 220.181.34.0/24 to any

I'm not sure why Chinese bots would be particularly misbehaved?
They were ignoring your robots.txt, so you decided to ban them using the same robots.txt? Well, only a few decent robots obey robots.txt; you can't expect the spammers to obey your instructions. It would be better to ban them using .htaccess, or with the IP Deny Manager if you are using cPanel.
Yahoo seems to follow the instructions in my robots.txt file, but if it doesn't then it gets put into the PF rules too!
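If the list of banned bots keeps growing, one possible way to keep it manageable is a PF table instead of one block rule per bot. This is only a sketch: the table name badbots is made up for illustration, the interface bge0 and the two addresses are the ones from the rules earlier in the thread, and the quick keyword is an addition that makes PF stop evaluating at this rule rather than letting a later pass rule override it.

```
# Hypothetical table name; addresses are the sogou and qihoo ranges from the thread
table <badbots> persist { 220.181.19.184, 220.181.34.0/24 }

# Drop inbound traffic from anything in the table.
# "quick" stops rule evaluation here, so a later pass rule cannot re-allow it.
block in quick on bge0 from <badbots> to any
```

With a table in place, a newly misbehaving bot could be added at runtime with something like pfctl -t badbots -T add 192.0.2.10 (a placeholder address, not a real bot), without reloading the whole ruleset.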