I intend to create a robot.txt file only allowing the usefull bots or user agents and disallowing all others. Is this recomended? Will it harm me in any way?? Please do comment also help me in listing the useful bots which I should allow. For example Googlebot, Yahoo Slurp, Alexa (IA Archiver), MSNBot, Ask, MSNBot-media, The web archive (IA Archiver), Google Sitemaps, Netcraft. These are the genuine bots I came across in my site logs. Any more to include in the list. BTW then how do I specify them in the robot.txt file.
lease do comment also help me in listing the useful bots which I should allow. For example Googlebot, Yahoo Slurp, Alexa (IA Arc