Hi, the Internet Shinchakubin robot is hitting one of my sites every day and using up a lot of my bandwidth. I've tried various things in robots.txt to block it but haven't had any luck. My robots.txt looks like this:

User-agent: shinchakubin
Disallow: /

User-agent: myweb
Disallow: /

User-agent: sharp-info-agent
Disallow: /

User-agent: *
Disallow: /faq.php

I tried the first three in various combinations, but no luck. Does anyone have any suggestions or rules that have worked?
There is information about this at http://www.robotstxt.org/wc/active/html/myweb.html. I hope it helps.
Thanks, I've seen that page and I used the exclusion rule from that site, but no luck. Either I'm doing it wrong or the bot is ignoring it.
Have you checked your logs? What is the exact name of this bot? You must write the exact name of the bot in your robots.txt.
ichiro/2.0 (http://help.goo.ne.jp/door/crawler.html) is the name my logs show for this robot.
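One quick way to see why the rules from the first post would not touch this bot is to run them through Python's standard `urllib.robotparser` with the user-agent string from the logs. This is only a sketch: it shows how one parser matches rules, and a real crawler may match differently or ignore robots.txt altogether. The `ichiro` token in the second rule set is an assumption based on the logged name, and `is_allowed` is a hypothetical helper.

```python
from urllib.robotparser import RobotFileParser

# User-agent string exactly as it appears in the access log.
LOGGED_AGENT = "ichiro/2.0 (http://help.goo.ne.jp/door/crawler.html)"

# The rules from the first post.
ORIGINAL_RULES = """\
User-agent: shinchakubin
Disallow: /

User-agent: myweb
Disallow: /

User-agent: sharp-info-agent
Disallow: /

User-agent: *
Disallow: /faq.php
"""

# Same rules plus a group for the name seen in the logs (assumed token).
WITH_ICHIRO = "User-agent: ichiro\nDisallow: /\n\n" + ORIGINAL_RULES


def is_allowed(rules: str, agent: str, path: str) -> bool:
    """Return True if this parser would let `agent` fetch `path` under `rules`."""
    parser = RobotFileParser()
    parser.parse(rules.splitlines())
    return parser.can_fetch(agent, path)


if __name__ == "__main__":
    for label, rules in (("original", ORIGINAL_RULES), ("with ichiro", WITH_ICHIRO)):
        print(label, is_allowed(rules, LOGGED_AGENT, "/index.html"))
```

Under this parser, none of the three named groups match the logged agent, so only the `*` group applies and everything except /faq.php stays allowed. A group naming the bot as it appears in the logs is what changes the result.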
So I've searched for ichiro/2.0, and it seems a lot of people are having the same problem with this bot eating their bandwidth, but I have yet to find a way to stop it. At least according to AWStats, the bot doesn't appear to be requesting my robots.txt file, so I doubt that changing the file will stop it. Any ideas?
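AWStats summaries can hide individual requests, so it may be easier to check the raw access log directly for whether the bot ever asked for robots.txt and how much it downloaded. A rough sketch, assuming the server writes the Apache "combined" log format. The sample lines (documentation-range IPs, invented timestamps and sizes, a made-up `ExampleBot`) and the `summarize` helper are hypothetical and not taken from the poster's logs.

```python
import re

# Apache "combined" format:
# ip ident user [time] "request" status bytes "referer" "agent"
LINE_RE = re.compile(
    r'^(?P<ip>\S+) \S+ \S+ \[[^\]]*\] "(?P<method>\S+) (?P<path>\S+)[^"]*" '
    r'(?P<status>\d{3}) (?P<bytes>\d+|-) "[^"]*" "(?P<agent>[^"]*)"'
)

# Invented sample lines, for illustration only.
SAMPLE = [
    '203.0.113.7 - - [10/Oct/2006:13:55:36 +0900] "GET /index.php HTTP/1.1" 200 20480 "-" "ichiro/2.0 (http://help.goo.ne.jp/door/crawler.html)"',
    '203.0.113.7 - - [10/Oct/2006:13:55:40 +0900] "GET /faq.php HTTP/1.1" 200 10240 "-" "ichiro/2.0 (http://help.goo.ne.jp/door/crawler.html)"',
    '198.51.100.4 - - [10/Oct/2006:13:56:01 +0900] "GET /robots.txt HTTP/1.1" 200 120 "-" "ExampleBot/1.0"',
]


def summarize(lines, agent_substring):
    """Return (hits, bytes_sent, fetched_robots_txt) for matching user agents."""
    hits = 0
    total_bytes = 0
    fetched_robots = False
    needle = agent_substring.lower()
    for line in lines:
        m = LINE_RE.match(line)
        if not m or needle not in m["agent"].lower():
            continue  # unparseable line or a different client
        hits += 1
        if m["bytes"] != "-":
            total_bytes += int(m["bytes"])
        # Ignore any query string when checking for robots.txt.
        if m["path"].split("?")[0] == "/robots.txt":
            fetched_robots = True
    return hits, total_bytes, fetched_robots


if __name__ == "__main__":
    print(summarize(SAMPLE, "ichiro"))
```

To use it on a real log, pass an open file instead of `SAMPLE`, for example `summarize(open("access.log"), "ichiro")`, where the file name is whatever your host calls the raw access log.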
I just started noticing it too. http://www.robotslookup.com/Robot/myweb.asp has info on the bot and blocking it.
Let me know if you've had any luck stopping them with robots.txt, because from AWStats it looks like they don't even request my robots.txt file. I blocked an IP from Japan that was showing up as the ichiro/2.0 spider in my raw access logs, and that stopped it. However, AWStats still shows Shinchakubin hitting my site. I'm not sure whether they are the same bot.
How did you identify this as the "Shinchakubin robot"? The IP should show up in the same log file. If not, look through your other log files. If you use cPanel, you will find the IP they are using in the "Latest Visitors" or "Raw Access" (if enabled) logs.
Look for the "Latest Visitors" icon/link in cPanel (in the Logs section). It will show the IP, as long as they were a recent visitor.
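If the raw access log can be downloaded, the IPs behind a given bot name can also be pulled out with a short script instead of being read off the cPanel page. Again only a sketch: it assumes a log format where the client IP is the first field and the user agent is the last quoted field (as in Apache's "combined" format). The sample lines and the `ips_for_agent` helper are invented for illustration.

```python
from collections import Counter

# Invented sample lines using documentation-range IPs.
SAMPLE_LOG = [
    '203.0.113.7 - - [10/Oct/2006:13:55:36 +0900] "GET / HTTP/1.1" 200 512 "-" "ichiro/2.0 (http://help.goo.ne.jp/door/crawler.html)"',
    '203.0.113.7 - - [10/Oct/2006:13:55:37 +0900] "GET /a.php HTTP/1.1" 200 512 "-" "ichiro/2.0 (http://help.goo.ne.jp/door/crawler.html)"',
    '203.0.113.9 - - [10/Oct/2006:13:55:38 +0900] "GET /b.php HTTP/1.1" 200 512 "-" "ichiro/2.0 (http://help.goo.ne.jp/door/crawler.html)"',
    '198.51.100.4 - - [10/Oct/2006:13:56:01 +0900] "GET / HTTP/1.1" 200 512 "-" "ExampleBrowser/1.0"',
]


def ips_for_agent(lines, agent_substring):
    """Count requests per client IP where the user-agent field contains agent_substring."""
    needle = agent_substring.lower()
    counts = Counter()
    for line in lines:
        # For a line ending in "agent", this yields [prefix, agent, ''].
        parts = line.rstrip().rsplit('"', 2)
        if len(parts) != 3 or needle not in parts[1].lower():
            continue
        ip = line.split(" ", 1)[0]  # client IP is the first field
        counts[ip] += 1
    return counts


if __name__ == "__main__":
    for ip, hits in ips_for_agent(SAMPLE_LOG, "ichiro").most_common():
        print(ip, hits)
```

Running it once with "ichiro" and once with "shinchakubin" against the same log is one way to check whether the two names come from the same addresses.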