How to block all bots except Google, MSN, Yahoo


danieloffice
Jan 9th 2008, 1:17 am
Hi,

I need help.

In robots.txt, how do I exclude all bots except Google, MSN, and Yahoo?


Thanks

SwapsRulez
Jan 12th 2008, 11:57 pm
You have to allow Google, MSN, and Yahoo to crawl the whole site and disallow all other bots, starting from the root.

Here is the sample code:

User-agent: google
Disallow:

User-agent: yahoo
Disallow:

User-agent: msn
Disallow:

User-agent: *
Disallow: /


The above code matches the user agent by checking for substrings of each robot's name, so these rules should work fine. Peace!
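
One way to sanity-check rules like these is to feed them to a robots.txt parser. The sketch below uses Python's standard `urllib.robotparser`, which happens to do a case-insensitive substring match on the agent name. Real crawlers are not guaranteed to match the same way, so treat the result as a hint only. The `allowed` helper and the `SomeOtherBot` agent name are made up for illustration.

```python
from urllib.robotparser import RobotFileParser

# The rules from the post above, copied as-is.
ROBOTS_TXT = """\
User-agent: google
Disallow:

User-agent: yahoo
Disallow:

User-agent: msn
Disallow:

User-agent: *
Disallow: /
"""


def allowed(agent: str, path: str = "/") -> bool:
    """Return True if this parser lets `agent` fetch `path` (hypothetical helper)."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(agent, path)


if __name__ == "__main__":
    # "SomeOtherBot" is an invented name standing in for any unlisted crawler.
    for agent in ("Googlebot", "msnbot", "Slurp", "SomeOtherBot"):
        print(agent, "allowed" if allowed(agent) else "blocked")
```

With this particular parser, `Googlebot` and `msnbot` are allowed and `SomeOtherBot` is blocked, but `Slurp` (the name Yahoo's crawler goes by) is blocked too, because it does not contain `yahoo`. So the substring idea only helps when the short name really is part of the crawler's own name.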

sajidmm
Mar 5th 2008, 5:05 pm
That info is wrong.

I know about Google. It should be:

User-agent: googlebot
Disallow:

I am not sure about the Yahoo and MSN bots.

lhughes33309
Mar 21st 2008, 6:47 pm
Hi,

The only problem with relying on the robots.txt file is that many bots don't follow the rules and are there simply to scrape the site. You need a script, for example in Perl, that serves your HTML etc. to regular users and to MSN, Google, and Yahoo, but traps the other bots and either gives them a 500 server error or redirects them somewhere else.
I have written many scripts to do just that.

Thanks,
lhughes33309
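
The server-side approach described above could look roughly like the sketch below, written in Python as a WSGI middleware rather than Perl. It is not lhughes33309's script: the token lists, the `classify` and `bot_filter` names, and the choice to spot bots by User-Agent substrings are all assumptions for illustration. Keep in mind that the User-Agent header is trivial to fake, so this only stops bots that identify themselves.

```python
# Assumed crawler name fragments for the three search engines in this thread.
ALLOWED_TOKENS = ("googlebot", "msnbot", "slurp")
# Hypothetical heuristic: substrings that often appear in bot User-Agents.
BOT_HINTS = ("bot", "crawler", "spider", "scraper")


def classify(user_agent: str) -> str:
    """Sort a User-Agent string into 'allowed_bot', 'blocked_bot' or 'visitor'."""
    ua = user_agent.lower()
    # Check the allow list first, since "googlebot" also contains "bot".
    if any(token in ua for token in ALLOWED_TOKENS):
        return "allowed_bot"
    if any(hint in ua for hint in BOT_HINTS):
        return "blocked_bot"
    return "visitor"


def bot_filter(app):
    """Wrap a WSGI app so that unlisted bots get a 500 instead of the page."""
    def middleware(environ, start_response):
        user_agent = environ.get("HTTP_USER_AGENT", "")
        if classify(user_agent) == "blocked_bot":
            # The post mentions a 500 error or a redirect; this sketch uses the 500.
            start_response("500 Internal Server Error",
                           [("Content-Type", "text/plain")])
            return [b"Blocked\n"]
        # Regular visitors and the three allowed crawlers get the normal site.
        return app(environ, start_response)
    return middleware
```

To try it, wrap any WSGI application, e.g. `application = bot_filter(application)`. A redirect instead of the 500 would only need a different status line plus a `Location` header.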

manish.chauhan
Apr 6th 2008, 11:25 pm
Check this. It will help with a basic understanding of robots.txt:

http://seocrazy.blogspot.com/2008/04/robotstxt-stop.html

al-zabir
Jul 11th 2008, 11:56 pm
I think this will help all: :)

User-agent: *
Disallow: /

User-agent: Googlebot
Allow: /

User-agent: Yahoo-slurp
Disallow:

User-agent: Msnbot
Disallow:

rozane
Sep 11th 2008, 5:38 am
It should be:
User-agent: Googlebot
Disallow:


