Help needed - ban robots from crawling redirect script

Discussion in 'robots.txt' started by bryanw, May 7, 2008.

  1. #1
    Hello,

    I have a redirect script /go.php on my site.

    It has a lot of different parameters.

    For example,
    /go.php?site=google
    /go.php?site=yahoo
    ...

    In the robots.txt, I write as following,
    User-agent: *
    Disallow: /go.php*

    It seems Google has been banned from go.php,
    But Msnbot still crawls go.php?site=google etc

    Is there methods to ban msnbot?

    Thanks a lot!

    Bryan
     
    bryanw, May 7, 2008 IP
  2. Trusted Writer

    Trusted Writer Banned

    Messages:
    1,370
    Likes Received:
    52
    Best Answers:
    0
    Trophy Points:
    160
    #2
    Add to your robots.txt the following line

    
    User-agent: MSNBot
    Disallow: / 
    
    Code (markup):
     
    Trusted Writer, May 7, 2008 IP
  3. rsrikanth05

    rsrikanth05 Well-Known Member

    Messages:
    1,362
    Likes Received:
    25
    Best Answers:
    0
    Trophy Points:
    190
    #3
    Copy the same thing and replace googlebot with msnbot?
     
    rsrikanth05, May 7, 2008 IP
  4. bryanw

    bryanw Peon

    Messages:
    197
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #4
    Thanks for your reply.

    I don't want to block msnbot from index the whole site, and I want to block /go.php?google etc.
    This is the whole robots.txt
    I didn't do anything special for googlebot.
     
    bryanw, May 7, 2008 IP
  5. Trusted Writer

    Trusted Writer Banned

    Messages:
    1,370
    Likes Received:
    52
    Best Answers:
    0
    Trophy Points:
    160
    #5
    I'm sorry, I should post this

    
    User-agent: *
    Disallow: /go.php* 
    
    User-agent: MSNBot
    Disallow: /go.php* 
    
    Code (markup):
    The implicit name of that crawler ensures the wildcard works for MSNbot. There is a workaround with mod_rewrite but I'm not sure about it after reading there are new robots.txt rules for MSN since the introduction of live.com
     
    Trusted Writer, May 7, 2008 IP
  6. bryanw

    bryanw Peon

    Messages:
    197
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #6
    Thank you Trusted Writer.

    I will give a try.
     
    bryanw, May 7, 2008 IP
  7. Trusted Writer

    Trusted Writer Banned

    Messages:
    1,370
    Likes Received:
    52
    Best Answers:
    0
    Trophy Points:
    160
    #7
    No problem, hope this helps ;)
     
    Trusted Writer, May 7, 2008 IP
  8. bryanw

    bryanw Peon

    Messages:
    197
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #8
    bryanw, May 7, 2008 IP
  9. bryanw

    bryanw Peon

    Messages:
    197
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #9
    Just used this code,
    Will watch the result.
     
    bryanw, May 7, 2008 IP
  10. Trusted Writer

    Trusted Writer Banned

    Messages:
    1,370
    Likes Received:
    52
    Best Answers:
    0
    Trophy Points:
    160
    #10
    Thanks a lot for let us know and please update us if this way MSNbot refrain to index your page.
     
    Trusted Writer, May 7, 2008 IP