Disallow

Discussion in 'robots.txt' started by frio, Dec 20, 2007.

  1. #1
    Hi Guys,

    How can I disallow this link from Yahoo?
    blogs/join.aspx?ReturnUrl=/members/

    I have this on my robots.txt file

    Disallow: blogs/join.aspx

    but yahoo is still crawling this blogs/join.aspx?ReturnUrl=/members/

    can someone help me out?

    Thanks,


    Frio
     
    frio, Dec 20, 2007 IP
  2. prlinker

    prlinker Peon

    Messages:
    18
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #2
    google it you will fid the string which you need to specify

    the method is correct
     
    prlinker, Dec 24, 2007 IP
  3. Ladadadada

    Ladadadada Peon

    Messages:
    382
    Likes Received:
    36
    Best Answers:
    0
    Trophy Points:
    0
    #3
    How long was the gap between updating your robots.txt file and Yahoo crawling the page ?

    In my experience, Yahoo has one of the highest page -> robots.txt request ratios I have seen. The only one I have seen higher was Baidu. This means that it might be a day or two before it notices that your robots.txt has changed.

    Google have multiple bots and they don't appear to share information. This means that even though one of the bots has seen your new robots.txt, the others may not have and will keep crawling. Yahoo may also do the same thing but I haven't actually observed this behaviour.
     
    Ladadadada, Jan 6, 2008 IP
  4. SwapsRulez

    SwapsRulez Peon

    Messages:
    32
    Likes Received:
    1
    Best Answers:
    0
    Trophy Points:
    0
    #4
    Try writing the below code...

    User-agent: yahoo
    Disallow: /blogs/join.aspx
    Code (markup):

    I think you have missed the forward slash(/) sign which indicates the root.
    Cheers mate... Peace!
     
    SwapsRulez, Jan 12, 2008 IP
  5. Angelys

    Angelys Peon

    Messages:
    21
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #5
    +1 SwapsRulez
     
    Angelys, Jan 19, 2008 IP
  6. Bohol

    Bohol Peon

    Messages:
    2,680
    Likes Received:
    75
    Best Answers:
    0
    Trophy Points:
    0
    #6
    What I heard from an SEO guru is that Yahoo may not obey robots.txt in your site. But, Google, yes, it does.
     
    Bohol, Jan 20, 2008 IP
  7. SwapsRulez

    SwapsRulez Peon

    Messages:
    32
    Likes Received:
    1
    Best Answers:
    0
    Trophy Points:
    0
    #7

    its not true.... but in some cases.... the advanced syntax like using * or $.... will be the real pain for the yahoo slurp to understand the meaning of them.... instead the google & MSN bots understands them easily... :D
     
    SwapsRulez, Jan 21, 2008 IP
  8. catanich

    catanich Peon

    Messages:
    1,921
    Likes Received:
    40
    Best Answers:
    0
    Trophy Points:
    0
    #8
    This should work

    User-agent: yahoo
    Disallow: /blogs/join.aspx
    Disallow: /blogs/join.aspx?ReturnUrl=/members/​

    But, the problem you are having will be the same for all SEs... I would block all of them with:

    User-agent: *
    Disallow: /blogs/join.aspx​
     
    catanich, Feb 2, 2008 IP