Does robots.txt work for all search engines?

Discussion in 'robots.txt' started by Markas, Sep 29, 2007.

  1. #1
    Does robots.txt work for all search engines?
    Simple question I hope
     
    Markas, Sep 29, 2007 IP
  2. Agent_Dweeb

    Agent_Dweeb Peon

    Messages:
    5,607
    Likes Received:
    384
    Best Answers:
    0
    Trophy Points:
    0
    #2
    Yes it works for all search engines, you can indicate which search engines youcan restrict I think
     
    Agent_Dweeb, Sep 29, 2007 IP
  3. evera

    evera Peon

    Messages:
    283
    Likes Received:
    8
    Best Answers:
    0
    Trophy Points:
    0
    #3
    It should work for all search engines, but some simply ignore the robots.txt
     
    evera, Sep 29, 2007 IP
  4. strivinglife

    strivinglife Peon

    Messages:
    250
    Likes Received:
    5
    Best Answers:
    0
    Trophy Points:
    0
    #4
    evera is correct - a deal of rogue engines will simply ignore the robots file. However, assuming the robots file is valid (written correctly), the major bots will follow it.

    Based on your question, you may want to give Google's Webmaster Central area a look; Google has a pretty good robots.txt 'tester' as a part of their Webmaster tools.

    Note that some bots will specifically crawl directories you deny, and many will cache the robots file for a period of time (which is generally a good thing).
     
    strivinglife, Sep 30, 2007 IP
  5. LeoNel

    LeoNel Member

    Messages:
    52
    Likes Received:
    1
    Best Answers:
    0
    Trophy Points:
    45
    #5
    Google, Yahoo and Microsoft have agreed to support the Sitemaps protocol. See http://sitemaps.org/ for details on how the use of robots.txt is sufficient for informing a search engine of the presence of your sitemap. That said, I've found it better to use the management interface for each search engine provider to test your robots.txt file and also improve on general site submission issues.

    For Google use: http://www.google.com/webmasters/

    For Yahoo use: https://siteexplorer.search.yahoo.com/

    For Microsoft use: http://blogs.msdn.com/livesearch/ar...al-and-an-invitation-to-the-private-beta.aspx
     
    LeoNel, Oct 1, 2007 IP
  6. harrysmith

    harrysmith Well-Known Member

    Messages:
    466
    Likes Received:
    7
    Best Answers:
    0
    Trophy Points:
    110
    #6
    I think not all SE bots respect robots.txt file so better password protect directories which you don't want to get crawl.
     
    harrysmith, Oct 2, 2007 IP