Robot.txt?

Discussion in 'robots.txt' started by sudip_dg77, Apr 4, 2008.

  1. #1
    When I check the traffic stats in my domain, I keep seeing this


    /robots.txt
    Http Code: 404 Date: Apr 04 12:49:14 Http Version: HTTP/1.0 Size in Bytes: -
    Referer: -
    Agent: Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1)

    What is this?

    I have checked my domain and there is no such file as robot.txt. Do I need one there? If yes then what should it contain?
     
    sudip_dg77, Apr 4, 2008 IP
  2. jakeruston

    jakeruston Banned

    Messages:
    1,363
    Likes Received:
    89
    Best Answers:
    0
    Trophy Points:
    0
    #2
    Thats just simply Google trying to check to see if you have a Robots.txt ;)
     
    jakeruston, Apr 4, 2008 IP
  3. Corey Bryant

    Corey Bryant Texan at Heart

    Messages:
    1,126
    Likes Received:
    51
    Best Answers:
    0
    Trophy Points:
    0
    #3
    Usually the "good" seach engines will read the robots.txt and see what you do not want searched. The "bad" ones won't though.

    And you should have the robots.txt in there with a link to your XML sitemap since most search engines are using this method to find your XML sitemap.

    You just add something like
    Sitemap: http://www.exmaple.com/sitemap.xml
    Code (markup):
     
    Corey Bryant, Apr 4, 2008 IP
  4. manish.chauhan

    manish.chauhan Well-Known Member

    Messages:
    1,682
    Likes Received:
    35
    Best Answers:
    0
    Trophy Points:
    110
    #4
    Whenever bot come to your website it first look for robots.txt to check whether you have prohibited any SE Bot or private pages to index..if there is not any robots.txt in your website, it simply means you don't want to restrict any of the SE Bot or any private pages...
     
    manish.chauhan, Apr 5, 2008 IP
  5. Ikki

    Ikki Peon

    Messages:
    474
    Likes Received:
    34
    Best Answers:
    0
    Trophy Points:
    0
    #5
    Ikki, Apr 10, 2008 IP
  6. clasione

    clasione Notable Member

    Messages:
    2,362
    Likes Received:
    158
    Best Answers:
    0
    Trophy Points:
    228
    #6
    Some people say if your not interested in excluding certain areas of your site, it doesn't matter if you have one at all.... But I always like to put one anyway, just so that it doesn't draw a 404 log hit and it gives the se's confidence that it is there and your interested in using it to communicate.
     
    clasione, Apr 10, 2008 IP
  7. SubmitShop

    SubmitShop Banned

    Messages:
    844
    Likes Received:
    106
    Best Answers:
    0
    Trophy Points:
    0
    #7
    Search engine crawler first try to access Robots.txt to ascertain webmaster instruction given to crawler. The log file writes the crawler access wither it is on server or not
     
    SubmitShop, Apr 10, 2008 IP