What is robots.txt?

Discussion in 'Search Engine Optimization' started by Agnel, May 5, 2009.

  1. #1
    Hello,

    Anybody can tell me about robots.txt?

    Regards,
    Agnel
     
    Agnel, May 5, 2009 IP
  2. dealeris

    dealeris Active Member

    Messages:
    247
    Likes Received:
    13
    Best Answers:
    0
    Trophy Points:
    80
    #2
    Robots.txt is a text file usually kept in root directory of your website, to tell search engines what to index, how often etc.
     
    dealeris, May 5, 2009 IP
  3. ericajoieake

    ericajoieake Guest

    Messages:
    556
    Likes Received:
    6
    Best Answers:
    0
    Trophy Points:
    0
    #3
    this text file is very essential if you want to restrict the search engines to crawl or get the information of the pages of your site.
     
    ericajoieake, May 5, 2009 IP
  4. stephen082

    stephen082 Active Member

    Messages:
    843
    Likes Received:
    81
    Best Answers:
    0
    Trophy Points:
    95
    #4
    Sorry but this is not correct. :confused: Robots.txt file is actually to tell search engine what not to index. There are certain pages on your site that you don't want to get indexed such as admin pages, login pages, or other internal information. You can just put those url's in Robots.txt file and these pages will not get indexed.
     
    stephen082, May 5, 2009 IP
  5. ilabsaft

    ilabsaft Peon

    Messages:
    94
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #5
    i don't know robots.txt :) it is exactly text file. Please clear explaint for me and Agnel?
     
    ilabsaft, May 5, 2009 IP
  6. The SEO Man

    The SEO Man Well-Known Member

    Messages:
    448
    Likes Received:
    11
    Best Answers:
    1
    Trophy Points:
    140
    #6
    Some consider robots.txt to be more of a security risk than a benefit, as it potentially exposes system structure.

    Only well behaved spiders take robots.txt into consideration.


    Malicious and non-complying spiders are abound, which ignore or may take advantage of robots.txt in a nefarious way.
     
    The SEO Man, May 5, 2009 IP
  7. jitendraag

    jitendraag Notable Member

    Messages:
    3,982
    Likes Received:
    324
    Best Answers:
    1
    Trophy Points:
    270
    #7
    A simple google search will help you find more details about robots.txt and Robot exclusion standard.

    As stephen082 said, it's used to inform bots adhering to robots exclusion standard about what not to index.
     
    jitendraag, May 5, 2009 IP
  8. rena

    rena Peon

    Messages:
    1,987
    Likes Received:
    13
    Best Answers:
    0
    Trophy Points:
    0
    #8
    its a file to controll indexing site from google.. in this can specfy to index or non idex the pages on google
     
    rena, May 5, 2009 IP
  9. f44

    f44 Peon

    Messages:
    8
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #9
    it is also use to avoid having a double content penalty from google
     
    f44, May 5, 2009 IP
  10. Traffic-Bug

    Traffic-Bug Active Member

    Messages:
    1,866
    Likes Received:
    8
    Best Answers:
    0
    Trophy Points:
    80
    #10
    It tells the search bots and indexing bots what should be indexed and what is prevented from indexing.l
     
    Traffic-Bug, May 5, 2009 IP
  11. jockson

    jockson Guest

    Messages:
    101
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #11
    jockson, May 5, 2009 IP
  12. technomart

    technomart Guest

    Messages:
    240
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #12
    Robots.txt is a text file usually kept in root directory of your website, to tell search engines which page crawl or get the information of the pages of your site.
     
    technomart, May 5, 2009 IP
  13. Roger_Silvester

    Roger_Silvester Peon

    Messages:
    277
    Likes Received:
    2
    Best Answers:
    0
    Trophy Points:
    0
    #13
    "Robots.txt" is a regular text file that through its name, has special meaning to the majority of "honorable" robots on the web. By defining a few rules in this text file, you can instruct robots to not crawl and index certain files, directories within your site, or at all. For example, you may not want Google to crawl the /images directory of your site, as it's both meaningless to you and a waste of your site's bandwidth. "Robots.txt" lets you tell Google just that.
     
    Roger_Silvester, May 6, 2009 IP