Wat is robots.txt??

Discussion in 'Search Engine Optimization' started by meenka, Feb 22, 2010.

  1. #1
    Wat is robots.txt file???
     
    meenka, Feb 22, 2010 IP
  2. funkymario

    funkymario Notable Member

    Messages:
    2,836
    Likes Received:
    369
    Best Answers:
    0
    Trophy Points:
    230
    #2
    The robots.txt file is a set of instructions for search engine bots (spiders). it basically tells them what they can, and cannot index..

    it's not a "must" file to have in your root directory.. if you don't have any specific use for it, no need to have it.
     
    funkymario, Feb 22, 2010 IP
  3. seo555

    seo555 Peon

    Messages:
    1,035
    Likes Received:
    6
    Best Answers:
    0
    Trophy Points:
    0
    #3
    robots.txt file is one type of file which help in which URL you want follow and which link you want dis follow.

    Follow code is:
    check my website and after domain name put this robots.txt you got some links you can easily understand
     
    seo555, Feb 22, 2010 IP
  4. meenka

    meenka Peon

    Messages:
    158
    Likes Received:
    1
    Best Answers:
    0
    Trophy Points:
    0
    #4
    is it compulsory that a website should have robot.txt file
     
    meenka, Feb 22, 2010 IP
  5. ameto

    ameto Peon

    Messages:
    126
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #5
    robot.txt is the text file which tells search engine crawler about that pages of sites that should not be indexed and followed by crawler..

    syntax:

    User-Agent: [Spider name]
    Disallow: [File Name to be excluded]
     
    ameto, Feb 22, 2010 IP
  6. soman4u

    soman4u Member

    Messages:
    233
    Likes Received:
    1
    Best Answers:
    0
    Trophy Points:
    30
    #6
    soman4u, Feb 22, 2010 IP
  7. seo555

    seo555 Peon

    Messages:
    1,035
    Likes Received:
    6
    Best Answers:
    0
    Trophy Points:
    0
    #7
    its not compulsory but you have robots.txt file so google update pages quickly and you get good result and updates
     
    seo555, Feb 22, 2010 IP
  8. Provenzano

    Provenzano Active Member

    Messages:
    190
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    51
    #8
    It is not a must-have file but a better-have one
     
    Provenzano, Feb 22, 2010 IP
  9. extraspecial

    extraspecial Member

    Messages:
    788
    Likes Received:
    4
    Best Answers:
    1
    Trophy Points:
    45
    #9
    Not much important... but if you use a cms you will have one automatically :)
     
    extraspecial, Feb 22, 2010 IP
  10. Stiker

    Stiker Greenhorn

    Messages:
    51
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    16
    #10
    robot.txt is the scrpting text file If in which apply in the site then search engine crawler not read you site Content and your site link.
     
    Stiker, Feb 22, 2010 IP
  11. dailyearner

    dailyearner Banned

    Messages:
    2,227
    Likes Received:
    11
    Best Answers:
    0
    Trophy Points:
    0
    #11
    it instruct the search engines to crawl your website content
     
    dailyearner, Feb 22, 2010 IP
  12. ap09.com

    ap09.com Guest

    Messages:
    199
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #12
    ap09.com, Feb 22, 2010 IP
  13. Traffic-Bug

    Traffic-Bug Active Member

    Messages:
    1,866
    Likes Received:
    8
    Best Answers:
    0
    Trophy Points:
    80
    #13
    Robots.txt file is a text file that lists the search engine spiders and what they can and cannot index according to various paths found in the list of directories in that file. You can prevent or enable indexing of content in specific directories using the robots.txt file.
     
    Traffic-Bug, Feb 22, 2010 IP
  14. fancyui

    fancyui Peon

    Messages:
    17
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #14
    You can check any website's robots.txt by
    http://www.domain.com/robots.txt
     
    fancyui, Feb 22, 2010 IP
  15. Aryans

    Aryans Well-Known Member

    Messages:
    1,854
    Likes Received:
    31
    Best Answers:
    1
    Trophy Points:
    178
    #15
    check these links for more details www.robotstxt.org & en.wikipedia.org/wiki/Robots_exclusion_standard
     
    Aryans, Feb 23, 2010 IP
  16. Thomasan

    Thomasan Active Member

    Messages:
    310
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    53
    #16
    robot.txt is file in which we can recommend the google spider..which webpage it follow and which it do not follow..
     
    Thomasan, Feb 23, 2010 IP
  17. link2add

    link2add Peon

    Messages:
    86
    Likes Received:
    1
    Best Answers:
    0
    Trophy Points:
    0
    #17
    from robots.txt you can restrict search engine crawler to crawl your web-pages and folders . for this you need to type code in robots.txt and save it at root directory of site.
     
    link2add, Feb 23, 2010 IP
  18. hannahx

    hannahx Peon

    Messages:
    107
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #18
    In robots.txt file you can tell the crawler about those pages which you do not want to index in the google r any other search engine. you can simply add those URLS that you don not want to index.
     
    hannahx, Feb 23, 2010 IP
  19. Tom32

    Tom32 Peon

    Messages:
    56
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #19
    It is always better have robots.txt on your web server. It is because if spiders check for this file and not found ,it may report it as a broken link (only a chance).So better we should avoid this chance.
     
    Tom32, Feb 23, 2010 IP
  20. anthony123

    anthony123 Peon

    Messages:
    58
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #20
    robots.txt is the text file instruct the google crawler to index your website page or not.
     
    anthony123, Feb 23, 2010 IP