What is robots.txt ?

Discussion in 'robots.txt' started by riteshvyas52, May 4, 2012.

  1. #1
    What is robots.txt and what use in the website and seo field ?

    please tell me in some more details or give some link so i can read.. please
     
    riteshvyas52, May 4, 2012 IP
  2. atanu_das1985

    atanu_das1985 Member

    Messages:
    97
    Likes Received:
    1
    Best Answers:
    0
    Trophy Points:
    40
    #2
    Robots.txt is a text (not html) file you put on your site to tell search robots which pages you would like them not to visit. Robots.txt is by no means mandatory for search engines but generally search engines obey what they are asked not to do. It is important to clarify that robots.txt is not a way from preventing search engines from crawling your site (i.e. it is not a firewall, or a kind of password protection) and the fact that you put a robots.txt file is something like putting a note “Please, do not enter” on an unlocked door – e.g. you cannot prevent thieves from coming in but the good guys will not open to door and enter. That is why we say that if you have really sen sitive data, it is too naïve to rely on robots.txt to protect it from being indexed and displayed in search results.
    The location of robots.txt is very important. It must be in the main directory because otherwise user agents (search engines) will not be able to find it – they do not search the whole site for a file named robots.txt. Instead, they look first in the main directory and if they don't find it there, they simply assume that this site does not have a robots.txt file and therefore they index everything they find along the way. So, if you don't put robots.txt in the right place, do not be surprised that search engines index your whole site.


    In general, we prefer that our webpages are indexed by the search engines. But there may be some content that we don’t want to be crawled & indexed. Like the personal images folder, website administration folder, customer’s test folder of a web developer, no search value folders like cgi-bin, and many more. The main idea is we don’t want them to be indexed.
     
    atanu_das1985, May 4, 2012 IP
  3. mikebudhani

    mikebudhani Member

    Messages:
    57
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    36
    #3
    This is a command file which set direction for the search engine robot for crawl or not. simple :)
     
    mikebudhani, May 14, 2012 IP
  4. riteshvyas52

    riteshvyas52 Member

    Messages:
    110
    Likes Received:
    1
    Best Answers:
    0
    Trophy Points:
    40
    #4
    thanks u so much.. and i read your blog it's nice and very useful Knowledge for seo... thanks
     
    riteshvyas52, Jun 1, 2012 IP
  5. customautoblogs

    customautoblogs Member

    Messages:
    38
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    41
    #5
    With it you can control the way bots are crawling and indexing your information. You can deny them to crawl certain parts of your website, you can deny certain types of bots.
     
    customautoblogs, Jun 1, 2012 IP
  6. JFSG

    JFSG Active Member

    Messages:
    55
    Likes Received:
    2
    Best Answers:
    0
    Trophy Points:
    90
    #6
    robots.txt is a file that tells search engine robots what to and what not to do.
     
    JFSG, Jun 3, 2012 IP