robots.txt?

Discussion in 'Search Engine Optimization' started by vlad230, Sep 17, 2007.

  1. #1
    Hi guys!

    I've seen a lot of people here talking about a file called robots.txt...
    Why is it necessary?
    What should I write there?
    Where should I put it?

    Thanks,
    Vlad
     
    vlad230, Sep 17, 2007 IP
  2. solvman

    #2
    It controls how search engines crawl your pages. Google it and you'll find tons of info on the format, what goes in it, and so on.

    Regards.
     
    solvman, Sep 17, 2007 IP
  3. awaken

    #3
    Vlad,

    The robots.txt file can come in handy in many ways, but the main benefit in my opinion is that it allows you to prevent SE spiders from crawling certain pages in your website. By doing so, you can minimize duplicate content and keep unnecessary pages from being indexed.

    Reading this site helped me a ton: http://www.robotstxt.org/
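
A minimal sketch of blocking certain pages, checked with Python's standard `urllib.robotparser`. The directory names `/print/` and `/search/` and the `www.example.com` host are made up for illustration, and `is_allowed` is a hypothetical helper, not something from this thread.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules: keep every bot out of two made-up directories
# that would otherwise produce duplicate or useless pages.
ROBOTS_TXT = """\
User-agent: *
Disallow: /print/
Disallow: /search/
"""


def is_allowed(robots_txt: str, url: str, agent: str = "*") -> bool:
    """Return True if `agent` may crawl `url` under the given rules."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


if __name__ == "__main__":
    for url in (
        "http://www.example.com/article.html",
        "http://www.example.com/print/article.html",
    ):
        print(url, is_allowed(ROBOTS_TXT, url))
```

Each Disallow value is matched as a path prefix, so everything under the listed directories is covered. Keep in mind that this only asks well-behaved crawlers to stay away; it does not protect the pages.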
     
    awaken, Sep 17, 2007 IP
  4. chesca

    #4
    Robotz.txt tells search engines which files not to crawl.
     
    chesca, Sep 18, 2007 IP
  5. abixalmon

    #5
    Robotz.txt??? I think it should be robots.txt.

    Why is it necessary?

    To tell Search Engine Bots which files to access

    What should I write there?

    Mostly like this (an empty Disallow line lets every bot crawl everything):

    User-agent: *
    Disallow:

    Where should I put it?

    In the root directory of your domain (I mean website!), so that it is reachable at /robots.txt.
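
To make the "root directory" point concrete, here is a rough sketch that works out where a crawler would look for the file and confirms that the two-line allow-all file really blocks nothing. The helper names `robots_url` and `allows_everything` and the `www.example.com` addresses are assumptions for illustration only.

```python
from urllib.parse import urlsplit, urlunsplit
from urllib.robotparser import RobotFileParser

# The allow-all file from the post above.
ALLOW_ALL = "User-agent: *\nDisallow:\n"


def robots_url(page_url: str) -> str:
    """Return the address where crawlers look for robots.txt for this site."""
    parts = urlsplit(page_url)
    # Same scheme and host, fixed path at the root, no query or fragment.
    return urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))


def allows_everything(robots_txt: str, sample_urls) -> bool:
    """Return True if every sample URL may be crawled by any bot."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return all(parser.can_fetch("*", url) for url in sample_urls)


if __name__ == "__main__":
    print(robots_url("http://www.example.com/blog/post.html?page=2"))
    print(allows_everything(ALLOW_ALL, ["http://www.example.com/any/page.html"]))
```

A robots.txt placed in a subdirectory (for example /blog/robots.txt) is not picked up; crawlers only request the one at the root of the host.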
     
    abixalmon, Sep 18, 2007 IP
  6. trichnosis

  7. devat

    #7
    devat, Sep 18, 2007 IP
  8. ravi3984

    #8
    Hi,

    If you have duplicate pages on your website, you can list them in the robots.txt file and search engines will not crawl the specified URLs.

    Or you can do something similar in the HTML itself, with the nofollow attribute on links or a robots meta tag on the page.

    thanks
     
    ravi3984, Sep 19, 2007 IP
  9. quaffapint

    #9
    Also, a nice new feature of robots.txt is that you can add a link to your sitemap.xml file to help the SEs find all your pages.
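
A small sketch of what that looks like: a `Sitemap:` line with the full URL of the sitemap, read back with the standard `urllib.robotparser` (its `site_maps()` method needs Python 3.8 or newer). The `www.example.com` URL and the `listed_sitemaps` helper are placeholders, not taken from this thread.

```python
from urllib.robotparser import RobotFileParser

# Allow-all rules plus a sitemap reference. The Sitemap line takes a
# full URL; the address here is a placeholder.
ROBOTS_TXT = """\
User-agent: *
Disallow:

Sitemap: http://www.example.com/sitemap.xml
"""


def listed_sitemaps(robots_txt: str) -> list:
    """Return the sitemap URLs declared in a robots.txt body."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    # site_maps() returns None when no Sitemap line is present.
    return parser.site_maps() or []


if __name__ == "__main__":
    print(listed_sitemaps(ROBOTS_TXT))
```

The Sitemap line is independent of the User-agent groups, so it can go anywhere in the file.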
     
    quaffapint, Sep 19, 2007 IP
  10. vlad230

    #10
    So, I followed the links that you guys provided and decided to put this in my robots.txt file:

    User-agent: *
    Disallow:

    That allows all the bots to crawl my website :)

    And sitemap.xml?
    I found www.xml-sitemaps.com, which apparently generates a sitemap.xml file. Is it OK to save that file in my root directory, or do I need to modify it?
     
    vlad230, Sep 19, 2007 IP
  11. Web Gazelle

    #11
    It is a set of instructions for bots that visit your site. It's worth having one because well-behaved bots look for it before they crawl. Create the file and save it in your root directory.

    Try this link http://www.seochat.com/seo-tools/robots-generator/
     
    Web Gazelle, Sep 19, 2007 IP
  12. boyponga

    #12
    It stops bots from crawling certain pages. :D

    User-agent: robotname
    Disallow: /pagename

    Write it in Notepad, save it as robots.txt, and upload it to your web server over FTP.
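
As a rough illustration of a per-robot rule, the sketch below blocks one named bot from one page while leaving everyone else unrestricted. `BadBot`, `OtherBot`, `/private.html`, and the `may_crawl` helper are invented names for the example.

```python
from urllib.robotparser import RobotFileParser

# One group for a specific (hypothetical) bot, one catch-all group.
ROBOTS_TXT = """\
User-agent: BadBot
Disallow: /private.html

User-agent: *
Disallow:
"""


def may_crawl(agent: str, url: str) -> bool:
    """Return True if `agent` may crawl `url` under ROBOTS_TXT."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(agent, url)


if __name__ == "__main__":
    page = "http://www.example.com/private.html"
    for agent in ("BadBot", "OtherBot"):
        print(agent, may_crawl(agent, page))
```

A bot uses the group that names it when one exists and falls back to the `*` group otherwise, which is why only the named bot is kept away from the page here.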
     
    boyponga, Sep 19, 2007 IP
  13. quaffapint

    #13
    Check out the post on...

    SEO : Help Search Engines Find All Your Pages

    ...It explains how to add your sitemap.xml file to your robots.txt file.
     
    quaffapint, Sep 20, 2007 IP