Robots.txt?

Discussion in 'robots.txt' started by tweetylover8402, Oct 25, 2005.

  1. #1
    What's Robots.txt used for?

    In cPanel, it's the #1 'not found' page. What is it used for, and why?
     
    tweetylover8402, Oct 25, 2005 IP
  2. mcfox

    mcfox Wind Maker

    Messages:
    7,526
    Likes Received:
    716
    Best Answers:
    0
    Trophy Points:
    360
    #2
    It tells the search engine spiders what they can and can't look at on the site.

    Just upload a blank text file called robots.txt (made with Notepad, for example) and the errors will disappear.
     
    mcfox, Oct 25, 2005 IP
    JohnScott likes this.
  3. seo-ireland

    seo-ireland Peon

    Messages:
    243
    Likes Received:
    12
    Best Answers:
    0
    Trophy Points:
    0
    #3
    Also check out this tutorial to learn how to use the robots.txt file.

    Good luck.
     
    seo-ireland, Oct 25, 2005 IP
  4. Cricket

    Cricket Well-Known Member

    Messages:
    182
    Likes Received:
    41
    Best Answers:
    0
    Trophy Points:
    155
    #4
    When a robot crawls your site, it looks for the robots.txt file. If it doesn't find one, it assumes that it may crawl and index the entire site. Not having a robots.txt file can also create unnecessary 404 errors in your server logs, making it more difficult to track "real" 404 errors.

    Assuming you want your entire site indexed and only want to stop the unnecessary 404 errors from occurring, you have a couple of options:
    • Upload a blank robots.txt file to the root directory of your domain.
    • Upload a simple robots.txt file to the root directory of your domain.
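
    The behaviour described above (no rules means the whole site may be crawled) can be sanity-checked with a minimal sketch using Python's standard urllib.robotparser module. The bot name "ExampleBot" and the example.com URL are placeholders, not anything from this thread, and real crawlers may behave differently from this parser.

```python
from urllib.robotparser import RobotFileParser


def allowed_with_blank_robots(user_agent: str, url: str) -> bool:
    """Return whether user_agent may fetch url when robots.txt is empty."""
    parser = RobotFileParser()
    # An empty file contributes no User-agent groups and no rules.
    parser.parse([])
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # "ExampleBot" and the URL are hypothetical placeholders.
    print(allowed_with_blank_robots("ExampleBot", "http://www.example.com/any/page.html"))
```

    With no rules to match, this parser treats every URL as fetchable, which is consistent with a blank file being enough to stop the 404s without restricting anything.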

    I have an article on my site that covers the basics of how to create a robots.txt file, which may help you get started.




    Cricket :)
     
    Cricket, Oct 25, 2005 IP
  5. tweetylover8402

    tweetylover8402 Peon

    Messages:
    730
    Likes Received:
    45
    Best Answers:
    0
    Trophy Points:
    0
    #5
    TY for your kind responses. :)

    But why would you ever want to stop a spider from hitting a page?
     
    tweetylover8402, Oct 25, 2005 IP
  6. Cricket

    Cricket Well-Known Member

    Messages:
    182
    Likes Received:
    41
    Best Answers:
    0
    Trophy Points:
    155
    #6
    Ooops! Sorry! I didn't realize you had already answered this. Our posts must have crossed paths :eek:


    Cricket
     
    Cricket, Oct 25, 2005 IP
  7. seo-ireland

    seo-ireland Peon

    Messages:
    243
    Likes Received:
    12
    Best Answers:
    0
    Trophy Points:
    0
    #7
    No probs Cricket.

    There are lots of reasons you may want to keep spiders away. The biggest are to keep them away from sensitive data or pages you do not want in the search engine's database, and to give them more direction so that they only read the pages you want listed there.
     
    seo-ireland, Oct 25, 2005 IP
  8. minstrel

    minstrel Illustrious Member

    Messages:
    15,082
    Likes Received:
    1,243
    Best Answers:
    0
    Trophy Points:
    480
    #8
    Yes. To prevent it from spidering your images, scripts, stats, mail, etc.

    By the way, a basic "spider everything" robots.txt file would look like this:

    User-agent: *
    Disallow: 
    
    Save as plain text (ASCII/ANSI) and upload it to the ROOT of your site.

    This translates to "all spiders, please crawl everything (disallow nothing)".
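
    As a rough check of that reading, here is a small sketch that feeds the two-line file above to Python's standard urllib.robotparser and asks whether a URL may be fetched. The helper name can_crawl, the bot name and the example.com URLs are made up for illustration.

```python
from urllib.robotparser import RobotFileParser

# The "spider everything" file from the post above.
ALLOW_ALL = """\
User-agent: *
Disallow:
"""


def can_crawl(robots_txt: str, user_agent: str, url: str) -> bool:
    """Parse robots_txt and report whether user_agent may fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # Hypothetical bot name and URL, used only for the demonstration.
    print(can_crawl(ALLOW_ALL, "ExampleBot", "http://www.example.com/stats/index.html"))
```

    An empty Disallow value is treated by this parser as "nothing is disallowed", so the check comes back true for any path.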
     
    minstrel, Oct 26, 2005 IP
  9. Bibofa

    Bibofa Peon

    Messages:
    100
    Likes Received:
    1
    Best Answers:
    0
    Trophy Points:
    0
    #9
    User-agent: *
    Disallow:

    I'm using this.
     
    Bibofa, Oct 30, 2005 IP
  10. sufi

    sufi Well-Known Member

    Messages:
    2,095
    Likes Received:
    108
    Best Answers:
    0
    Trophy Points:
    105
    #10
    Hi guys, what's the command to stop spiders and robots from crawling image files? It eats up bandwidth.

    Thanks
     
    sufi, Nov 3, 2005 IP
  11. minstrel

    minstrel Illustrious Member

    Messages:
    15,082
    Likes Received:
    1,243
    Best Answers:
    0
    Trophy Points:
    480
    #11
    Under:

    User-agent: *

    add this line:

    Disallow: /images/

    substituting the name of your images folder for "images".

    If your images aren't in a separate folder, add lines like this instead:

    Disallow: /image1.gif
    Disallow: /image2.gif
    Disallow: /image3.jpg
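
    To see how those two variants behave, here is a hedged sketch using Python's standard urllib.robotparser. The helper name blocked, the bot name "ExampleBot" and the example.com URLs are illustrative assumptions; the rules themselves are the ones from the post, and real spiders are free to interpret (or ignore) them differently.

```python
from urllib.robotparser import RobotFileParser

# Variant 1: images live in their own folder.
FOLDER_RULES = """\
User-agent: *
Disallow: /images/
"""

# Variant 2: images sit in the site root, so each file is listed.
FILE_RULES = """\
User-agent: *
Disallow: /image1.gif
Disallow: /image2.gif
Disallow: /image3.jpg
"""


def blocked(robots_txt: str, url: str, user_agent: str = "ExampleBot") -> bool:
    """Return True if the rules in robots_txt disallow url for user_agent."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return not parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # Hypothetical URLs, used only to exercise the rules.
    for url in ("http://www.example.com/images/photo.jpg",
                "http://www.example.com/index.html"):
        print(url, "blocked" if blocked(FOLDER_RULES, url) else "allowed")
```

    Rules are matched as path prefixes by this parser, which is why a single folder line covers everything beneath it while the per-file variant only covers the files that are listed.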
    
     
    minstrel, Nov 3, 2005 IP
    PYJAMA likes this.
  12. PYJAMA

    PYJAMA Peon

    Messages:
    79
    Likes Received:
    1
    Best Answers:
    0
    Trophy Points:
    0
    #12
    minstrel,
    You truly are a minstrel! Well said!

    -PYJAMA
     
    PYJAMA, Nov 3, 2005 IP
  13. deepnuke

    deepnuke Peon

    Messages:
    12
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #13
    Is there a good robots.txt for increasing ranking on search engines?
     
    deepnuke, Nov 10, 2005 IP
  14. minstrel

    minstrel Illustrious Member

    Messages:
    15,082
    Likes Received:
    1,243
    Best Answers:
    0
    Trophy Points:
    480
    #14
    No. The robots.txt file is a "limiter" for spiders -- it tells them what parts of your site you do not want them to crawl/index.

    There is nothing you can put in a robots.txt file to increase search engine ranking.
     
    minstrel, Nov 11, 2005 IP
  15. jazzylee77

    jazzylee77 Peon

    Messages:
    578
    Likes Received:
    36
    Best Answers:
    0
    Trophy Points:
    0
    #15
    Is there a robots.txt file that will make me run faster? Jump higher?
     
    jazzylee77, Nov 13, 2005 IP