Robots.txt Question

Discussion in 'robots.txt' started by steveb, Mar 21, 2007.

  1. #1
    I read somewhere that having a blank robots.txt file is better than having none at all.

    Can someone explain to me the difference between a blank robots.txt file and not having the file at all?

    Thanks
     
    steveb, Mar 21, 2007 IP
  2. itrana123

    itrana123 Peon

    Messages:
    177
    Likes Received:
    2
    Best Answers:
    0
    Trophy Points:
    0
    #2
    No idea about also want to know about robots.txt will someone help...?
     
    itrana123, Mar 22, 2007 IP
  3. Dediwebspace

    Dediwebspace Active Member

    Messages:
    469
    Likes Received:
    2
    Best Answers:
    0
    Trophy Points:
    55
    #3
    It depends what bots you want crawling your site
     
    Dediwebspace, Mar 23, 2007 IP
  4. trichnosis

    trichnosis Prominent Member

    Messages:
    13,785
    Likes Received:
    333
    Best Answers:
    0
    Trophy Points:
    300
    #4
    where have you read it? having a robots.txt file will always help you
     
    trichnosis, May 11, 2007 IP
  5. mohammad_x

    mohammad_x Peon

    Messages:
    127
    Likes Received:
    1
    Best Answers:
    0
    Trophy Points:
    0
    #5
    It appears have empty txt file not be bad
     
    mohammad_x, May 24, 2007 IP
  6. tinkerbox

    tinkerbox Peon

    Messages:
    55
    Likes Received:
    1
    Best Answers:
    0
    Trophy Points:
    0
    #6
    Only honest/good bot will read your robots.txt Bad bots will just ignored it.
    robots.txt cannot control bots, it just a txt file that stay there and give info to bots what directory you want them to index.
     
    tinkerbox, May 25, 2007 IP
  7. fedarik

    fedarik Well-Known Member

    Messages:
    144
    Likes Received:
    2
    Best Answers:
    0
    Trophy Points:
    103
    #7
    According to http://www.robotstxt.org/ faq


    How does a robot decide where to visit?
    This depends on the robot, each one uses different strategies. In general they start from a historical list of URLs, especially of documents with many links elsewhere, such as server lists, "What's New" pages, and the most popular sites on the Web.

    Most indexing services also allow you to submit URLs manually, which will then be queued and visited by the robot.

    Sometimes other sources for URLs are used, such as scanners through USENET postings, published mailing list achives etc.

    Given those starting points a robot can select URLs to visit and index, and to parse and use as a source for new URLs.

    How does an indexing robot decide what to index?
    If an indexing robot knows about a document, it may decide to parse it, and insert it into its database. How this is done depends on the robot: Some robots index the HTML Titles, or the first few paragraphs, or parse the entire HTML and index all words, with weightings depending on HTML constructs, etc. Some parse the META tag, or other special hidden tags.

    We hope that as the Web evolves more facilities becomes available to efficiently associate meta data such as indexing information with a document. This is being worked on...
     
    fedarik, May 26, 2007 IP
  8. kirby009

    kirby009 Peon

    Messages:
    608
    Likes Received:
    4
    Best Answers:
    0
    Trophy Points:
    0
    #8
    i leave mine blank it does seen to help.
     
    kirby009, Jun 12, 2007 IP