Robots.txt Question

steveb Well-Known Member

Messages:: 1,434

Likes Received:: 66

Best Answers:: 0

Trophy Points:: 175

#1

I read somewhere that having a blank robots.txt file is better than having none at all.

Can someone explain to me the difference between a blank robots.txt file and not having the file at all?

Thanks

steveb, Mar 21, 2007 IP

itrana123 Peon

Messages:: 177

Likes Received:: 2

Best Answers:: 0

Trophy Points:: 0

#2

No idea about also want to know about robots.txt will someone help...?

itrana123, Mar 22, 2007 IP

Dediwebspace Active Member

Messages:: 469

Likes Received:: 2

Best Answers:: 0

Trophy Points:: 55

#3

It depends what bots you want crawling your site

Dediwebspace, Mar 23, 2007 IP

trichnosis Prominent Member

Messages:: 13,785

Likes Received:: 333

Best Answers:: 0

Trophy Points:: 300

#4

where have you read it? having a robots.txt file will always help you

trichnosis, May 11, 2007 IP

mohammad_x Peon

Messages:: 127

Likes Received:: 1

Best Answers:: 0

Trophy Points:: 0

#5

It appears have empty txt file not be bad

mohammad_x, May 24, 2007 IP

tinkerbox Peon

Messages:: 55

Likes Received:: 1

Best Answers:: 0

Trophy Points:: 0

#6

Only honest/good bot will read your robots.txt Bad bots will just ignored it.
robots.txt cannot control bots, it just a txt file that stay there and give info to bots what directory you want them to index.

tinkerbox, May 25, 2007 IP

fedarik Well-Known Member

Messages:: 144

Likes Received:: 2

Best Answers:: 0

Trophy Points:: 103

#7

According to http://www.robotstxt.org/ faq

How does a robot decide where to visit?
This depends on the robot, each one uses different strategies. In general they start from a historical list of URLs, especially of documents with many links elsewhere, such as server lists, "What's New" pages, and the most popular sites on the Web.

Most indexing services also allow you to submit URLs manually, which will then be queued and visited by the robot.

Sometimes other sources for URLs are used, such as scanners through USENET postings, published mailing list achives etc.

Given those starting points a robot can select URLs to visit and index, and to parse and use as a source for new URLs.

How does an indexing robot decide what to index?
If an indexing robot knows about a document, it may decide to parse it, and insert it into its database. How this is done depends on the robot: Some robots index the HTML Titles, or the first few paragraphs, or parse the entire HTML and index all words, with weightings depending on HTML constructs, etc. Some parse the META tag, or other special hidden tags.

We hope that as the Web evolves more facilities becomes available to efficiently associate meta data such as indexing information with a document. This is being worked on...

fedarik, May 26, 2007 IP

kirby009 Peon

Messages:: 608

Likes Received:: 4

Best Answers:: 0

Trophy Points:: 0

#8

i leave mine blank it does seen to help.

kirby009, Jun 12, 2007 IP

Log in or Sign up