It's just like any other web crawler: it crawls the web and finds sites. The crawler, known as Googlebot, visits a site and fetches its data. Google then analyzes the data and figures out what to do with it. The main purpose of the crawler is to fetch data for Google.
Yes, Google's crawler crawls sites with good content and neglects the bad ones. So build a site with good content to keep the Google crawler happy.
When the crawler comes to your site, it first looks for robots.txt and then crawls the associated pages that the file allows. That leads the crawler to the second phase of crawling: inspecting all the links present on each web page and following them from one page to the next.
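To make the robots.txt step concrete, here is a minimal sketch in Python using the standard library's `urllib.robotparser`. The robots.txt content, the `ExampleBot` user agent, and the example.com URLs are all made up for illustration; this is not how Googlebot itself is implemented, only the general idea of checking the rules before fetching.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content, parsed from a string so the sketch
# runs without any network access.
ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
"""

def make_parser(robots_text):
    """Build a parser from the text of a robots.txt file."""
    parser = RobotFileParser()
    parser.parse(robots_text.splitlines())
    return parser

def allowed_urls(parser, user_agent, urls):
    """Keep only the URLs this user agent is allowed to fetch."""
    return [url for url in urls if parser.can_fetch(user_agent, url)]

parser = make_parser(ROBOTS_TXT)
candidates = [
    "http://example.com/",
    "http://example.com/about.html",
    "http://example.com/private/report.html",
]
# "ExampleBot" is an invented crawler name.
crawlable = allowed_urls(parser, "ExampleBot", candidates)
print(crawlable)
```

A polite crawler would run a check like this for every URL before requesting it, and only then move on to following the links on the pages it fetched.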
The Google crawler gets talked about the most, but what about the Yahoo and Bing crawlers? How do they work? I've seen that most people rank well in Google but struggle in Yahoo and Bing, so what can we do about that?
Crawling is the process by which Googlebot discovers new and updated pages to be added to the Google index. Google's crawl process begins with a list of web page URLs, generated from previous crawl processes, and augmented with Sitemap data provided by webmasters. As Googlebot visits each of these websites it detects links on each page and adds them to its list of pages to crawl.
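The process described above (start from a list of known URLs, visit each one, add newly found links to the list) can be sketched as a simple queue-based crawl. This is only an illustration under assumptions: the `LINKS` dictionary is a made-up stand-in for real pages, and `crawl` and `get_links` are hypothetical names, not anything from Google.

```python
from collections import deque

# Hypothetical link graph: each page maps to the links found on it.
# A real crawler would download the page and extract these instead.
LINKS = {
    "/": ["/about", "/blog"],
    "/about": ["/"],
    "/blog": ["/blog/post-1", "/about"],
    "/blog/post-1": [],
}

def crawl(seeds, get_links, max_pages=100):
    """Visit pages breadth-first, starting from the seed URLs."""
    frontier = deque(seeds)   # URLs waiting to be crawled
    seen = set(seeds)         # URLs already queued, to avoid repeats
    order = []                # URLs in the order they were visited
    while frontier and len(order) < max_pages:
        url = frontier.popleft()
        order.append(url)
        for link in get_links(url):
            if link not in seen:
                seen.add(link)
                frontier.append(link)
    return order

# Seeds would come from previous crawls plus Sitemap data.
visited = crawl(["/"], lambda url: LINKS.get(url, []))
print(visited)
```

The `seen` set is what stops the crawler from looping forever when pages link back to each other, as "/about" and "/" do here.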
Googlebot really just does the discovery of content. It makes some determinations about whether to crawl a link, but only on a basic level. However, regarding software and identifying content: it's not difficult, even on a simple level, to identify bad content. It's all about finding particular signals/matches. For example, if the page has no words but hundreds of links, then you know there's a problem. These are just the basics. Categorising content is a bit trickier, but if a page appears to be about cats and then mentions apples, there's something wrong with the content. Look up regular expressions and you could write some in PHP to analyse a page and make some determinations. (Simplest way if you want to experiment.)
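As a rough sketch of the "no words but hundreds of links" signal, here is what that could look like with regular expressions. It is written in Python rather than PHP, the patterns are deliberately crude, and the `min_links` threshold and function names are arbitrary choices for experimenting, not anything a search engine is known to use.

```python
import re

# Crude patterns: good enough for experimenting, not a real HTML parser.
LINK_RE = re.compile(r"<a\s[^>]*href=", re.IGNORECASE)
TAG_RE = re.compile(r"<[^>]+>")
WORD_RE = re.compile(r"[A-Za-z']+")

def page_signals(html):
    """Count links and visible words in a chunk of HTML."""
    links = len(LINK_RE.findall(html))
    text = TAG_RE.sub(" ", html)          # strip tags, keep the text
    words = len(WORD_RE.findall(text))
    return {"links": links, "words": words}

def looks_like_link_farm(html, min_links=100):
    """Flag pages with many links and fewer words than links.

    The threshold is an arbitrary assumption for this sketch.
    """
    signals = page_signals(html)
    return signals["links"] >= min_links and signals["words"] < signals["links"]

normal_page = '<p>Cats sleep for most of the day.</p> <a href="/cats">More about cats</a>'
link_farm = '<a href="/x"></a>' * 200

print(page_signals(normal_page), looks_like_link_farm(normal_page))
print(page_signals(link_farm), looks_like_link_farm(link_farm))
```

Checking whether a page about cats suddenly talks about apples would need more than this, for instance comparing word frequencies against the page's main topic, but the same count-and-compare idea is a reasonable place to start.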
You can check out this site to learn more about it: http://www.google-dance-tool.com/google_crawler_history.html
A good source can explain it better. Have a look at the following Wikipedia article on web crawlers: http://en.wikipedia.org/wiki/Web_crawler