How a Search engine is worked?

Discussion in 'All Other Search Engines' started by alisha0512, Jun 19, 2008.

  1. #1
    Please anybody tell me that how a search engine is prepared and howi t works?
     
    alisha0512, Jun 19, 2008 IP
  2. firmaterra

    firmaterra Peon

    Messages:
    756
    Likes Received:
    16
    Best Answers:
    0
    Trophy Points:
    0
    #2
    Take a look a HADOOP or LUCENE to get a basic understanding
     
    firmaterra, Jun 19, 2008 IP
  3. alisha0512

    alisha0512 Banned

    Messages:
    345
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #3
    please tell the work of Search engine.
     
    alisha0512, Jun 23, 2008 IP
  4. kmofo

    kmofo Active Member

    Messages:
    442
    Likes Received:
    24
    Best Answers:
    0
    Trophy Points:
    85
    #4
    In a few words: A search engine has two modules. One for aquiring the data and one for providing search results. Of course, they are linked together by a database. The first module, the so-called "spider" is a software agent that crawls web pages, according to previous set rules. During crawling, the agent adds to the database some info (again, depends on the rules defined) about the webpage. Among this info there might be: URL, title, a copy of the page itself, etc... The other module pulls the data from the database whenever a search is made.

    This is the simplest possible model. There are many type of search engines: based on popularity, similarity or authority, etc... You can get more info here.
     
    kmofo, Jun 23, 2008 IP
  5. alisha0512

    alisha0512 Banned

    Messages:
    345
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #5
    Thanks to provide me this information.
    please tell me what is On-page and Off-page Optimization?
     
    alisha0512, Jun 27, 2008 IP
  6. alisha0512

    alisha0512 Banned

    Messages:
    345
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #6
    what is Spider ?
     
    alisha0512, Jun 29, 2008 IP
  7. jigolo

    jigolo Peon

    Messages:
    312
    Likes Received:
    2
    Best Answers:
    0
    Trophy Points:
    0
    #7
    It is a website/site that has an ability to show some results of products and services crawled or being indexed on the data center.

    To be able find their/his/our products/services quick and easier using the search form on the site.


    Can you correct me guys if this is not the correct explanation on how/what the search engine is? :)
     
    jigolo, Jun 30, 2008 IP
  8. skypeace

    skypeace Well-Known Member

    Messages:
    2,348
    Likes Received:
    125
    Best Answers:
    0
    Trophy Points:
    130
    #8
    A Web search engine is a search engine designed to search for information on the World Wide Web. Information may consist of web pages, images and other types of files. Some search engines also mine data available in newsgroups, databases, or open directories. Unlike Web directories, which are maintained by human editors, search engines operate algorithmically or are a mixture of algorithmic and human input.

    How Web search engines work:


    A search engine operates, in the following order

    1. Web crawling
    2. Indexing
    3. Searching

    Web search engines work by storing information about many web pages, which they retrieve from the WWW itself. These pages are retrieved by a Web crawler (sometimes also known as a spider) — an automated Web browser which follows every link it sees. Exclusions can be made by the use of robots.txt. The contents of each page are then analyzed to determine how it should be indexed (for example, words are extracted from the titles, headings, or special fields called meta tags). Data about web pages are stored in an index database for use in later queries. Some search engines, such as Google, store all or part of the source page (referred to as a cache) as well as information about the web pages, whereas others, such as AltaVista, store every word of every page they find. This cached page always holds the actual search text since it is the one that was actually indexed, so it can be very useful when the content of the current page has been updated and the search terms are no longer in it. This problem might be considered to be a mild form of linkrot, and Google's handling of it increases usability by satisfying user expectations that the search terms will be on the returned webpage. This satisfies the principle of least astonishment since the user normally expects the search terms to be on the returned pages. Increased search relevance makes these cached pages very useful, even beyond the fact that they may contain data that may no longer be available elsewhere.

    When a user enters a query into a search engine (typically by using key words), the engine examines its index and provides a listing of best-matching web pages according to its criteria, usually with a short summary containing the document's title and sometimes parts of the text. Most search engines support the use of the boolean operators AND, OR and NOT to further specify the search query. Some search engines provide an advanced feature called proximity search which allows users to define the distance between keywords.

    The usefulness of a search engine depends on the relevance of the result set it gives back. While there may be millions of webpages that include a particular word or phrase, some pages may be more relevant, popular, or authoritative than others. Most search engines employ methods to rank the results to provide the "best" results first. How a search engine decides which pages are the best matches, and what order the results should be shown in, varies widely from one engine to another. The methods also change over time as Internet usage changes and new techniques evolve.

    Most Web search engines are commercial ventures supported by advertising revenue and, as a result, some employ the controversial practice of allowing advertisers to pay money to have their listings ranked higher in search results. Those search engines which do not accept money for their search engine results make money by running search related ads alongside the regular search engine results. The search engines make money every time someone clicks on one of these ads.

    The vast majority of search engines are run by private companies using proprietary algorithms and closed databases, though some are open source.

    Revenue in the web search portals industry is projected to grow in 2008 by 13.4 percent, with broadband connections expected to rise by 15.1 percent. Between 2008 and 2012, industry revenue is projected to rise by 56 percent as Internet penetration still has some way to go to reach full saturation in American households. Furthermore, broadband services are projected to account for an ever increasing share of domestic Internet users, rising to 118.7 million by 2012, with an increasing share accounted for by fiber-optic and high speed cable lines

    Hope this helps! :)
     
    skypeace, Jul 3, 2008 IP
  9. manemdeepthi

    manemdeepthi Peon

    Messages:
    12
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #9
    please tell me optimization tips for a job board website
     
    manemdeepthi, Jul 8, 2008 IP
  10. alisha0512

    alisha0512 Banned

    Messages:
    345
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #10
    thanks to provide me such information.
     
    alisha0512, Jul 10, 2008 IP
  11. love123

    love123 Banned

    Messages:
    23
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #11
    thnx to introduce working of search engine
     
    love123, Jul 11, 2008 IP
  12. MRniceGuy007

    MRniceGuy007 Peon

    Messages:
    446
    Likes Received:
    6
    Best Answers:
    0
    Trophy Points:
    0
    #12
    the basics and know hows is okay, but there is more to that to the big search engines, they have complex math and algorithms behind their architecture
     
    MRniceGuy007, Jul 11, 2008 IP
  13. CafeSaiGon

    CafeSaiGon Banned

    Messages:
    51
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #13
    Hope this help you webmasterseoblog.com/how-a-search-engine-works/
     
    CafeSaiGon, Jul 12, 2008 IP
  14. ramgay

    ramgay Active Member

    Messages:
    313
    Likes Received:
    15
    Best Answers:
    0
    Trophy Points:
    73
    #14
    looks interesting
     
    ramgay, Jul 12, 2008 IP
  15. alisha0512

    alisha0512 Banned

    Messages:
    345
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #15


    please tell me the work of Click Popularity to increase the traffic of site.
     
    alisha0512, Jul 28, 2008 IP
  16. alisha0512

    alisha0512 Banned

    Messages:
    345
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #16
    How does the google crawl a page?
     
    alisha0512, Aug 12, 2008 IP
  17. holidayhoteldirect

    holidayhoteldirect Banned

    Messages:
    154
    Likes Received:
    2
    Best Answers:
    0
    Trophy Points:
    0
    #17
    thats secret only google knows
     
    holidayhoteldirect, Aug 12, 2008 IP
  18. Teodor

    Teodor Banned

    Messages:
    382
    Likes Received:
    4
    Best Answers:
    0
    Trophy Points:
    58
    #18
    Although Google crawls billions of pages, it's inevitable that some sites will be missed. When their so-called spiders miss a site, it's frequently for one of the following reasons:
    - The site isn't well connected through multiple links to other sites on the web.
    - The site launched after Google's most recent crawl was completed.
    - The design of the site makes it difficult for Google to effectively crawl its content.
    - The site was temporarily unavailable when we tried to crawl it or we received an error when we tried to crawl it. You can use Google Webmaster Tools to see if we received errors when trying to crawl your site.
     
    Teodor, Aug 12, 2008 IP
  19. alisha0512

    alisha0512 Banned

    Messages:
    345
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #19


    thank to tell me
     
    alisha0512, Aug 13, 2008 IP
  20. bposervices

    bposervices Banned

    Messages:
    82
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #20
    thanks for cool info
     
    bposervices, Aug 13, 2008 IP