1. Advertising
    y u no do it?

    Advertising (learn more)

    Advertise virtually anything here, with CPM banner ads, CPM email ads and CPC contextual links. You can target relevant areas of the site and show ads based on geographical location of the user if you wish.

    Starts at just $1 per CPM or $0.10 per CPC.

Search engine theory - post yours here!

Discussion in 'Programming' started by davedx, Mar 29, 2005.

  1. #1
    So a lot of webmaster forums have plenty of threads saying why Google is broken, or not as good as it could be. Here's your chance to say how you think a search engine should work! Serious or silly ideas both welcome. I'm personally semi-serious about this, from a hobby/programming perspective. My own theory is to take the original Google idea to the extreme, and base almost everything off the "links are votes" concept - give surrounding text of the backlink, anchor text (and rep of page) most of the weight.

    If I started my own engine, I'd probably start by crawling dmoz and yahoo, giving pages on those directories an arbitrary high weight as a placeholder until I could write a decent ranking algo.

    Anyone else have mad whacky theories? :D I remember some posts by Owlcroft a few weeks back that held the complete opposite view to mine -- that a really decent SE should be able to determine the relevancy of a site by its content. I love discussing this stuff!

    Edit - also, any constructive points about how other search engines (Google) fall down would also be great. i.e. the 302 redirect problem.
     
    davedx, Mar 29, 2005 IP
  2. redking

    redking Member

    Messages:
    93
    Likes Received:
    2
    Best Answers:
    0
    Trophy Points:
    43
    #2
    A search engine actually isn't that hard to program; I made one for a class project in college. However, the key is the algorithms used to sort and search through all the data that you have spidered. You'll need a background in computer science or math because the algorithms used are very theoretical. All search engine theory relies on the same type of algorithms and most of the information is public because it's presented in conferences. So, over time all search engines have to evolve or clever people will take the public information and start using it to manipulate the results.

    My whole point is that search engine theory is a young, evolving field based on one of the oldest hard sciences - math. If you have a strong math background, then you can create and manipulate search engines. Troof.
     
    redking, Mar 29, 2005 IP
  3. sadcox66

    sadcox66 Spirit Walker

    Messages:
    496
    Likes Received:
    16
    Best Answers:
    0
    Trophy Points:
    0
    #3
    Google is not broken... It is a smart move on google part to keep you in the sandbox (many of my sites are in it now, so it exists) so you that you end up paying them through Adwords for traffic. However it does look like google is not the only game in town and I may consider not optimizing for them altogether.
     
    sadcox66, Mar 29, 2005 IP
  4. davedx

    davedx Peon

    Messages:
    429
    Likes Received:
    21
    Best Answers:
    0
    Trophy Points:
    0
    #4
    BSc Hons. Computer Science, check :)

    In agreement there. Even though Google has evolved and updated its algorithm over time, the basic theory underlying it still holds and link buying, 302's etc. are examples of people using the knowledge of how the algos work to manipulate the results. Maybe there's some way that's not the same type of algorithm though, kinda my reason for starting this thread to see if anyone has any original ideas...

    Yeah, another reason I'm interested in it, precisely because it's a young field. Although I don't really have any truly new & original ideas I firmly believe there's still some waiting to be discovered.
     
    davedx, Mar 29, 2005 IP
  5. clasione

    clasione Notable Member

    Messages:
    2,362
    Likes Received:
    158
    Best Answers:
    0
    Trophy Points:
    228
    #5

    Good post and I agree with you 100%, a good background in Math somehow plays a role in understanding the algo....
     
    clasione, Mar 29, 2005 IP
  6. J.D.

    J.D. Peon

    Messages:
    1,198
    Likes Received:
    65
    Best Answers:
    0
    Trophy Points:
    0
    #6
    I think Google was good when there were thousands of websites, but now when the count is in millions, their results are less than good, except when you are looking for a very specific term. With many searches I end up getting back tons of forum posts dated back to 2002 and before, link-exchage pages, some blogs and it just is getting longer and longer to sieve through these.

    As for page ranking, it simply can't be just links. Otherwise cross-linkers would be the only ones in the search results. I can guarantee you that they are sampling actual clicks and browsing patterns.

    J.D.
     
    J.D., Mar 29, 2005 IP
  7. davedx

    davedx Peon

    Messages:
    429
    Likes Received:
    21
    Best Answers:
    0
    Trophy Points:
    0
    #7
    How do they sample browsing patterns, by using the G. cookie?
     
    davedx, Mar 31, 2005 IP
  8. J.D.

    J.D. Peon

    Messages:
    1,198
    Likes Received:
    65
    Best Answers:
    0
    Trophy Points:
    0
    #8
    They stick JS into their links from time to time and use redirection.

    J.D.
     
    J.D., Mar 31, 2005 IP