20M search queries released by aol

Discussion in 'All Other Search Engines' started by LemonTree, Aug 7, 2006.

  1. mad4

    mad4 Peon

    Messages:
    6,986
    Likes Received:
    493
    Best Answers:
    0
    Trophy Points:
    0
    #21
    Don't forget that AOL may have seeded the data with fake search terms to catch search engine spammers.

    Actually, forget that. They are clearly too stupid.:rolleyes: Google have probably seeded their data though.
     
    mad4, Aug 7, 2006 IP
  2. JasonBartholme

    JasonBartholme Peon

    Messages:
    396
    Likes Received:
    23
    Best Answers:
    0
    Trophy Points:
    0
    #22
    I got the completed torrent overnight, and I'm starting to tackle the data.

    It would be nice to see someone, with much more data mining experience, post a results summary. I beleive there is tons of good information to be learned from that list.
     
    JasonBartholme, Aug 7, 2006 IP
  3. LemonTree

    LemonTree Peon

    Messages:
    23
    Likes Received:
    1
    Best Answers:
    0
    Trophy Points:
    0
    #23
    If they were that smart they wouldn't have released the datas in the first place.
     
    LemonTree, Aug 7, 2006 IP
  4. moso

    moso Peon

    Messages:
    103
    Likes Received:
    1
    Best Answers:
    0
    Trophy Points:
    0
    #24
    I don't see why is so important? You could find related terms for a word from ouverture or adwords. I don't see the value of this information.
     
    moso, Aug 7, 2006 IP
  5. LemonTree

    LemonTree Peon

    Messages:
    23
    Likes Received:
    1
    Best Answers:
    0
    Trophy Points:
    0
    #25
    the value is in the fact that the data is unfiltered
     
    LemonTree, Aug 7, 2006 IP
  6. amaze

    amaze Active Member

    Messages:
    594
    Likes Received:
    4
    Best Answers:
    0
    Trophy Points:
    60
    #26
    Oops found it now..
     
    amaze, Aug 7, 2006 IP
  7. 1EightT

    1EightT Guest

    Messages:
    2,646
    Likes Received:
    71
    Best Answers:
    0
    Trophy Points:
    0
    #27
    By afternoon i'll have a section where you can search a live database of the data and get back results. I'll post when it is up and running
     
    1EightT, Aug 7, 2006 IP
  8. Monty

    Monty Peon

    Messages:
    1,363
    Likes Received:
    132
    Best Answers:
    0
    Trophy Points:
    0
    #28
    Would be great 1EightT, thanks.
     
    Monty, Aug 7, 2006 IP
  9. wyuguy

    wyuguy Peon

    Messages:
    103
    Likes Received:
    2
    Best Answers:
    0
    Trophy Points:
    0
    #29
    awesome work,hope see it soon
     
    wyuguy, Aug 7, 2006 IP
  10. Glen

    Glen Peon

    Messages:
    1,852
    Likes Received:
    91
    Best Answers:
    0
    Trophy Points:
    0
    #30
    Has this thread came up today? or yesterday..if not im surprised

    Check out www.techcrunch.com for more info if you havnt :)

    It's certainly a must need to know about "thing" ;)
     
    Glen, Aug 7, 2006 IP
  11. przemek

    przemek Guest

    Messages:
    49
    Likes Received:
    2
    Best Answers:
    0
    Trophy Points:
    0
    #31
    I found a page with links to download the AOL search data release. Many mirrors.

    http://aol-search-data.real-sol.com/

    Hope this is usefull.

    I forgot to mention i will design a tool to analize this list, maybe something like overture. any one have suggestions of what function would be usefull?
     
    przemek, Aug 8, 2006 IP
  12. seo-ireland

    seo-ireland Peon

    Messages:
    243
    Likes Received:
    12
    Best Answers:
    0
    Trophy Points:
    0
    #32
    Please add frequencies into your tool if you can. Pretty please :)

    There is already a tool online here http://www.aolsearchdatabase.com
     
    seo-ireland, Aug 8, 2006 IP
  13. przemek

    przemek Guest

    Messages:
    49
    Likes Received:
    2
    Best Answers:
    0
    Trophy Points:
    0
    #33
    This results arent very good, i think some processing needs to be done to the data so that i can be more usefull maybe like overture. Frequencies? are you reffering to how many searches per month?
     
    przemek, Aug 8, 2006 IP
  14. seo-ireland

    seo-ireland Peon

    Messages:
    243
    Likes Received:
    12
    Best Answers:
    0
    Trophy Points:
    0
    #34
    By frequencies I mean the number of times a particular keyphrase was used. E.g. 'blue widgets' was searched for 1233 times
     
    seo-ireland, Aug 8, 2006 IP
  15. reteep

    reteep Active Member

    Messages:
    181
    Likes Received:
    5
    Best Answers:
    0
    Trophy Points:
    58
    #35
    That makes me thinking of how dumb people must be at AOL.
     
    reteep, Aug 8, 2006 IP
  16. RedCardinal

    RedCardinal Peon

    Messages:
    349
    Likes Received:
    10
    Best Answers:
    0
    Trophy Points:
    0
    #36
    Well it takes a couple of hours to load this into a MySQL DB. But there is 2.1GB of search data goodness in there.

    Actually at worst looking at the data is good for the amusement factor alone, at best this is going to throw up some really interesting stuff from an SEO perspective.

    I am taking a look at the iterative approach people use to building search queries.

    Correct me if I'm wrong but isn't AOL search powered by Google? Are these results just repagkaged Google results?
     
    RedCardinal, Aug 8, 2006 IP
  17. wyuguy

    wyuguy Peon

    Messages:
    103
    Likes Received:
    2
    Best Answers:
    0
    Trophy Points:
    0
    #37
    is this data *only* from using AOL’s search engine, or does it include ALL searches done by AOL members (meaning searching using Google, Yahoo, etc) while logged in to AOL?
     
    wyuguy, Aug 8, 2006 IP
  18. venturefox

    venturefox Notable Member

    Messages:
    1,327
    Likes Received:
    38
    Best Answers:
    0
    Trophy Points:
    245
    #38
    Yes, AOL is powered by Google.. you're essentially seeing Google results.
     
    venturefox, Aug 8, 2006 IP
  19. mad4

    mad4 Peon

    Messages:
    6,986
    Likes Received:
    493
    Best Answers:
    0
    Trophy Points:
    0
    #39
    AOL is powered by google. So the data is AOL members using the google serps.
     
    mad4, Aug 8, 2006 IP
  20. cffoodie

    cffoodie Guest

    Messages:
    27
    Likes Received:
    2
    Best Answers:
    0
    Trophy Points:
    0
    #40
    Hey all.. we made a usable search tool based on the AOL data.. it basically works exactly like overture except that it returns 1000 results. Also, you can select the individual keyword returned and see the site that were clicked and the average position.. the domains is still replicating but you can visit

    dontdelete.com
    OR hit the IP
    63.212.167.185

    Let me know your thoughts..
     
    cffoodie, Aug 8, 2006 IP