We're vuilding full text search indexes on our site right now. When that is done the searchs should be lightning fast, and pretty relavent.
ok cool thanks. I had tried to put in % and * but I forgot about the [] thingies. The server seems overloaded right now though so I can't try it out.
Download location: http://www.gregsadetsky.com/aol-data/ Tool to grep data on linux etc: http://pegasus.rutgers.edu/~elflord/unix/grep.html Tool to grep data on Windows: http://www.wingrep.com/download.htm Tool to open massive text files instantly: http://www.swiftgear.com/ltfviewer/features.html Mini Search Engines: http://simplifiedsec.com/KeywordDigger.html www.aolsearchdatabase.com http://dontdelete.com Thanks Dr N
I couldn't think of anything useful I could do with that weird little program, what are you using it for? I can add SPSS to the list, it has quiete a learning curve but you can do a lot with it. If you can get your hands on a copy, SPSS Text Analyzer would probably be even better.
Peach all resources are welcome Please list and it is up to everyone to go and experiment Thanks Dr n
How can I get a list of these search phrases in order of popularity? Just simply the most frequently searched to the least in order?
this isnt a gonna earn anyone a quick buck but if you think in terms of research and long term strategy then it is a gold mine.
Yep.. basically the value in this list is the fact that you can see what keywords are searched for and what sites people are going to. It's the other half that Overture doesn't give you. So, if you go to www.dontdelete.com and search for "food" you'll find any and every keyword with "food" in it... the results are ordered by the number of times it was searched over the three month period. Then.. if you find a keyword/phrase of interest.. click it to see the sites that people are acually visiting from that keyword search. Let me know if that clarifies..
Some really weired stats are displayed. Some very popular keywords showing less than 50 search count :x
It's nice to click on the results on www.dontdelete.com and see what sites people were visiting. A co-worker said we might as well delete anything containing "www" in the query column because those results are about useless. Anyone have any reason we should keep those query strings?
You would not believe how many people don't know the difference between the address box and a search box. I would put up a site and get asked why it wasn't coming up; then I find they are actually searching for it, and don't know it. Don't know if that helps figure out a use for them, but I would imagine there are a lot of them.
Right, cffoodie, I understand that, and I am definitely using that information for particular keywords and niches. What I'm looking for is someone who has simply sorted the list by popularity (# of times searched) only, or a tool that can do this. This way, I can see which terms overall are the most commonly searched. That information would also be very valuable.
looking for the most popular keywords is pointless, that information is available on the web anyway if you look hard enough, this data is worth so much more than just a list of the highest searched for keywords. The value is not in the short one or two keyword queries but in the three to four keyword queries which have less competition and often convert very well. Also by anaylsing the data you can see how a user refines their searche when they dont find what they are looking for.
Not true at all, trust me. And to know the order of search terms' popularity in a fairly large sampling of what AOL (google) users are searching is priceless. Sure, you can shell out a few thousand for other lists (alexa top 100k, etc). Can any one provide the list sorted by popularity of keywords?
Anyone trying to open these files on a Mac like I did may not have luck with Stuffit. For some reason it kept extracting *.txt.gz as folders instead of files. Use UnTar instead to extract the files! I've been dumping to my SQL DB for the last 30 minutes... I don't even know what I want to do with the data... can't stop pinching myself... ooh, I can look up "pinching" and see if there's a niche!!!
This is not a large sampling. Compared to lists like Keyword Discovery, Overture or Wordtracker it is tiny. For example, for the keyword "vacation", we get the following number of searches from various sources: Wordtracker: 154,699 Overture: 18,021,562 Keyword Discovery: 29,952,402 AOL Data: 211 Yikes! That's a pretty small sample of data to base your business plans on.
See: http://fuaol.com/ I think that might be based on just the first 1/10 of the data, but I'm not sure.