It can return a simple summary title/description, in any programming language, ... it's more like a text processor/formattor. Is it possible? Thanks!
Possible? defintely. You may be able to pick up some PHP script and do some customizations to fit your needs. Lots of similar PHP scripts are freely available.
Do you want to do this, server-side.. or client-side? Ie.. do you want a web-spider? or a script that you can run on your server?
I can not agree. Yes PHP is slower than compiled applications, but it's server side and easier to implement. For test I've added time counters to Keyword Density Analyzer. So the results for test url http://www.w3c.org (~35 KB) : Download time 1.0035 sec., processing time 1.5908 sec. I think it is not so bad for scripting language to process 35 KB of text. Sure implementation depends on goals, so if performance is important, you need a compiled language. But PHP still good for simple tasks like that.
Client-side will be very good! That do not cost to my server! Server-side is also good, but it should not cost a lot of my server's CPU, otherwise... Are there any JavaScript, PHP or C/C++ code available? I have not find any... Thanks!
I wrote a spider in PHP While indivual page downloads to retrieve the text / links was not a problem at all, it was the recurring find links, add links to the unvisited list, and continue going that was causing problems. I wrote it using CURL / Sockets, and was a little disapointed in it. Had I wrote it in PERL the same way, I would have seen much better performance. With that being said, I still love PHP as it is my baby Just not very good for this type of thing!
sounds like a case where http://developer.yahoo.com/search/content/V1/termExtraction.html would be perfect