Automatic URL reader and content extractor script in PHP or Java?

Discussion in 'Programming' started by varun8211, Nov 4, 2006.

  1. #1
    I need an automatic URL reader and content extractor script.

    1. I want to extract the URL, title, and short description from an RSS feed.
    2. The script should automatically extract the long description from the URL (taken from the feed).

    Step 1 is easy and straightforward.
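
    For readers following along, step 1 could be sketched in Java roughly as below. This is only an illustration, assuming a plain RSS 2.0 feed whose entries are <item> elements with <link>, <title> and <description> children. The class name RssReader and its Item type are hypothetical names made up for this sketch, and fetching the feed over HTTP is left out.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

/** Hypothetical helper: reads link, title and description from RSS 2.0 XML. */
final class RssReader {

    /** One feed entry holding the three fields named in step 1. */
    static final class Item {
        final String link;
        final String title;
        final String description;

        Item(String link, String title, String description) {
            this.link = link;
            this.title = title;
            this.description = description;
        }
    }

    /** Parses the feed text and returns one Item per <item> element. */
    static List<Item> parse(String rssXml) {
        try {
            DocumentBuilder builder =
                    DocumentBuilderFactory.newInstance().newDocumentBuilder();
            Document doc = builder.parse(new InputSource(new StringReader(rssXml)));
            NodeList nodes = doc.getElementsByTagName("item");
            List<Item> items = new ArrayList<>();
            for (int i = 0; i < nodes.getLength(); i++) {
                Element item = (Element) nodes.item(i);
                items.add(new Item(childText(item, "link"),
                                   childText(item, "title"),
                                   childText(item, "description")));
            }
            return items;
        } catch (Exception e) {
            // Assumption: malformed feeds are reported as a bad argument.
            throw new IllegalArgumentException("Could not parse feed", e);
        }
    }

    /** Returns the trimmed text of the first matching child, or "" if absent. */
    private static String childText(Element parent, String tag) {
        NodeList found = parent.getElementsByTagName(tag);
        return found.getLength() == 0 ? "" : found.item(0).getTextContent().trim();
    }
}
```

    Each Item's link would then be the URL to download for step 2.
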
    For step 2, for example, I want to extract the main story from URLs like these:
    1. http://indiabullion.com/viewnews.php?n_id=34355
    2. http://timesofindia.indiatimes.com/articleshow/324892.cms
    and so on, and then save each story to a text file.

    Is this possible?
     
    varun8211, Nov 4, 2006
  2. jeremyweber

    #2
    Completely possible. The hardest part of something like this is always extracting the story. Usually you need custom code for each site you are extracting from.
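
    As a rough idea of what that per-site custom code might look like, here is a minimal Java sketch. It assumes the story on a given site always sits between two fixed pieces of markup, which you would find by viewing the page source. The class name StoryExtractor and the marker strings are hypothetical, and stripping tags with a regular expression is a simplification that will not cope with every page.

```java
import java.util.regex.Pattern;

/** Hypothetical helper: cuts the story out of a page using two markers. */
final class StoryExtractor {

    private static final Pattern TAGS = Pattern.compile("<[^>]+>");
    private static final Pattern WHITESPACE = Pattern.compile("\\s+");

    /**
     * Returns the text found between startMarker and endMarker with HTML
     * tags removed, or an empty string if either marker is missing.
     */
    static String extract(String html, String startMarker, String endMarker) {
        int start = html.indexOf(startMarker);
        if (start < 0) {
            return "";
        }
        start += startMarker.length();
        int end = html.indexOf(endMarker, start);
        if (end < 0) {
            return "";
        }
        String fragment = html.substring(start, end);
        // Replace each tag with a space so adjacent paragraphs do not fuse.
        String text = TAGS.matcher(fragment).replaceAll(" ");
        return WHITESPACE.matcher(text).replaceAll(" ").trim();
    }
}
```

    The markers are the part that has to be worked out separately for every site, and they break whenever the site changes its layout.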
     
    jeremyweber, Nov 9, 2006
  3. ccoonen

    #3
    Yup, find out the template structure each site uses, then extract each article using that site's template :)
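
    One way to organise the "one template per site" idea is a small lookup keyed by host name, sketched below in Java. The class name SiteTemplates and its methods are hypothetical. Each site's template is represented here simply as a function from page HTML to story text, so how each function does its extraction is left open.

```java
import java.net.URI;
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;
import java.util.Optional;
import java.util.function.Function;

/** Hypothetical registry: picks the extraction rule by the article's host. */
final class SiteTemplates {

    private final Map<String, Function<String, String>> byHost = new HashMap<>();

    /** Registers the extractor to use for pages served from the given host. */
    void register(String host, Function<String, String> extractor) {
        byHost.put(host.toLowerCase(Locale.ROOT), extractor);
    }

    /**
     * Applies the extractor registered for the URL's host to the page HTML.
     * Returns an empty Optional when no template is known for that host.
     */
    Optional<String> extract(String articleUrl, String html) {
        String host = URI.create(articleUrl).getHost();
        if (host == null) {
            return Optional.empty();
        }
        Function<String, String> extractor = byHost.get(host.toLowerCase(Locale.ROOT));
        if (extractor == null) {
            return Optional.empty();
        }
        return Optional.of(extractor.apply(html));
    }
}
```

    A URL from a site with no registered template comes back empty, which makes it easy to log the hosts that still need a template written.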
     
    ccoonen, Nov 10, 2006