How do I parse incoming RSS feeds for XML headers?

Discussion in 'XML & RSS' started by jwalzer, Jan 7, 2008.

  1. #1
    I'd like to parse my incoming RSS feeds, pull the URL, title, description, author, tags, and any other relevant information out of the XML, and dump it into a database that I could search. Is this possible? What tools would I need to accomplish this?

    Thanks,
    Jeff
     
    jwalzer, Jan 7, 2008
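
As a rough illustration of what post #1 asks for, here is a minimal sketch in Python using only the standard library (`xml.etree.ElementTree` and `sqlite3`). Nobody in the thread used Python, so treat it as one possible approach rather than a recommendation from the posters. The sample feed, the `articles` table, and the helper names `parse_items`, `store_items`, and `search` are all made up for the example.

```python
import sqlite3
import xml.etree.ElementTree as ET

# Invented sample feed, used only for illustration.
SAMPLE_RSS = """<?xml version="1.0"?>
<rss version="2.0">
  <channel>
    <title>Example Feed</title>
    <link>http://example.com/</link>
    <description>Sample channel</description>
    <item>
      <title>First post</title>
      <link>http://example.com/first</link>
      <description>Parsing RSS with the standard library</description>
      <author>jeff@example.com (Jeff)</author>
      <category>xml</category>
      <category>rss</category>
    </item>
    <item>
      <title>Second post</title>
      <link>http://example.com/second</link>
      <description>Storing items in SQLite</description>
    </item>
  </channel>
</rss>"""


def parse_items(rss_text):
    """Return one dict per <item>; missing elements become empty strings."""
    root = ET.fromstring(rss_text)
    items = []
    for item in root.findall("./channel/item"):
        items.append({
            "url": item.findtext("link", default=""),
            "title": item.findtext("title", default=""),
            "description": item.findtext("description", default=""),
            "author": item.findtext("author", default=""),
            # An item may carry several <category> elements; join them.
            "tags": ",".join(c.text or "" for c in item.findall("category")),
        })
    return items


def store_items(conn, items):
    """Create the (hypothetical) articles table and insert the parsed items."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS articles "
        "(url TEXT PRIMARY KEY, title TEXT, description TEXT, author TEXT, tags TEXT)"
    )
    conn.executemany(
        "INSERT OR REPLACE INTO articles "
        "VALUES (:url, :title, :description, :author, :tags)",
        items,
    )
    conn.commit()


def search(conn, term):
    """Simple substring search over title and description."""
    pattern = f"%{term}%"
    rows = conn.execute(
        "SELECT url, title FROM articles "
        "WHERE title LIKE ? OR description LIKE ? ORDER BY url",
        (pattern, pattern),
    )
    return rows.fetchall()


conn = sqlite3.connect(":memory:")
store_items(conn, parse_items(SAMPLE_RSS))
print(search(conn, "SQLite"))
```

Using the link as the primary key means re-importing the same feed overwrites existing rows instead of duplicating them. A real feed would be fetched over HTTP first and may need namespace handling for extension elements.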
  2. akram

    #2
    akram, Jan 9, 2008
  3. ajsa52

    #3
    I'm doing that in Perl using the XML::XPath module.
     
    ajsa52, Jan 9, 2008
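
The XPath approach from post #3 can be sketched without Perl as well. The example below is not ajsa52's code and does not use XML::XPath; it uses the limited XPath subset built into Python's `xml.etree.ElementTree`, on an invented sample feed. It assumes the feed carries a Dublin Core `dc:creator` element, which many feeds use in place of `author`, to show how namespaced elements are addressed.

```python
import xml.etree.ElementTree as ET

# Invented sample feed with a Dublin Core extension element.
SAMPLE_RSS = """<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>Example Feed</title>
    <item>
      <title>Alpha</title>
      <link>http://example.com/alpha</link>
      <dc:creator>Ann</dc:creator>
      <category>perl</category>
    </item>
    <item>
      <title>Beta</title>
      <link>http://example.com/beta</link>
      <dc:creator>Bob</dc:creator>
      <category>xml</category>
    </item>
  </channel>
</rss>"""

# Prefix-to-URI map so path expressions can use the "dc:" prefix.
NS = {"dc": "http://purl.org/dc/elements/1.1/"}

root = ET.fromstring(SAMPLE_RSS)

# Every item title, addressed by an explicit path from the root element.
titles = [t.text for t in root.findall("./channel/item/title")]

# Namespaced element: the author name stored as dc:creator.
creators = [c.text for c in root.findall(".//item/dc:creator", NS)]

# Predicate: only items that have a <category> child whose text is "xml".
xml_links = [i.findtext("link") for i in root.findall(".//item[category='xml']")]

print(titles)
print(creators)
print(xml_links)
```

ElementTree covers only simple paths and predicates; a full XPath engine such as the Perl module mentioned above supports considerably more.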
  4. jwalzer

    #4
    Thanks for the reply, akram.

    Is there a site or document that spells out the headers within the RSS feed itself? I'd like to determine if there's any relevant info I can pull out of an RSS feed before dropping it into my database. Basically, is there enough info within the headers of the feed itself for the article to be useful for analysis?

    Thanks,
    Jeff
     
    jwalzer, Jan 10, 2008
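
On the question in post #4: the RSS 2.0 specification defines the sub-elements an `<item>` may carry (title, link, description, author, category, comments, enclosure, guid, pubDate, source). All of them are optional, as long as at least one of title or description is present, so what a given feed actually provides varies. One way to find out is to count which elements its items contain. The sketch below does that in Python on an invented sample feed; `item_field_counts` is a hypothetical helper name, not part of any library.

```python
from collections import Counter
import xml.etree.ElementTree as ET

# Invented sample feed; real feeds will differ in which fields they fill in.
SAMPLE_RSS = """<rss version="2.0">
  <channel>
    <title>Example Feed</title>
    <item>
      <title>One</title>
      <link>http://example.com/1</link>
      <pubDate>Mon, 07 Jan 2008 10:00:00 GMT</pubDate>
      <author>jeff@example.com</author>
    </item>
    <item>
      <title>Two</title>
      <link>http://example.com/2</link>
      <pubDate>Wed, 09 Jan 2008 10:00:00 GMT</pubDate>
    </item>
    <item>
      <title>Three</title>
      <description>No link or date on this one</description>
    </item>
  </channel>
</rss>"""


def item_field_counts(rss_text):
    """Count, for each child tag, how many <item> elements contain it."""
    root = ET.fromstring(rss_text)
    counts = Counter()
    for item in root.iter("item"):
        # A set counts each tag once per item, even if it repeats
        # (for example several <category> elements in one item).
        counts.update({child.tag for child in item})
    return counts


counts = item_field_counts(SAMPLE_RSS)
for tag, n in sorted(counts.items()):
    print(f"{tag}: present in {n} item(s)")
```

Running this against a few of the feeds you plan to import shows quickly whether fields like author or pubDate are filled in often enough to be worth a database column.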
  5. sharqi

    #5
    Magpie is great; I have used it often.

    Read the docs; they have good examples of what you need to do.
     
    sharqi, Jan 25, 2008