getting rss entries that are not in the main xml file

Discussion in 'XML & RSS' started by ra_y_mon, Jul 13, 2009.

  1. #1
    Hi,

    I’m using PHP to get blog entries from a newspaper blog (that I know
    uses typepad). Everything is working: I can download, parse and print
    out the 10 first rss entries of the blog. The feed for the blog is in http://blogs.elpais.com/juan_cruz/rss.xml

    However, I need to download more than 10 entries… In fact, I need all entries from 2009. And here is where I am running crazy: that rss.xml file
    only contains 10 entries, and I cannot find a way to retrieve the
    following entries.

    I’ve been searching a solution for two days already, and I’ve found
    many places where they explain how to limit the number of entries
    retrieved. But I can’t find a way to retrieve those extra entries that
    do not appear in the RSS xml. In a document somewhere I read that sometimes XMLs contain a link to the continuation of the feed (a rel next tag or a continuation tag), but this particular feed doesn't appear to have one of those tags.

    I know this can be done, because if I subscribe to that blog in Google
    Reader and then scroll to the bottom of the entry list, Google Reader
    will automatically retrieve more entries and show them to me.
    Therefore, Google Reader is somehow finding out how to retrieve the
    next entries...

    I hope you can help me…

    Thank you for your time,

    Ramon
     
    ra_y_mon, Jul 13, 2009 IP
  2. ra_y_mon

    ra_y_mon Peon

    Messages:
    4
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #2
    I found the 'solution':

    - there is no way to retrieve 'old' posts using the RSS feed unless the provider has set it to do so...

    - you can bypass this problem by accessing old posts via Google. Google keeps all posts in their servers, so even if the RSS provider is no longer serving old posts, Google is. For example, in the RSS feed I was mentioning above, if you wanted to get the last 120 posts, you can access those doing:

    http://www.google.com/reader/atom/feed/http://blogs.elpais.com/juan_cruz/atom.xml?n=120


    I hope this helps people arriving to this post in the future...
     
    ra_y_mon, Jul 13, 2009 IP