Little Help : Parsing Hmtl Syntax with WP web scraper

Discussion in 'WordPress' started by hectox, Aug 4, 2010.

  1. #1
    Hello,

    I need to use a wordpress plugins : http://wordpress.org/extend/plugins/wp-web-scrapper WP Web Scraper to extract the link of an audio tracks on a itunes web page.

    here's the page where i want to extract the link :

    http://itunes.apple.com/us/album/guero/id52311104
    Code (markup):
    here’s the link I want to extract on this page :

    http://a1.phobos.apple.com/us/r1000/033/Music/d2/3d/ac/mzm.pdxkvtef.aac.p.m4a
    Code (markup):
    And here's the part of the code i want to scrape :

    <tbody> <tr metrics-loc="Track_" adam-id="52311106" audio-preview-url="http://a1.phobos.apple.com/us/r1000/045/Music/b0/74/15/mzm.xueigfme.aac.p.m4a" preview-album="Guero" preview-artist="Beck" class="song music" preview-title="E-Pro" preview-duration="30000" row-number="0">
    
    Code (markup):
    So in my post I have type the following code :

    [wpws url="http://itunes.apple.com/us/album/guero/id52311104" selector="@audio-preview-url:eq(0)"]

    or

    [wpws url="http://itunes.apple.com/us/album/guero/id52311104" selector="audio-preview-url:eq(0)"]

    Unfortunately it doesn't work the syntax for the selector for the argument of the table seems not be correct!

    Someone Have an idea or know a similar plugins or Php function to have the same result?

    Thanks!
     
    hectox, Aug 4, 2010 IP