How to find data using a pattern?

Discussion in 'Programming' started by vinay_fop, Aug 22, 2007.

  1. #1
    Hi Everybody,

    I want to know how to extract the data between "<table cellspacing="0">" and "</table>", i.e. I need a regular expression pattern. The data should come back as an array, with one entry for each "<table cellspacing="0">" ... "</table>" block. The demo string to search is given below.
    Please help me find the data between these patterns.



    Regards
    Binay k




    <table cellspacing="0">
    <tr>
    <th rowspan="3" scope="row">General</th>
    <td class="ttl"><a href=# onClick="helpW('h_ntype.htm');">Network</a></td>
    <td class="nfo">UMTS / GSM 850 / 900 / 1800 / 1900</td>
    </tr><tr>
    <td class="ttl"><a href=# onClick="helpW('h_year.htm');">Announced</a></td>
    <td class="nfo">2007, August</td>
    </tr><tr>
    <td class="ttl"><a href=# onClick="helpW('h_status.htm');">Status</a></td>

    <td class="nfo">Coming Soon</td>
    </tr>
    </table><table cellspacing="0">
    <tr>
    <th rowspan="2" scope="row">Size</th>
    <td class="ttl"><a href=# onClick="helpW('h_dimens.htm');">Dimensions</a></td>
    <td class="nfo">112 x 45 x 11.3 mm, 55 cc</td>
    </tr><tr>
    <td class="ttl"><a href=# onClick="helpW('h_weight.htm');">Weight</a></td>
    <td class="nfo">101 g</td>
    </tr>

    </table><table cellspacing="0">
    <tr>
    <th rowspan="2" scope="row">Display</th>
    <td class="ttl"><a href=# onClick="helpW('h_dtype.htm');">Type</a></td>
    <td class="nfo">OLED, 16M colors</td>
    </tr><tr>
    <td class="ttl"><a href=# onClick="helpW('h_dsize.htm');">Size</a></td>
    <td class="nfo">240 x 320 pixels, 2 inches</td>
    </tr>


    </table><table cellspacing="0">
    <tr>

    <th rowspan="3" scope="row">Ringtones</th>
    <td class="ttl"><a href=# onClick="helpW('h_ringtype.htm');">Type</a></td>
    <td class="nfo">Polyphonic (64 channels), MP3</td>
    </tr>



    <tr><td class="ttl"><a href=# onClick="helpW('h_ringcustom.htm');">Customization</a></td>
    <td class="nfo">Download</td></tr>

    <tr>
    <td class="ttl"><a href=# onClick="helpW('h_vibrat.htm');">Vibration</a> </td>

    <td class="nfo">Yes</td>
    </tr>

    </table><table cellspacing="0">
    <tr>
    <th rowspan="4" scope="row">Memory</th>
    <td class="ttl"><a href=# onClick="helpW('h_number.htm');">Phonebook</a></td>
    <td class="nfo">Yes</td>
    </tr>
    <tr>
    <td class="ttl"><a href=# onClick="helpW('h_callr.htm');">Call records</a></td>
    <td class="nfo">20 dialed, 20 received, 20 missed calls</td>

    </tr><tr>
    <td class="ttl"><a href=# onClick="helpW('h_cslot.htm');">Card slot</a></td>

    <td class="nfo">No</td></tr>
    <tr><td class="ttl">&nbsp;</td><td class="nfo">- 1GB internal memory</td></tr>


    </td>
    </tr>
    </table><table cellspacing="0">
    <tr>
    <th rowspan="8" scope="row">Data</th>
    <td class="ttl"><a href=# onClick="helpW('h_gprs.htm');">GPRS</a></td>

    <td class="nfo">Class 32, 107 kbps</td>
    </tr>
    <tr>
    <td class="ttl"><a href=# onClick="helpW('h_hscsd.htm');">HSCSD</a></td>
    <td class="nfo">Yes</td>
    </tr>
    <tr>
    <td class="ttl"><a href=# onClick="helpW('h_edge.htm');">EDGE</a></td>
    <td class="nfo">Class 32, 296 kbps</td>
    </tr>
    <tr>
    <td class="ttl"><a href=# onClick="helpW('h_3g.htm');">3G</a></td>

    <td class="nfo">Yes, 384 kbps</td>
    </tr>
    <tr>
    <td class="ttl"><a href=# onClick="helpW('h_wlan.htm');">WLAN</a></td>
    <td class="nfo">No</td>
    </tr>
    <tr>
    <td class="ttl"><a href=# onClick="helpW('h_bluetooth.htm');">Bluetooth</a></td>
    <td class="nfo">Yes, v2.0</td>
    </tr>
    <tr>
    <td class="ttl"><a href=# onClick="helpW('h_infrar.htm');">Infrared port</a></td>

    <td class="nfo">No</td>
    </tr>
    <tr>
    <td class="ttl"><a href=# onClick="helpW('h_usb.htm');">USB</a></td>
    <td class="nfo">Yes, microUSB</td>
    </tr>
    </table><table cellspacing="0">
    <tr>
    <th rowspan="8" scope="row">Features</th>

    <td class="ttl"><a href=# onClick="helpW('h_messg.htm');">Messaging</a></td>
    <td class="nfo">SMS, MMS, Email</td>

    </tr><tr>
    <td class="ttl"><a href=# onClick="helpW('h_browser.htm');">Browser</a></td>
    <td class="nfo">WAP 2.0/xHTML</td>
    </tr><tr>
    <td class="ttl"><a href=# onClick="helpW('h_games.htm');">Games</a></td>
    <td class="nfo">Yes + downloadable</td>
    </tr><tr>
    <td class="ttl"><a href=# onClick="helpW('h_colors.htm');">Colors</a></td>
    <td class="nfo">Black</td>
    </tr>


    <tr>
    <td class="ttl"><a href=# onClick="helpW('h_camera.htm');">Camera</a></td>
    <td class="nfo">2 MP, 1600x1200 pixels, video(QCIF), flash</td>
    </tr>
    <tr><td class="ttl">&nbsp;</td><td class="nfo">- Java MIDP 2.0<br />
    - UPnP technology<br />
    - MP3/AAC/eAAC+/WMA player<br />
    - FM radio<br />
    - T9<br />
    - Calendar<br />

    - Calculator <br />
    - Built-in handsfree<br />
    - Voice memo<br />
    - Living wallpaper</td></tr>


    </table><table cellspacing="0">
    <tr>
    <th rowspan="3" scope="row">Battery</th>
    <td class="ttl">&nbsp;</td>
    <td class="nfo">Standard battery, Li-Ion 850 mAh (BL-6P)</td>
    </tr><tr>

    <td class="ttl"><a href=# onClick="helpW('h_standb.htm');">Stand-by</a></td>
    <td class="nfo">Up to 240 h</td>
    </tr><tr>
    <td class="ttl"><a href=# onClick="helpW('h_talkta.htm');">Talk time</a></td>
    <td class="nfo">Up to 3 h</td>
    </tr>

    </table>
     
    vinay_fop, Aug 22, 2007
  2. krt

    #2
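    This should get you started (note that it does not take the category headings into account):
    <?php
    
    // get the page contents, e.g. with fopen or curl
    $contents = '<html>...';
    
    // get data titles/values; "#" is the pattern delimiter so the "/" in "</td>" needs no escaping
    preg_match_all('#<td class="(ttl|nfo)">(.+?)</td>#s', $contents, $m);
    // $m[1] holds the class of each cell (ttl or nfo), $m[2] holds the cell contents
    print_r($m);
    
    ?>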
    I hope you aren't planning on using this data for your own site or something similar.
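
For the per-table array the original question asks for, something along these lines might work. This is only a rough sketch: `extract_tables()` is a made-up helper name, the `$contents` string is a shortened stand-in for the demo markup, and it assumes the tables are never nested and always open with exactly `<table cellspacing="0">`.

```php
<?php
// Hypothetical sample input: two small tables shaped like the markup in the question.
$contents = '<table cellspacing="0">
<tr><th rowspan="2" scope="row">Size</th>
<td class="ttl"><a href=# onClick="helpW(\'h_dimens.htm\');">Dimensions</a></td>
<td class="nfo">112 x 45 x 11.3 mm, 55 cc</td></tr>
<tr><td class="ttl"><a href=# onClick="helpW(\'h_weight.htm\');">Weight</a></td>
<td class="nfo">101 g</td></tr>
</table><table cellspacing="0">
<tr><th rowspan="1" scope="row">Display</th>
<td class="ttl"><a href=# onClick="helpW(\'h_dtype.htm\');">Type</a></td>
<td class="nfo">OLED, 16M colors</td></tr>
</table>';

// extract_tables() is a hypothetical helper, not a built-in PHP function.
function extract_tables(string $html): array
{
    $result = [];
    // One match per <table cellspacing="0"> ... </table> block.
    // Non-greedy (.*?) stops at the first </table>; the s modifier lets . match newlines.
    preg_match_all('#<table cellspacing="0">(.*?)</table>#s', $html, $tables);

    foreach ($tables[1] as $body) {
        // Category heading from the <th> cell, or an empty string if there is none.
        $heading = preg_match('#<th[^>]*>(.*?)</th>#s', $body, $h)
            ? trim(strip_tags($h[1]))
            : '';

        // Title (ttl) and value (nfo) cells in document order.
        preg_match_all('#<td class="(ttl|nfo)">(.*?)</td>#s', $body, $cells, PREG_SET_ORDER);

        $rows = [];
        $title = '';
        foreach ($cells as $cell) {
            $text = trim(strip_tags($cell[2]));
            if ($cell[1] === 'ttl') {
                $title = $text;               // remember the title until its value arrives
            } else {
                $rows[] = ['title' => $title, 'value' => $text];
            }
        }
        $result[] = ['heading' => $heading, 'rows' => $rows];
    }
    return $result;
}

$tables = extract_tables($contents);
print_r($tables);
```

Each element of `$tables` then corresponds to one table, with its heading and its title/value pairs. Regular expressions are fragile against markup changes, so if the page is less regular than the sample, PHP's DOMDocument class is probably a safer way to walk the tables.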
     
    krt, Aug 23, 2007