I think I did post this some time ago, but can't find original thread to rehash. I have some questions regarding lifting data from a particular webpage. What makes this unusual and why I need to ask some questions is that within the tags, there is a lot of white space and the data doesn't actually sit within closed tags eg. <span>data</span>, it falls like this
With the "£6.99" being what I want to extract and use. for example, this code works perfectly for a different website.Code:<td width="55%"><div class="value"> £6.99 <font size="3"> </font></div> </td>
Where the url is http://www.directcosmetics.com/resul...mer&code=34744 which is where the first example where the whitespace is, how can I adjust the above regex to obtain the information required, in this case literally "6.99" and nothing more. Is it possible because it's not within closed brackets?Code:$url = 'http://www.cheapsmells.com/viewProduct.php?id=3462'; $html = file_get_contents($url); preg_match('/<div class=\'productOurPrice\'?>(.+?)(\d+\.\d+)(.+?)?<\/div>/', $html, $match); $out = $match[2];
Any help you can shed my way wouldbe greatly apprecviated.
Cheers ;D



Reply With Quote

Bookmarks