简体   繁体   中英

Get/filter table-row content from PHP

How do I get all the TITLE and LINKS reading from a file ?

A sample content of the file below

<tr class="odd">
 <td align="left" valign="top" class="text_cont_normal"> TITLE </td>
 <td align="left" valign="top" class="normal_text_link">
  <img border="0" onclick="javascript:window.location.href='LINK'" style="cursor: pointer;" alt="Download"  src="btn.jpg"/></td>
</tr>
<tr class="even">
 <td align="left" valign="top" class="text_cont_normal"> TITLE2 </td>
 <td align="left" valign="top" class="normal_text_link">
  <img border="0" onclick="javascript:window.location.href='LINK2'" style="cursor: pointer;" alt="Download"  src="btn.jpg"/></td>
</tr>

I tried

$tags = $doc->getElementsByTagName('img');
foreach ($tags as $tag) {
 if ($tag->hasAttribute('onclick'))
    echo $tag->getAttribute('onclick').'<br>';
}

But not getting the data which I actually want !

Like this, for example

$doc = new DOMDocument();
$doc->loadHTMLFile($filename);
$xpath = new DOMXPath($doc);
$nodes = $xpath->query('//td[@class="text_cont_normal"]');
 foreach($nodes as $node)
 {
    echo $node->nodeValue.'<br>';   // title
 }
$nodes = $xpath->query('//td[@class="normal_text_link"]/img[@alt="Download"]');
 foreach($nodes as $node)
 {
  if ($node->hasAttribute('onclick'))
     echo $node->getAttribute('onclick').'<br>';  //click
 }

If you need exactly the LINK then rewrite

  if ($node->hasAttribute('onclick'))
  {
      echo $node->getAttribute('onclick').'<br>';  //click
      preg_match('/location\.href=(\'|")(.*?)\\1/i', 
                 $node->getAttribute('onclick'), $matches);
      if (isset($matches[2])) echo $matches[2].'<br>'; // the value
  }

Or do you need them in groups?

One possible way:

$nodes = $doc->getElementsByTagName('tr');
$max = $nodes->length;
for ($i = 0; $i < $max; $i++)
{
    echo $nodes->item($i)->firstChild->nodeValue . '<br>';  // TITLE
    $onclick = $nodes->item($i)->childNodes->item(2)->childNodes->item(1)->getAttribute('onclick');
    $parts = explode("'", $onclick);
    echo $parts[1] . '<br>';  // LINK
}

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM