Getting a SimpleXMLElement from a meta description [duplicate]

Question

This question already has an answer here:

How to get Open Graph Protocol of a webpage by php? (6 answers)

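I am trying to retrieve some metadata that is held in a SimpleXMLElement. I am using XPath, but I am struggling to get the values I am interested in.

Below is an excerpt from the head of the web page (taken from: http://www.wayfair.de/CleverFurn-Couchtisch-Abby-69318X2-MFE2223.html):

<meta xmlns:og="http://opengraphprotocol.org/schema/" property="og:title" content="CleverFurn Couchtisch &quot;Abby&quot;" />

Do you know how to retrieve all of the xmlns data (the og: properties) into an array containing:

1) og:type 2) og:url 3) og:image ... x) og:upc

Here is my PHP code: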

<?php
$html = file_get_contents("http://www.wayfair.de/CleverFurn-Couchtisch-Abby-69318X2-MFE2223.html");
$doc = new DOMDocument();
$doc->strictErrorChecking = false;
$doc->recover = true;
@$doc->loadHTML("<html><body>" . $html . "</body></html>");

$xpath = new DOMXpath($doc);
$elements = $xpath->query("//*/meta[@property='og:url']");

if (!is_null($elements)) {
    foreach ($elements as $element) {
        echo "<br/>[" . $element->nodeName . "]";
        var_dump($element);
        $nodes = $element->childNodes;
        foreach ($nodes as $node) {
            echo $node->nodeValue . "\n";
        }
    }
}
?>

Answer 1

Just found the answer:

How to get Open Graph Protocol of a webpage by php?

<?php
$html = file_get_contents("http://www.wayfair.de/CleverFurn-Couchtisch-Abby-69318X2-MFE2223.html");

// Collect libxml parse warnings internally instead of silencing them with @.
libxml_use_internal_errors(true);

$doc = new DOMDocument();
$doc->loadHTML($html);

$xpath = new DOMXPath($doc);

// Match every <meta> element whose property attribute starts with "og:".
$query = "//*/meta[starts-with(@property, 'og:')]";
$metas = $xpath->query($query);

// Initialise the result so var_dump() still works when nothing matches.
$rmetas = [];
foreach ($metas as $meta) {
    // The values live in attributes; <meta> elements have no child nodes.
    $property = $meta->getAttribute('property');
    $content = $meta->getAttribute('content');
    $rmetas[$property] = $content;
}
var_dump($rmetas);
?>
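
For anyone who wants to try this without fetching the live page, here is a minimal sketch of the same approach wrapped in a function and run against an inline HTML string. The function name extractOpenGraph and the sample markup are my own assumptions for illustration; only the og:title tag comes from the question, and the og:type value is made up.

```php
<?php
// Hypothetical helper: the name and signature are illustrative assumptions,
// not part of the original answer.
function extractOpenGraph(string $html): array
{
    // Buffer libxml warnings (real-world HTML is rarely valid) and restore
    // the caller's setting afterwards.
    $previous = libxml_use_internal_errors(true);
    $doc = new DOMDocument();
    $doc->loadHTML($html);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    $xpath = new DOMXPath($doc);
    $tags = [];
    // Same XPath idea as above: every <meta> whose property starts with "og:".
    foreach ($xpath->query("//meta[starts-with(@property, 'og:')]") as $meta) {
        $tags[$meta->getAttribute('property')] = $meta->getAttribute('content');
    }
    return $tags;
}

// Sample markup: the og:title line is from the question; the other two
// <meta> tags are invented test data.
$sample = <<<'HTML'
<html><head>
<meta xmlns:og="http://opengraphprotocol.org/schema/" property="og:title" content="CleverFurn Couchtisch &quot;Abby&quot;" />
<meta property="og:type" content="product" />
<meta name="description" content="not an Open Graph tag" />
</head><body></body></html>
HTML;

$og = extractOpenGraph($sample);
print_r($og);
?>
```

The plain description tag is skipped because it has no property attribute, and getAttribute() returns the content with the &quot; entities already decoded.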

Solution 1
1 2014-07-13 12:44:26
