PHP DOMDocument: how do I get the element itself?

Question

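I am trying to read a website's content, but I have a problem: I want to get elements such as images and links, and I want the elements themselves rather than their content. In other words, I want to get the entire element.

How can I do this?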

<?php

    $ch = curl_init();

    curl_setopt($ch, CURLOPT_URL, "http://www.link.com");
    curl_setopt($ch, CURLOPT_RETURNTRANSFER, 1);

    $output = curl_exec($ch);

    $dom = new DOMDocument;
    @$dom->loadHTML($output);

    $items = $dom->getElementsByTagName('a');

    for($i = 0; $i < $items->length; $i++) {
        echo $items->item($i)->nodeValue . "<br />";
    }

    curl_close($ch);
?>

Answer 1

You appear to be asking for the serialized HTML of a DOMElement? E.g. you want a string containing <a href="http://example.org">link text</a>? (Please make your question clearer.)

$url = 'http://example.com';
$dom = new DOMDocument();
$dom->loadHTMLFile($url);

$anchors = $dom->getElementsByTagName('a');

foreach ($anchors as $a) {
    // Best solution, but only works with PHP >= 5.3.6
    $htmlstring = $dom->saveHTML($a);

    // Otherwise you need to serialize to XML and then fix the self-closing elements
    $htmlstring = saveHTMLFragment($a);
    echo $htmlstring, "\n";
}


function saveHTMLFragment(DOMElement $e) {
    $selfclosingelements = array('></area>', '></base>', '></basefont>',
        '></br>', '></col>', '></frame>', '></hr>', '></img>', '></input>',
        '></isindex>', '></link>', '></meta>', '></param>', '></source>',
    );
    // This is not 100% reliable because it may output namespace declarations.
    // But otherwise it is extra-paranoid to work down to at least PHP 5.1
    $html = $e->ownerDocument->saveXML($e, LIBXML_NOEMPTYTAG);
    // in case any empty elements are expanded, collapse them again:
    $html = str_ireplace($selfclosingelements, '>', $html);
    return $html;
}

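But note that what you are doing is dangerous, because it can mix encodings. It is better to make your output another DOMDocument and use importNode() to copy the nodes you want. Alternatively, use an XSL stylesheet.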
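
As a rough illustration of the importNode() suggestion, here is a minimal sketch. The input string, the wrapping div and the variable names are assumptions made for the example and are not part of the original answer; a real page would come from loadHTMLFile() or cURL instead.

```php
<?php
// Hypothetical input standing in for a fetched page.
$html = '<html><body><p>Intro <a href="http://example.org">link text</a></p>'
      . '<img src="logo.png" alt="Logo"></body></html>';

// Collect parser warnings internally instead of using the @ operator.
libxml_use_internal_errors(true);
$source = new DOMDocument();
$source->loadHTML($html);
libxml_clear_errors();

// A separate output document that will hold only the copied nodes.
$output = new DOMDocument('1.0', 'UTF-8');
$container = $output->appendChild($output->createElement('div'));

foreach (array('a', 'img') as $tag) {
    foreach ($source->getElementsByTagName($tag) as $node) {
        // importNode() with true makes a deep copy owned by $output;
        // the node in $source is left untouched.
        $container->appendChild($output->importNode($node, true));
    }
}

// Serialize the container and everything copied into it (PHP >= 5.3.6).
$result = $output->saveHTML($container);
echo $result, "\n";
```

Because every copied node ends up in one document, the output is serialized with a single encoding rather than being stitched together from strings.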

Answer 2

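I'm assuming you just copy-pasted some example code and didn't bother trying to learn how it works...

Anyway, the ->nodeValue part takes the element and returns the text content (because the element has a single text node child - if it had anything else, I don't know what nodeValue would give).

So, just remove the ->nodeValue and you have the element.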
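
To make the difference concrete, here is a small sketch with a hypothetical in-memory document (the markup and variable names are assumptions for illustration). Note that a DOMElement is an object and cannot be echoed directly, so once ->nodeValue is removed the element still has to be serialized, or its attributes read, before printing.

```php
<?php
$dom = new DOMDocument();
$dom->loadHTML(
    '<html><body><a href="http://example.org">link <b>text</b></a></body></html>'
);

// Without ->nodeValue, item(0) returns the DOMElement object itself.
$a = $dom->getElementsByTagName('a')->item(0);

// nodeValue on an element is the concatenated text of all its descendants.
$text = $a->nodeValue;

// Serializing the element yields its full markup (PHP >= 5.3.6).
$markup = $dom->saveHTML($a);

// Individual attributes are available directly on the element.
$href = $a->getAttribute('href');

echo $text, "\n", $markup, "\n", $href, "\n";
```

Here $text holds only "link text", while $markup holds the whole anchor including its href attribute and the nested b element.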
