简体   繁体   中英

php DOMDocument->getElementById->nodeValue sripping html

I am using php's DOMDocument->getElementById->nodeValue to set a particular DOM element's HTML. The problem is that the string is converted to HTML entities: eg: nodeValue = html_entity_decode('<b>test</b>'); should output 'test' but instead it outputs ' &lt;b&gt;test&lt;/b&gt; '

Any ideas why? This happens even if i don't use the html_entity_decode function

Here is my updated script...which is NOW working:

// Construct a DOM object for updating the affected node
$html = new DOMDocument("1.0", "utf-8"); 
if (!$html) return FALSE;

// Load the HTML file in question
$loaded = $html->loadHTMLFile($data['page_path']);
if (!$loaded)
{
    print 'Failed to load file';
    return FALSE;
}

// Establish the node being updated within the file
foreach ($data['divids'] as $divid)
{
    $element = $html->getElementById($divid);
    if (is_null($element))
    {
        print 'Failed to get existing element';
        return FALSE;
    }

    $newelement = $html->createElement('div');
    if (is_null($newelement))
    {
        print 'Failed to create new element';
        return FALSE;
    }
    $newelement->setAttribute('id', $divid);
    $newelement->setAttribute('class', 'reusable-block');

    // Perform the replacement
    $newelement->nodeValue = $replacement;
    $parent = $element->parentNode;
    $parent->replaceChild($newelement, $element);

    // Save the file back to its location
    $saved = $html->saveHTMLFile($data['page_path']);
    if (!$saved)
    {
        print 'Failed to save file';
        return FALSE;
    }
}

// Replace HTML entities left over
$content = files::readFile($data['page_path']);
$content = str_replace('&lt;', '<', $content);
$content = str_replace('&gt;', '>', $content);
if (!@fwrite($handle, $content))
{
    print 'Failed to replace entities';
    return FALSE;
}

This is proper behavior - your tag is being converted to a string, and strings in XML can't contain angle brackets (only tags can). Try converting the HTML into a DOMNode and appending it instead:

$node = $mydoc->createElement("b");
$node->nodeValue = "test";
$mydoc->getElementById("whatever")->appendChild($node);

Update with working example:

$html = '<html>
    <body id="myBody">
        <b id="myBTag">my old element</b>
    </body>
</html>';

$mydoc = new DOMDocument("1.0", "utf-8");
$mydoc->loadXML($html);

// need to do this to get getElementById() to work
$all_tags = $mydoc->documentElement->getElementsByTagName("*");
foreach ($all_tags as $element) {
    $element->setIdAttribute("id", true);
}

$current_b_tag = $mydoc->getElementById("myBTag");
$new_b_tag = $mydoc->createElement("b");
$new_b_tag->nodeValue = "my new element";
$result = $mydoc->getElementById("myBody");
$result->replaceChild($new_b_tag, $current_b_tag);

echo $mydoc->saveXML($mydoc->documentElement);

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM