htmlentities（）在字符串中雙重編碼實體

Question

我只希望將未編碼的字符轉換為html實體，而不會影響已存在的實體。 我有一個以前編碼實體的字符串，例如：

gaIUSHIUGhj>&hyphen; hjb&times;jkn.jhuh>hh> &hellip;

當我使用htmlentities() ，實體的開始處的&再次被編碼。 這意味着&hyphen; 和其他實體有&編碼到& ：

&amp;times;

我嘗試解碼完整的字符串，然后再次編碼，但似乎沒有正常工作。 這是我試過的代碼：

header('Content-Type: text/html; charset=iso-8859-1');
...

$b = 'gaIUSHIUGhj>&hyphen; hjb&times;jkn.jhuh>hh> &hellip;';
$b = html_entity_decode($b, ENT_QUOTES, 'UTF-8');
$b = iconv("UTF-8", "ISO-8859-1//TRANSLIT", $b);
$b = htmlentities($b, ENT_QUOTES, 'UTF-8');

但它似乎沒有正確的方式。 有沒有辦法防止或阻止這種情況發生？

Answer 1

將可選的$double_encode變量設置為false 。 有關更多信息，請參閱文檔。

生成的代碼應如下所示：

$b = htmlentities($b, ENT_QUOTES, 'UTF-8', false);

Answer 2

你很好看文檔，但你錯過了最好的部分。 有時可能很難破譯這個：

//     >    >    >    >    >    >    Scroll    >>>    >    >    >    >    >     Keep going.    >    >    >    >>>>>>  See below.  <<<<<<
string htmlentities ( string $string [, int $flags = ENT_COMPAT | ENT_HTML401 [, string $encoding = 'UTF-8' [, bool $double_encode = true ]]] )

^{看看最后。}

我知道。 混亂。 我通常會忽略簽名行，然后直接進入下一個塊（ Parameters ），查看每個參數上的blurb。

所以你想在最后使用double_encoded參數告訴htmlentities不要重新編碼（你可能想要堅持使用UTF-8除非你有特殊的理由不這樣做）：

$str = "gaIUSHIUGhj>&hyphen; hjb&times;jkn.jhuh>hh> &hellip;";

// Double-encoded!
echo htmlentities($str, ENT_COMPAT, 'utf-8', true) . "\n";

// Not double-encoded!
echo htmlentities($str, ENT_COMPAT, 'utf-8', false);

https://ignite.io/code/513ab23bec221e4837000000

htmlentities（）在字符串中雙重編碼實體

問題描述

2 個解決方案

解決方案1
6 2013-03-09 03:52:56

解決方案2
5 2013-03-09 03:59:41

htmlentities（）在字符串中雙重編碼實體

問題描述

2 個解決方案

解決方案1 6 2013-03-09 03:52:56

解決方案2 5 2013-03-09 03:59:41

解決方案1
6 2013-03-09 03:52:56

解決方案2
5 2013-03-09 03:59:41