如果與號尚未編碼，如何對其進行編碼？

Question

我需要 ac# 方法來編碼＆符號，如果它們尚未編碼或另一個編碼的 epxression 的一部分

例如

"tom & jill" should become "tom &amp; jill"


"tom &amp; jill" should remain "tom &amp; jill"


"tom &euro; jill" should remain "tom &euro; jill"


"tom <&> jill" should become "tom <&amp;> jill"


"tom &quot;&&quot; jill" should become "tom &quot;&amp;&quot; jill"

Answer 1

您真正想要做的是首先解碼字符串，然后再次對其進行編碼。 不要費心嘗試修補編碼的字符串。

任何編碼只有在可以輕松解碼的情況下才有價值，因此請重用該邏輯以使您的生活更輕松。 並且您的軟件不易出錯。

現在，如果您不確定字符串是否已編碼 - 問題肯定不是字符串本身，而是產生字符串的生態系統。 你從哪里得到的？ 在它到達你面前之前，它經過了誰？ 你相信嗎？

如果你真的不得不求助於創建一個magic-fix-weird-data函數，那么考慮建立一個“編碼”及其對應字符的表：

&amp; -> &
&euro; -> €
&lt; -> <
// etc.

然后，首先根據表解碼所有遇到的編碼，然后重新編碼整個字符串。 當然，在不先解碼的情況下摸索時，您可能會得到更有效的方法。 但明年你就不會理智了。 這是你的運營商，對吧？ 你需要保持頭腦清醒！ 如果你太聰明，你就會失去理智。 當你發瘋時，你會失去工作。 那些讓維護他們的黑客破壞他們思想的人會發生可悲的事情......

編輯：當然，使用 .NET 庫會讓你免於瘋狂：

HttpUtility.HtmlDecode(string)
HttpUtility.HtmlEncode(string)

我剛剛對其進行了測試，解碼字符串中只有＆符號似乎沒有問題。 所以請繼續：

string magic(string encodedOrNot)
{
    var decoded = HttpUtility.HtmlDecode(encodedOrNot);
    return HttpUtility.HtmlEncode(decoded);
}

編輯#2 ：事實證明，解碼器HttpUtility.HtmlDecode將適用於您的目的，但編碼器不會，因為您不希望對尖括號（ < ， > ）進行編碼。 但是編寫編碼器真的很容易：

define encoder(string decoded):
    result is a string-builder
    for character in decoded:
        if character in encoding-table:
           result.append(encoding-table[character])
        else:
           result.append(character)
    return result as string

Answer 2

這應該做得很好：

text = Regex.Replace(text, @"
    # Match & that is not part of an HTML entity.
    &                  # Match literal &.
    (?!                # But only if it is NOT...
      \w+;             # an alphanumeric entity,
    | \#[0-9]+;        # or a decimal entity,
    | \#x[0-9A-F]+;    # or a hexadecimal entity.
    )                  # End negative lookahead.", 
    "&amp;",
    RegexOptions.IgnoreCase | RegexOptions.IgnorePatternWhitespace);

Answer 3

使用正則表達式，它可以通過負前瞻來完成。

&(?![^& ]+;)

測試示例在這里

如果與號尚未編碼，如何對其進行編碼？

問題描述

3 個解決方案

解決方案1
14 2011-10-11 07:45:49

解決方案2
10 已采納 2011-10-11 14:32:25

解決方案3
1 2011-10-11 07:52:30

如果與號尚未編碼，如何對其進行編碼？

問題描述

3 個解決方案

解決方案1 14 2011-10-11 07:45:49

解決方案2 10 已采納 2011-10-11 14:32:25

解決方案3 1 2011-10-11 07:52:30

解決方案1
14 2011-10-11 07:45:49

解決方案2
10 已采納 2011-10-11 14:32:25

解決方案3
1 2011-10-11 07:52:30