從字符串C＃中刪除單詞

Question

我正在使用ASP.NET 4.0 Web應用程序，它的主要目標是轉到MyURL變量中的URL，然后從上至下讀取它，搜索以“ description”開頭的所有行，並僅保留那些同時刪除所有HTML標簽。 我接下來要做的是從結果后綴中刪除“描述”文本，以便僅剩下設備名稱。 我該怎么做？

protected void parseButton_Click(object sender, EventArgs e)
    {
        MyURL = deviceCombo.Text;
        WebRequest objRequest = HttpWebRequest.Create(MyURL);
        objRequest.Credentials = CredentialCache.DefaultCredentials;
        using (StreamReader objReader = new StreamReader(objRequest.GetResponse().GetResponseStream()))
        {
            originalText.Text = objReader.ReadToEnd();
        }

        //Read all lines of file
        String[] crString = { "<BR>&nbsp;" };
        String[] aLines = originalText.Text.Split(crString, StringSplitOptions.RemoveEmptyEntries);

        String noHtml = String.Empty;

        for (int x = 0; x < aLines.Length; x++)
        {
            if (aLines[x].Contains(filterCombo.SelectedValue))
            {
                noHtml += (RemoveHTML(aLines[x]) + "\r\n");

            }
        }
        //Print results to textbox
        resultsBox.Text = String.Join(Environment.NewLine, noHtml);
    }
    public static string RemoveHTML(string text)
    {
        text = text.Replace("&nbsp;", " ").Replace("<br>", "\n");
        var oRegEx = new System.Text.RegularExpressions.Regex("<[^>]+>");
        return oRegEx.Replace(text, string.Empty);
    }

Answer 1

好的，所以我想出了如何通過現有功能之一刪除單詞：

public static string RemoveHTML(string text)
{
    text = text.Replace("&nbsp;", " ").Replace("<br>", "\n").Replace("description", "").Replace("INFRA:CORE:", "")
        .Replace("RESERVED", "")
        .Replace(":", "")
        .Replace(";", "")
        .Replace("-0/3/0", "");
        var oRegEx = new System.Text.RegularExpressions.Regex("<[^>]+>");
        return oRegEx.Replace(text, string.Empty);
}

Answer 2

public static void Main(String[] args)
{
    string str = "He is driving a red car.";

    Console.WriteLine(str.Replace("red", "").Replace("  ", " "));
}

輸出：他在開車。

注意：在第二個替換其雙精度空格。

鏈接： https : //i.stack.imgur.com/rbluf.png

試試這個。它將刪除所有要刪除的單詞。

Answer 3

改編自代碼項目

string value = "ABC - UPDATED";
int index = value.IndexOf(" - UPDATED");
if (index != -1)
{
    value = value.Remove(index);
}

它將不打印ABC - UPDATED

Answer 4

使用LINQ嘗試這樣的事情：

List<string> lines = new List<string>{
"Hello world",
"Description: foo",
"Garbage:baz",
"description purple"};

 //now add all your lines from your html doc.
 if (aLines[x].Contains(filterCombo.SelectedValue))
 {
       lines.Add(RemoveHTML(aLines[x]) + "\r\n");
 }

var myDescriptions = lines.Where(x=>x.ToLower().BeginsWith("description"))
                          .Select(x=> x.ToLower().Replace("description",string.Empty)
                                       .Trim());

// you now have "foo" and "purple", and anything else.

您可能需要調整冒號等。

Answer 5

void Main()
{
    string test = "<html>wowzers description: none <div>description:a1fj391</div></html>";
    IEnumerable<string> results = getDescriptions(test);
    foreach (string result in results)
    {
        Console.WriteLine(result);  
    }

    //result: none
    //        a1fj391
}

static Regex MyRegex = new Regex(
      "description:\\s*(?<value>[\\d\\w]+)",
    RegexOptions.Compiled);

IEnumerable<string> getDescriptions(string html)
{
    foreach(Match match in MyRegex.Matches(html))
    {
        yield return match.Groups["value"].Value;
    }
}

從字符串C＃中刪除單詞

問題描述

5 個解決方案

解決方案1
6 已采納 2012-12-27 20:33:46

解決方案2
2 2018-08-05 10:26:53

解決方案3
0 2014-11-21 17:11:59

解決方案4
0 2011-10-06 19:48:56

解決方案5
0 2011-10-25 11:32:56

從字符串C＃中刪除單詞

問題描述

5 個解決方案

解決方案1 6 已采納 2012-12-27 20:33:46

解決方案2 2 2018-08-05 10:26:53

解決方案3 0 2014-11-21 17:11:59

解決方案4 0 2011-10-06 19:48:56

解決方案5 0 2011-10-25 11:32:56

解決方案1
6 已采納 2012-12-27 20:33:46

解決方案2
2 2018-08-05 10:26:53

解決方案3
0 2014-11-21 17:11:59

解決方案4
0 2011-10-06 19:48:56

解決方案5
0 2011-10-25 11:32:56