逐行閱讀

Question

我有一個生成純文本文件的程序。 結構（布局）始終相同。 例：

文本文件：

LinkLabel
"Hello, this text will appear in a LinkLabel once it has been
added to the form. This text may not always cover more than one line. But will always be surrounded by quotation marks."
240, 780

因此，要解釋該文件中發生了什么：

Control
Text
Location

當單擊窗體上的按鈕，並且用戶從OpenFileDialog對話框中打開這些文件之一時，我需要能夠讀取每一行。 從頂部開始，我想檢查一下它是什么控件，然后從第二行開始，我需要能夠將所有文本括在引號內（不管是一行文本還是更多行），並且在下一行（右引號后），我需要提取位置（240、780）...我已經想到了一些解決方法，但是當我寫下來並付諸實踐時，它沒有多大意義，最終弄清了它行不通的方法。

有人做過嗎？ 有人能提供有關我將如何執行此操作的任何幫助，建議或意見嗎？

我查找了CSV文件，但是對於看起來如此簡單的東西來說似乎太復雜了。

謝謝傑瑟

Answer 1

我將嘗試寫下算法，以解決這些問題的方式（在注釋中）：

// while not at end of file
  // read control
  // read line of text
  // while last char in line is not "
    // read line of text
  // read location

嘗試編寫符合每個注釋要求的代碼，您應該能夠弄清楚。

HTH。

Answer 2

您可以使用正則表達式從文本中獲取行：

MatchCollection lines = Regex.Matches(File.ReadAllText(fileName), @"(.+?)\r\n""([^""]+)""\r\n(\d+), (\d+)\r\n");
foreach (Match match in lines) {
   string control = match.Groups[1].Value;
   string text = match.Groups[2].Value;
   int x = Int32.Parse(match.Groups[3].Value);
   int y = Int32.Parse(match.Groups[4].Value);
   Console.WriteLine("{0}, \"{1}\", {2}, {3}", control, text, x, y);
}

Answer 3

您正在嘗試實現一個解析器，而為此的最佳策略是將問題分成更小的部分。 並且您需要一個TextReader類，使您能夠讀取行。

您應該將ReadControl方法分為三個方法： ReadControlType ， ReadText和ReadLocation 。 每個方法僅負責讀取應讀取的項目，並將TextReader留在下一個方法可以使用的位置。 這樣的事情。

public Control ReadControl(TextReader reader)
{
    string controlType = ReadControlType(reader);
    string text = ReadText(reader);
    Point location = ReadLocation(reader);
    ... return the control ...
}

當然，ReadText是最有趣的一種，因為它跨越多行。 實際上，這是一個調用TextReader.ReadLine的循環，直到該行以引號結束為止：

private string ReadText(TextReader reader)
{
    string text;
    string line = reader.ReadLine();
    text = line.Substring(1); // Strip first quotation mark.
    while (!text.EndsWith("\"")) {
        line = reader.ReadLine();
        text += line;
    }
    return text.Substring(0, text.Length - 1); // Strip last quotation mark.
}

Answer 4

這種東西很煩人，從概念上講很簡單，但最終可能會產生粗糙的代碼。 您有一個相對簡單的案例：每個文件一個記錄，如果有很多記錄，則變得更加困難，並且您希望很好地處理格式錯誤的記錄（考慮為C＃這樣的語言編寫解析器。

對於大規模問題，可以使用如下語法驅動的解析器： link text

您的大部分復雜性來自文件缺乏規則性。 第一個字段以nwline終止，第二個字段以引號分隔，第三個字段以逗號終止...

我的第一個建議是調整文件的格式，以使其真正易於解析。 您編寫文件即可控制自己。 例如，僅在文本中沒有換行，並且每個項目都在自己的行上。 然后，您只需閱讀四行，就可以完成工作。

逐行閱讀

問題描述

文本文件：

4 個解決方案

解決方案1
2 2009-09-30 06:43:45

解決方案2
2 已采納 2009-09-30 06:47:37

解決方案3
2 2009-09-30 06:52:49

解決方案4
1 2009-09-30 06:48:02

逐行閱讀

問題描述

文本文件：

4 個解決方案

解決方案1 2 2009-09-30 06:43:45

解決方案2 2 已采納 2009-09-30 06:47:37

解決方案3 2 2009-09-30 06:52:49

解決方案4 1 2009-09-30 06:48:02

解決方案1
2 2009-09-30 06:43:45

解決方案2
2 已采納 2009-09-30 06:47:37

解決方案3
2 2009-09-30 06:52:49

解決方案4
1 2009-09-30 06:48:02