What is the best way to parse large XML (1 GB in size) in C#?

Question

I have a 1 GB XML file and want to parse it. If I use XmlTextReader or XmlDocument, parsing is very slow and sometimes it hangs...

Answer 1

You'll have to implement custom logic using XmlReader. XmlReader does not load the full XML into memory before using it, which means you can read the document from a stream and process it node by node as it arrives.
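A minimal sketch of that node-by-node approach, assuming a file named MyBigFile.xml and an element called "record" (both are hypothetical placeholders, not taken from the question). It only counts matching elements; real processing logic would go inside the loop.

```csharp
using System;
using System.Xml;

class StreamingParseSketch
{
    static void Main()
    {
        // "MyBigFile.xml" and "record" are hypothetical placeholders.
        long count = 0;

        using (XmlReader reader = XmlReader.Create("MyBigFile.xml"))
        {
            // Read() advances one node at a time, so only the current
            // node is held in memory rather than the whole document.
            while (reader.Read())
            {
                if (reader.NodeType == XmlNodeType.Element && reader.Name == "record")
                {
                    count++;
                }
            }
        }

        Console.WriteLine("record elements: " + count);
    }
}
```

Memory use stays roughly flat no matter how large the file is, because nothing is retained between iterations except the counter.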

Answer 2

XmlDocument is not feasible in this scenario, as it will attempt to pull that entire gigabyte into main memory. I'm surprised that you're finding XmlTextReader to be too slow. Have you tried something like the following?

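using (XmlTextReader rdr = new XmlTextReader("MyBigFile.txt"))
{
    // Advance through the document one node at a time (requires System.Xml).
    while (rdr.Read())
    {
        // Inspect rdr.NodeType, rdr.Name, and rdr.Value here.
    }
}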
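If working with raw reader nodes is too low-level, one common variation is to stream with XmlReader and materialize only one element at a time as an XElement. This is a sketch of that hybrid, not something the answer above proposes; the helper name StreamElements and the element name "item" are hypothetical.

```csharp
using System;
using System.Collections.Generic;
using System.Xml;
using System.Xml.Linq;

static class ElementStreamer
{
    // Yields each element with the given name as a small in-memory XElement.
    public static IEnumerable<XElement> StreamElements(string path, string elementName)
    {
        using (XmlReader reader = XmlReader.Create(path))
        {
            reader.MoveToContent();
            while (!reader.EOF)
            {
                if (reader.NodeType == XmlNodeType.Element && reader.Name == elementName)
                {
                    // ReadFrom consumes the whole element and leaves the reader
                    // on the node after it, so no extra Read() is needed here.
                    yield return (XElement)XNode.ReadFrom(reader);
                }
                else
                {
                    reader.Read();
                }
            }
        }
    }
}

class Program
{
    static void Main()
    {
        // "item" is a hypothetical element name; substitute the repeating element in your file.
        foreach (XElement item in ElementStreamer.StreamElements("MyBigFile.txt", "item"))
        {
            Console.WriteLine(item.Name);
        }
    }
}
```

This keeps only one matched element in memory at a time, so it suits files made of many small repeating records. If the name matches the root element, the whole document is loaded, which defeats the purpose.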

Answer 3

XmlTextReader isn't supposed to hang, as it is stream-based and only works on chunks of the data at a time.

If it hangs, it may well be that you are doing something wrong when loading the file.
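The answer does not say what might be going wrong. Two possibilities worth ruling out, offered here only as assumptions, are reading the whole file into a string before parsing and letting the reader resolve an external DTD. A sketch that avoids both, with MyBigFile.xml as a hypothetical file name:

```csharp
using System;
using System.IO;
using System.Xml;

class SafeOpenSketch
{
    static void Main()
    {
        var settings = new XmlReaderSettings
        {
            DtdProcessing = DtdProcessing.Ignore, // skip any DOCTYPE instead of processing it
            XmlResolver = null,                   // never fetch external resources
            IgnoreWhitespace = true,
            IgnoreComments = true
        };

        // Open the file as a stream rather than reading it into a string first.
        // "MyBigFile.xml" is a hypothetical placeholder.
        using (FileStream stream = File.OpenRead("MyBigFile.xml"))
        using (XmlReader reader = XmlReader.Create(stream, settings))
        {
            long nodes = 0;
            while (reader.Read())
            {
                nodes++;
            }
            Console.WriteLine("nodes read: " + nodes);
        }
    }
}
```

If this loop finishes quickly, the slowdown is more likely in the per-node processing code than in the reader itself.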

Answer 4

I'm not very familiar with this topic, but as far as I know the XmlReader classes ought to work fine for your specific problem. They are, after all, optimized for exactly this.

Answer 5

I would just like to back up everyone who recommends XmlReader with a performance comparison that I found:

http://www.nearinfinity.com/blogs/joe_ferner/performance_linq_to_sql_vs.html

5 answers

Answer 1  12  2009-01-22 12:41:06

Answer 2  8  2009-01-22 12:44:06

Answer 3  6  2009-01-22 12:42:01

Answer 4  1  2009-01-22 12:45:20

Answer 5  1  2009-01-22 13:36:12
