Remove all text nodes from XML file

Question

I want to remove all text nodes (but not any other type of node) from an XML file. How can I do this?

Example Input:

<root>
<slideshow id="1">
<Image>hii</Image>
<ImageContent>this</ImageContent>
<Thumbnail>is</Thumbnail>
<ThumbnailContent>A</ThumbnailContent>
</slideshow>
<slideshow id="2">
<Image>hii</Image>
<ImageContent>this</ImageContent>
<Thumbnail>is</Thumbnail>
<ThumbnailContent>B</ThumbnailContent>
</slideshow>
</root>

Expected Output:

<root>
<slideshow id="1">
<Image></Image>
<ImageContent></ImageContent>
<Thumbnail></Thumbnail>
<ThumbnailContent></ThumbnailContent>
</slideshow>
<slideshow id="2">
<Image></Image>
<ImageContent></ImageContent>
<Thumbnail></Thumbnail>
<ThumbnailContent></ThumbnailContent>
</slideshow>
</root>

Answer 1

How about:

var doc = XDocument.Load("test.xml");
doc.DescendantNodes()
   .Where(x => x.NodeType == XmlNodeType.Text ||
               x.NodeType == XmlNodeType.CDATA)
   .Remove();
doc.Save("clean.xml");

EDIT: Note that the above was before I realized that XCData derived from XText , leading to the simpler:

var doc = XDocument.Load("test.xml");
doc.DescendantNodes()
   .OfType<XText>()
   .Remove();
doc.Save("clean.xml");

Answer 2

This question should help: Linq to XML - update/alter the nodes of an XML Document

You can use Linq to open the document and alter the values or remove the nodes altogether.

Remove all text nodes from XML file

Question

2 answers

solution1
7 ACCPTED 2011-10-13 15:46:46

solution2
0 2011-10-13 15:50:47

Remove all text nodes from XML file

Question

2 answers

solution1 7 ACCPTED 2011-10-13 15:46:46

solution2 0 2011-10-13 15:50:47

solution1
7 ACCPTED 2011-10-13 15:46:46

solution2
0 2011-10-13 15:50:47