[英]XML ElementTree - indexing tags
我有一個XML文件:
<sentence id="en_BlueRibbonSushi_478218345:2">
<text>It has great sushi and even better service.</text>
</sentence>
<sentence id="en_BlueRibbonSushi_478218345:3">
<text>The entire staff was extremely accomodating and tended to my every need.</text>
</sentence>
<sentence id="en_BlueRibbonSushi_478218345:4">
<text>I've been to this restaurant over a dozen times with no complaints to date.</text>
</sentence>
使用XML ElementTree,我想插入一個具有屬性category=
的標簽<Opinion>
。 假設我有一個字符list = ['a', 'b', 'c']
,是否可以逐步將它們與每個文本對齊,所以我有:
<sentence id="en_BlueRibbonSushi_478218345:2">
<text>It has great sushi and even better service.</text>
<Opinion category='a' />
</sentence>
<sentence id="en_BlueRibbonSushi_478218345:3">
<text>The entire staff was extremely accomodating and tended to my every need.</text>
<Opinion category='b' />
</sentence>
<sentence id="en_BlueRibbonSushi_478218345:4">
<text>I've been to this restaurant over a dozen times with no complaints to date.</text>
<Opinion category='c' />
</sentence>
我知道我可以使用句子id屬性,但這需要對我的代碼進行大量重組。 基本上,我希望能夠索引每個句子條目以與我的列表索引對齊。
您可以使用SubElement
工廠函數向樹中添加元素。 假設您的XML數據位於名為data
的變量中,這會將元素添加到文檔樹中:
import xml.etree.ElementTree as ET
tree = ET.XML(data)
for elem, category in zip(tree.findall('sentence'), ['a', 'b', 'c']):
Opinion = ET.SubElement(elem, 'Opinion')
Opinion.set('category', category)
ET.dump(tree) # prints the tree; tree.write('output.xml') is another option
聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.