Find all attributes in an XML using Beautiful Soup

Question

I have an XML file which looks something like this:

<tagA key1="val1" key2="val2" key3="val3">
<tagB.1 key1="val1" key2="val2" key3="val3"/>
<tagB.2 key1="val1" key2="val2" key3="val3"/>
<tagB.3 key1="val1" key2="val2" key3="val3"/>
<tagB.4 key1="val1" key2="val2" key3="val3"/>
<tagB.5 key1="val1" key2="val2" key3="val3"/>
</tagA>

What I am trying to do is extract the name of key1 , key2 and key3 in tagB.x , and put them into a list. This way I can extract the values of it later. It should be able to handle more or less elements, being as each file is different. Thanks!

Answer 1

You should use an xml parser:

xml="""
<tagA key1="val1" key2="val2" key3="val3">
<tagB.1 key1="val1" key2="val2" key3="val3"/>
<tagB.2 key1="val1" key2="val2" key3="val3"/>
<tagB.3 key1="val1" key2="val2" key3="val3"/>
<tagB.4 key1="val1" key2="val2" key3="val3"/>
<tagB.5 key1="val1" key2="val2" key3="val3"/>
</tagA>
"""


import xml.etree.ElementTree as ET

root = ET.fromstring(xml)
for child in root:
    print child.tag, child.attrib.keys()

tagB.1 ['key3', 'key2', 'key1']
tagB.2 ['key3', 'key2', 'key1']
tagB.3 ['key3', 'key2', 'key1']
tagB.4 ['key3', 'key2', 'key1']
tagB.5 ['key3', 'key2', 'key1']

Find all attributes in an XML using Beautiful Soup

Question

1 answers

solution1
2 ACCPTED 2014-08-23 20:59:26

Find all attributes in an XML using Beautiful Soup

Question

1 answers

solution1 2 ACCPTED 2014-08-23 20:59:26

solution1
2 ACCPTED 2014-08-23 20:59:26