Using Python to extract information from a XML file?

Question

Can anyone offer some help with regards to using Python to extract information from a XML file? This will be my example XML.

<root>
    <number index="2">
        <info>
            <info.RANDOM>Random Text</info.RANDOM>
        </info>
</root>

What I want to print out is the information between the root tags. However, I want it to print it as is, which means all the tags, text in between the tags, and the content within the tag (in this case number index ="2") I have tried itertext(), but that removes the tags and prints only the text in between the root tags. So far, I have a makeshift solution that prints out only the element.tag and the element.text but that does not print out the end tags and the content within the tag. Any help would be appreciated! :)

Answer 1

With s as your input,

s='''<root>
      <number index="2">
        <info>
            <info.RANDOM>Random Text</info.RANDOM>
        </info>
        </number>
</root>'''

Find all tags with tag name number and convert the tag to string using ET.tostring()

import xml.etree.ElementTree as ET
root = ET.fromstring(s)
for node in root.findall('.//number'):
  print ET.tostring(node)

Output:

<number index="2">
        <info>
            <info.RANDOM>Random Text</info.RANDOM>
        </info>
        </number>

Answer 2

from bs4 import BeautifulSoup

xml = "<root><number index=\"2\"><info><info.RANDOM>Random Text</info.RANDOM></info></root>"
soup = BeautifulSoup(xml, "xml")

output = soup.prettify()
print(output[output.find("<root>") + 7:output.rfind("</root>")])

the + 7 accounts for root>\\n

Using Python to extract information from a XML file?

Question

2 answers

solution1
1 ACCPTED 2017-05-15 16:24:13

solution2
0 2017-05-15 17:52:16

Using Python to extract information from a XML file?

Question

2 answers

solution1 1 ACCPTED 2017-05-15 16:24:13

solution2 0 2017-05-15 17:52:16

solution1
1 ACCPTED 2017-05-15 16:24:13

solution2
0 2017-05-15 17:52:16