ElementTree parsing XML

I'm trying to parse the XML below using Python and ElementTree, but I'm running into some errors.

This is the code I'm using right now:

import xml.etree.ElementTree as ET

tree = ET.parse('file.xml')

for house in tree.findall('.//HOUSE'):
        HOUSE_ID = house.find('ID').text
        ASSET_ID = house.find('ASSET_ID').text
        IP = house.find('IP').text
        FOLLOW_UP= house.find('FOLLOW_UP').text
        NODE_ID= house.find('NODE_ID').text
        ADDRESS = house.find('ADDRESS')
        if ADDRESS is None:
            ADDRESS = ''
        else:
            ADDRESS = ADDRESS.text
        LAST_VISIT_DATETIME = house.find('LAST_VISIT_DATETIME').text
        DOMOTIC = house.find('DOMOTIC')
        if DOMOTIC is None:
            DOMOTIC = ''
        else:
            DOMOTIC = DOMOTIC.text
        ITEM_ID = house.find('ITEM_ID')
        if ITEM_ID is None:
            ITEM_ID = ''
        else:
            ITEM_ID = ITEM_ID.text


        print(HOUSE_ID + "," + ASSET_ID + "," + IP + "," + FOLLOW_UP + "," + NODE_ID + "," + ADDRESS + "," + LAST_VISIT_DATETIME + "," + ADDRESS + "," + LAST_VISIT_DATETIME)

This is the XML:

<OUTPUT>
    <RESPONSE>
        <DATETIME>2020-10-26T11:38:35Z</DATETIME>
        <HOUSE_LIST>
            <HOUSE>
                <ID>10808749</ID>
                <ASSET_ID>1234</ASSET_ID>
                <IP>192.168.0.1</IP>
                <TRACKING_METHOD>IP</TRACKING_METHOD>
                <NODE_ID>0</NODE_ID>
                <ADDRESS>
                    <![CDATA[XXX]]>
                </ADDRESS>
                <LAST_VISIT_DATETIME>2020-08-28T09:16:26Z</LAST_VISIT_DATETIME>
                <LAST_COMM_DATE>2020-08-28T08:48:38Z</LAST_COMM_DATE>
                <LAST_COMM_DURATION>982</LAST_COMM_DURATION>
                <ITEM_LIST>
                    <ITEM>
                        <ITEM_ID>13607</ITEM_ID>
                        <TYPE>Potential</TYPE>
                        <USAGE>3</USAGE>
                        <LOCATION>443</LOCATION>
                        <ABCD>aa</ABCD>
                        <ABD>1</ABD>
                        <DESCRIPTION>
                            <![CDATA[AAAAAA]]>
                        </DESCRIPTION>
                        <STATUS>New</STATUS>
                        <FIRST_FOUND_DATETIME>2020-08-28T08:48:38Z</FIRST_FOUND_DATETIME>
                        <LAST_FOUND_DATETIME>2020-08-28T08:48:38Z</LAST_FOUND_DATETIME>
                        <TIMES_FOUND>1</TIMES_FOUND>
                        <LAST_TEST_DATETIME>2020-08-28T08:48:38Z</LAST_TEST_DATETIME>
                        <LAST_UPDATE_DATETIME>2020-08-28T09:16:26Z</LAST_UPDATE_DATETIME>
                        <IS_IGNORED>0</IS_IGNORED>
                        <IS_DISABLED>0</IS_DISABLED>
                        <LAST_PROCESSED_DATETIME>2020-08-28T09:16:26Z</LAST_PROCESSED_DATETIME>
                    </ITEM>
                    <ITEM>

I'd like to print the list of items in each house, but it only prints one of them.

That happens because ITEM elements are repeated inside HOUSE/ITEM_LIST, while find() returns at most one element. To print the list of items in each house, iterate over the ITEM elements in the same way as you iterate over the HOUSE elements:

for item in house.findall('./ITEM_LIST/ITEM'):
    ITEM_ID = item.find('ITEM_ID')
    if ITEM_ID is None:
        ITEM_ID = ''
    else:
        ITEM_ID = ITEM_ID.text
    print(ITEM_ID)
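
Putting the two loops together, a minimal self-contained sketch might look like the following. It assumes a shortened, hypothetical sample in the same shape as the XML in the question (the second ITEM, with ITEM_ID 13608, is made up for illustration), and it uses findtext() with a default instead of the repeated "is None" checks. The name collect_rows is also just an illustrative choice.

```python
import xml.etree.ElementTree as ET

# Hypothetical, shortened sample in the same shape as the XML in the question.
# The second ITEM (13608) is invented so that there is more than one to list.
SAMPLE_XML = """
<OUTPUT>
    <RESPONSE>
        <HOUSE_LIST>
            <HOUSE>
                <ID>10808749</ID>
                <IP>192.168.0.1</IP>
                <ADDRESS>
                    <![CDATA[XXX]]>
                </ADDRESS>
                <ITEM_LIST>
                    <ITEM>
                        <ITEM_ID>13607</ITEM_ID>
                        <TYPE>Potential</TYPE>
                    </ITEM>
                    <ITEM>
                        <ITEM_ID>13608</ITEM_ID>
                        <TYPE>Confirmed</TYPE>
                    </ITEM>
                </ITEM_LIST>
            </HOUSE>
        </HOUSE_LIST>
    </RESPONSE>
</OUTPUT>
"""


def collect_rows(xml_text):
    """Return one (house_id, ip, address, item_id) tuple per ITEM in each HOUSE."""
    root = ET.fromstring(xml_text)
    rows = []
    for house in root.findall('.//HOUSE'):
        # findtext() returns the default ('') when the child element is missing.
        house_id = house.findtext('ID', default='')
        ip = house.findtext('IP', default='')
        # CDATA inside an indented element keeps the surrounding whitespace.
        address = house.findtext('ADDRESS', default='').strip()
        # Inner loop: one row per ITEM under this HOUSE.
        for item in house.findall('./ITEM_LIST/ITEM'):
            item_id = item.findtext('ITEM_ID', default='')
            rows.append((house_id, ip, address, item_id))
    return rows


rows = collect_rows(SAMPLE_XML)
for row in rows:
    print(",".join(row))
```

Note that in this sketch a HOUSE without any ITEM elements produces no rows at all. If such houses should still be printed, one option is to append a row with an empty item_id when the inner findall() returns an empty list.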
