無法訪問XML中的子元素

Question

我試圖將XML格式的字符串解析為一些普通的python對象，並且嘗試使用find和findall方法訪問某些子元素，但它不起作用。

這是我要解析的XML數據：

<?xml version="1.0" ?>
<ItemSearchResponse
    xmlns="http://webservices.amazon.com/AWSECommerceService/2011-08-01">

    <Items>
        <Request>
            <IsValid>True</IsValid>
            <ItemSearchRequest>
                <Keywords>iphone</Keywords>
                <ResponseGroup>ItemAttributes</ResponseGroup>
                <SearchIndex>All</SearchIndex>
            </ItemSearchRequest>
        </Request>
        <TotalResults>40721440</TotalResults>
        <TotalPages>4072144</TotalPages>

        <Item>
            <ASIN>B00YV50QU4</ASIN>
            <ParentASIN>B018GTHAKO</ParentASIN>
            <DetailPageURL>http://www.amazon.com/Apple-iPhone-MD439LL-Smartphone-Refurbished/dp/B00YV50QU4%3Fpsc%3D1%26SubscriptionId%3DAKIAIEEA4BKMTHTI2T7A%26tag%3Dshopit021-20%26linkCode%3Dxm2%26camp%3D2025%26creative%3D165953%26creativeASIN%3DB00YV50QU4</DetailPageURL>
            <ItemLinks>

            </ItemLinks>
            <ItemAttributes>
            </ItemAttributes>
        </Item>
        <Item>
            <ASIN>B00VHSXBUA</ASIN>
            <ParentASIN>B0152TROY8</ParentASIN>
            <ItemAttributes>
            </ItemAttributes>
        </Item>
    </Items>
</ItemSearchResponse>

為了使此樣本更短，我刪除了一些數據。

這是我的代碼。

data  = et.fromstring(response)
            items = data[0][3]
            print items.tag
            items = data[0].findall('item')
            print len(items.findall('.//item'))

訪問子節點（'item'）的第一種方法是使用列表索引符號，並且運行良好。 但是使用find all方法無法正常工作，並且len()始終返回0。

我嘗試使用XPath和其他方式，但是使用索引是使其工作的唯一方法。

為什么像find和findall這樣的方法不起作用？

Answer 1

為什么像find和findall這樣的方法不起作用？

因為沒有名為Item元素。 您的文檔定義的默認XML命名空間http://webservices.amazon.com/AWSECommerceService/2011-08-01 ，這意味着，看起來像一個元素<Item>您的文檔中實際上是包含在該命名空間，距離不同在沒有默認XML名稱空間（或具有其他XML名稱空間）的文檔中看起來像<Item>的元素。

您想要類似的東西：

>>> ns = 'http://webservices.amazon.com/AWSECommerceService/2011-08-01'
>>> items = data[0].findall('{%s}Item' % ns)
>>> items
[<Element {http://webservices.amazon.com/AWSECommerceService/2011-08-01}Item at 0x7f1cbaaba8c0>, <Element {http://webservices.amazon.com/AWSECommerceService/2011-08-01}Item at 0x7f1cbaaba680>]

或者，使用XPath：

>>> items = data[0].xpath('n:Item', namespaces={'n': ns})
>>> items
[<Element {http://webservices.amazon.com/AWSECommerceService/2011-08-01}Item at 0x7f1cbaaba8c0>, <Element {http://webservices.amazon.com/AWSECommerceService/2011-08-01}Item at 0x7f1cbaaba680>]

無法訪問XML中的子元素

問題描述

1 個解決方案

解決方案1
1 已采納 2016-02-03 19:03:30

無法訪問XML中的子元素

問題描述

1 個解決方案

解決方案1 1 已采納 2016-02-03 19:03:30

解決方案1
1 已采納 2016-02-03 19:03:30