使用Python解析Appengine中的xml的最佳方法

Question

我正在連接到isbndb.com獲取圖書信息，他們的回復如下：

<?xml version="1.0" encoding="UTF-8"?>
<ISBNdb server_time="2005-02-25T23:03:41">
 <BookList total_results="1" page_size="10" page_number="1" shown_results="1">
  <BookData book_id="somebook" isbn="0123456789">
   <Title>Interesting Book</Title>
   <TitleLong>Interesting Book: Read it or else..</TitleLong>
   <AuthorsText>John Doe</AuthorsText>
   <PublisherText>Acme Publishing</PublisherText>
  </BookData>
 </BookList>
</ISBNdb>

使用appengine（Python）將這些數據轉換為對象的最佳方法是什么？

我需要isbn數字（BookData中的標簽），但我還需要BookData所有子節點的內容（而不是標簽）。

Answer 1

使用etree :)

>>> xml = """<?xml version="1.0" encoding="UTF-8"?>
... <ISBNdb server_time="2005-02-25T23:03:41">
...  <BookList total_results="1" page_size="10" page_number="1" shown_results="1">
...   <BookData book_id="somebook" isbn="0123456789">
...    <Title>Interesting Book</Title>
...    <TitleLong>Interesting Book: Read it or else..</TitleLong>
...    <AuthorsText>John Doe</AuthorsText>
...    <PublisherText>Acme Publishing</PublisherText>
...   </BookData>
...  </BookList>
... </ISBNdb>"""

from xml.etree import ElementTree as etree
tree = etree.fromstring(xml)

>>> for book in tree.iterfind('BookList/BookData'):
...     print 'isbn:', book.attrib['isbn']
...     for child in book.getchildren():
...             print '%s :' % child.tag, child.text
... 
isbn: 0123456789
Title : Interesting Book
TitleLong : Interesting Book: Read it or else..
AuthorsText : John Doe
PublisherText : Acme Publishing
>>> 

voila;)

Answer 2

有一個很棒的Python模塊叫做BeautifulSoup。 使用BeautifulStoneSoup類進行XML解析。

更多信息： http ： //www.crummy.com/software/BeautifulSoup/documentation.html

使用Python解析Appengine中的xml的最佳方法

問題描述

2 個解決方案

解決方案1
7 已采納 2011-01-07 18:46:56

解決方案2
0 2011-01-07 18:41:15

使用Python解析Appengine中的xml的最佳方法

問題描述

2 個解決方案

解決方案1 7 已采納 2011-01-07 18:46:56

解決方案2 0 2011-01-07 18:41:15

解決方案1
7 已采納 2011-01-07 18:46:56

解決方案2
0 2011-01-07 18:41:15