使用Beautiful Soup進行XML解析的問題

Question

當嘗試用Beautiful Soup替換XML中的某些元素時，我發現必須使用soup.find_all().string.replace_with()替換所需的元素。 但是，我遇到了一個問題，即soup.find_all()方法僅返回類型為None元素。

因此，我嘗試將問題分解為盡可能基本的XML：

from bs4 import BeautifulSoup as BS

xml = """
<xml>
    <test tag="0"/>
    <test tag="1"/>
</xml>"""

soup = BS(xml, 'xml')
for elem in soup.find_all("test"):
    print('Element {} has type {}.'.format(elem, elem.type))

這給出了完全相同的東西：

Element <test tag="0"/> has type None.
Element <test tag="1"/> has type None.

如果有人指出問題出在哪里，我會很高興。

提前致謝

Answer 1

好吧，我不確定您要查找的輸出內容是什么，但是您可以通過以下方式替換標記屬性：

from bs4 import BeautifulSoup as BS

xml = """
<xml>
    <test tag="0"/>
    <test tag="1"/>
</xml>"""

replace_list = ['0']
replacement = '2'

soup = BS(xml, 'xml')
for elem in soup.find_all("test"):
    if elem['tag'] in replace_list:
        elem['tag'] = replacement
    #print('Element {} has type {}.'.format(elem, elem.name))

xml = str(soup)

print (xml)

輸出：

<?xml version="1.0" encoding="utf-8"?>
<xml>
<test tag="2"/>
<test tag="1"/>
</xml>

使用Beautiful Soup進行XML解析的問題

問題描述

1 個解決方案

解決方案1
0 已采納 2019-02-28 14:33:28

使用Beautiful Soup進行XML解析的問題

問題描述

1 個解決方案

解決方案1 0 已采納 2019-02-28 14:33:28

解決方案1
0 已采納 2019-02-28 14:33:28