How do I print HTML without line breaks using Python and BeautifulSoup?

Question

I am trying to print HTML with BeautifulSoup like this:

import urllib2  # Python 2 standard library
from bs4 import BeautifulSoup

load = urllib2.urlopen(url)
soup = BeautifulSoup(load, 'lxml')
characteristics = soup.find('table', {'class': 'characteristics-table'})
print characteristics

This gives me:

<table class="characteristics-table">
<tr class="characteristics alt">
<td class="name">
Zīmols
</td>
<td>
Emporio Armani</td>
</tr>
<tr class="characteristics">
<td class="name">
<b>Mehānisma tips</b>
</td>
<td>
<b>Mehāniskie automātiskie</b></td>
</tr>...

But I need something like this:

<table class="characteristics-table"><tr class="characteristics alt"><td class="name">Zīmols</td><td>...

How can I do this?

Answer 1

If you only want to remove the line breaks from characteristics, use str.replace to replace each line break with the empty string '':

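print str(characteristics).replace('\r\n', '').replace('\n', '')

The first call removes Windows-style line breaks (\r\n); the second is applied to the result of the first and removes the remaining Unix-style line breaks (\n). The order matters: removing \n first would leave a stray \r behind for every Windows-style line break.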

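Edit: .replace must be applied to the str() of the object returned by BeautifulSoup's find, because find returns a Tag object rather than a string.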
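As a minimal sketch of the replacement approach, the snippet below works on a plain string so that it runs without BeautifulSoup installed. The helper name strip_newlines and the sample string are hypothetical; in the question's code the input would be str(characteristics). It is written for Python 3, unlike the Python 2 code in the thread.

```python
def strip_newlines(html_text):
    """Remove line breaks from an HTML string, whatever their style."""
    # Handle the two-character Windows sequence first, then lone \n and \r.
    return html_text.replace('\r\n', '').replace('\n', '').replace('\r', '')


# Hypothetical stand-in for str(characteristics), with Windows line endings.
sample = '<td class="name">\r\nZīmols\r\n</td>\r\n<td>\r\nEmporio Armani</td>'

one_line = strip_newlines(sample)
print(one_line)

# Replacing '\n' before '\r\n' would leave the '\r' characters behind.
wrong_order = sample.replace('\n', '').replace('\r\n', '')
print('\r' in wrong_order)
```

Note that this only removes line-break characters. Any indentation inside the markup is kept as it is.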

Answer 2

''.join(str(characteristics).split('\n'))   # or '\r\n' on Windows
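A variation on the split-and-join idea, sketched here under the assumption that the markup is already a string (for example the result of str(characteristics)): str.splitlines() splits on \n and \r\n alike, so no platform-specific separator is needed. The helper name join_lines, its strip_indent parameter, and the sample string are hypothetical. The code targets Python 3.

```python
def join_lines(html_text, strip_indent=False):
    """Join all lines of an HTML string into a single line."""
    # splitlines() recognises \n, \r\n and \r (and a few rarer separators).
    lines = html_text.splitlines()
    if strip_indent:
        # Optionally drop leading/trailing whitespace on each line as well.
        lines = [line.strip() for line in lines]
    return ''.join(lines)


# Hypothetical stand-in for str(characteristics), with indented lines.
sample = (
    '<tr class="characteristics">\n'
    '  <td class="name">\n'
    '  <b>Mehānisma tips</b>\n'
    '  </td>\n'
    '</tr>'
)

flat = join_lines(sample)
compact = join_lines(sample, strip_indent=True)
print(flat)
print(compact)
```

Stripping each line is a judgment call: it gives the compact form the question asks for, but it also removes whitespace at the start or end of a line that may be significant inside elements such as pre.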

Answer 1
2 2017-10-28 02:46:16

Answer 2
1 accepted 2017-10-28 02:51:27
