How to get these values using Python
I have some HTML source like this:
<tr>
<td class="upl">XXXX</td>
<td class="upl">XXXX</td>
<td class="upl">XXXX</td>
<td class="up">XXXX</td>
<td>9.09</td>
<td class="upl">XXXX</td>
<td class="dn">XXXX</td>
<td>XXXX</td>
<td>XXXX</td>
<td>XXXX</td>
<td class="up">XXXX</td>
<td class="up">XXXX</td>
<td class="up">XXXX</td>
<td class="dn">XXXX</td>
<td class="up">XXXX</td>
</tr>
<tr>
<td class="upl">XXXX</td>
<td class="upl">XXXX</td>
<td class="upl">XXXX</td>
<td class="up">XXXX</td>
<td>XXXX</td>
<td class="upl">XXXX</td>
<td class="up">XXXX</td>
<td>XXXX</td>
<td>XXXX</td>
<td>XXXX</td>
<td class="up">XXXX</td>
<td class="up">XXXX</td>
<td class="up">XXXX</td>
<td class="dn">XXXX</td>
<td class="up">XXXX</td>
</tr>
How can I get all the XXXX values using BeautifulSoup 4? My current code:
from urllib.request import urlopen
from bs4 import BeautifulSoup
html = urlopen("URL")
bsObj = BeautifulSoup(html, "html.parser")
nameList2 = bsObj.findAll("td")  # this only shows all the information
for name in nameList2:
    print(name.get_text())
BeautifulSoup tags have an attribute called contents (note: an attribute, not a method) that you can use instead of get_text():
from urllib.request import urlopen
from bs4 import BeautifulSoup
html = urlopen("URL")
bsObj = BeautifulSoup(html, "html.parser")  # name the parser explicitly to avoid a warning
nameList2 = bsObj.findAll("td")  # every <td> element in the document
for name in nameList2:
    print(name.contents)  # a list of the cell's child nodes, e.g. ['XXXX']
In my tests the list always had length 1, so you can use:
print(name.contents[0])  # prints XXXX
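To get rid of the u prefix (which only appears on Python 2), convert the value with str(), which calls its __str__ method:
print(str(name.contents[0]))  # prints XXXX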
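To try this without fetching a page, here is a minimal sketch that parses an inline string instead of a URL. The sample markup and the names SAMPLE_HTML, rows and nested_cell are made up for illustration; only the bs4 calls (find_all, get_text, contents) come from the library. It also shows why contents[0] is only safe when a cell holds plain text: if the cell contains nested markup, the first child is a tag rather than a string, and get_text() is probably the safer choice.

```python
from bs4 import BeautifulSoup

# Hypothetical sample markup, trimmed from the table rows in the question.
SAMPLE_HTML = """
<table>
<tr><td class="upl">A1</td><td class="up">A2</td><td>9.09</td></tr>
<tr><td class="upl">B1</td><td class="dn"><b>B2</b></td><td>B3</td></tr>
</table>
"""

soup = BeautifulSoup(SAMPLE_HTML, "html.parser")

# One list of cell texts per <tr>, so the values stay grouped by row.
rows = [
    [td.get_text(strip=True) for td in tr.find_all("td")]
    for tr in soup.find_all("tr")
]
print(rows)

# .contents holds the direct children of a tag. For a cell with nested
# markup the first child is a Tag, not a string, so get_text() is safer.
nested_cell = soup.find("td", class_="dn")
first_child = nested_cell.contents[0]
print(type(first_child).__name__, nested_cell.get_text())
```

If you need the values as one flat list rather than per row, iterating over soup.find_all("td") directly, as in the code above, works the same way.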
Whatever you are working on, I hope this helps.
Good luck!