
How to get these values using Python

I have some HTML source like this:

<tr>
    <td class="upl">XXXX</td>
    <td class="upl">XXXX</td>
    <td class="upl">XXXX</td>
    <td class="up">XXXX</td>
    <td>9.09</td>
    <td class="upl">XXXX</td>
    <td class="dn">XXXX</td>
    <td>XXXX</td>
    <td>XXXX</td>
    <td>XXXX</td>
    <td class="up">XXXX</td>
    <td class="up">XXXX</td>
    <td class="up">XXXX</td>
    <td class="dn">XXXX</td>
    <td class="up">XXXX</td>
</tr>
<tr>

    <td class="upl">XXXX</td>
    <td class="upl">XXXX</td>
    <td class="upl">XXXX</td>
    <td class="up">XXXX</td>
    <td>XXXX</td>
    <td class="upl">XXXX</td>
    <td class="up">XXXX</td>
    <td>XXXX</td>
    <td>XXXX</td>
    <td>XXXX</td>
    <td class="up">XXXX</td>
    <td class="up">XXXX</td>
    <td class="up">XXXX</td>
    <td class="dn">XXXX</td>
    <td class="up">XXXX</td>
</tr>

How can I get all the XXXX values using BeautifulSoup 4? My current code:

from urllib.request import urlopen
from bs4 import BeautifulSoup

html = urlopen("URL")
bsObj = BeautifulSoup(html, "html.parser")

nameList2 = bsObj.find_all("td")  # this only shows all the information

for name in nameList2:
    print(name.get_text())

BeautifulSoup.contents

BeautifulSoup has a contents attribute (note: an attribute, not a method) that you can use in place of get_text():

from urllib.request import urlopen
from bs4 import BeautifulSoup

html = urlopen("URL")
bsObj = BeautifulSoup(html, "html.parser")  # name the parser explicitly
nameList2 = bsObj.find_all("td")  # every <td> element on the page
for name in nameList2:
    print(name.contents)  # a list of the tag's children, e.g. ['XXXX']
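
To see what contents holds without fetching a page, here is a minimal sketch that parses an inline fragment. The sample_html string and the cell values in it are assumptions made up for illustration; they are not taken from the real page in the question.

```python
from bs4 import BeautifulSoup

# Hypothetical stand-in for the page in the question.
sample_html = """
<table>
<tr>
    <td class="upl">AAA</td>
    <td>9.09</td>
    <td class="dn"><b>BBB</b> tail</td>
    <td></td>
</tr>
</table>
"""

soup = BeautifulSoup(sample_html, "html.parser")
cells = soup.find_all("td")

# contents lists a tag's direct children; get_text() joins all nested text.
lengths = [len(cell.contents) for cell in cells]
texts = [cell.get_text() for cell in cells]

for length, text in zip(lengths, texts):
    print(length, repr(text))
```

A cell that holds only text has exactly one child, but a cell with nested markup has more and an empty cell has none, so contents[0] is only safe when every cell is plain text.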

In my tests the list always had length 1, so you can use:

    print(name.contents[0])  # the first child, printed as XXXX

To get a plain string without the u prefix, call str() on the element, which invokes its __str__ method:

    print(str(name.contents[0])) # --> "XXXX"
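
If the goal is to keep the values grouped by table row, something along these lines might work. It is only a sketch: extract_rows and sample_html are hypothetical names introduced here, the cell values are invented, and it uses get_text(strip=True) rather than contents[0] so that empty or nested cells do not raise an error.

```python
from bs4 import BeautifulSoup

# Hypothetical stand-in for the page in the question.
sample_html = """
<table>
<tr>
    <td class="upl">A1</td>
    <td class="up">A2</td>
    <td>9.09</td>
</tr>
<tr>
    <td class="upl">B1</td>
    <td class="dn">B2</td>
    <td>B3</td>
</tr>
</table>
"""


def extract_rows(html):
    """Return one list of stripped cell strings per <tr> in the markup."""
    soup = BeautifulSoup(html, "html.parser")
    return [
        [td.get_text(strip=True) for td in tr.find_all("td")]
        for tr in soup.find_all("tr")
    ]


rows = extract_rows(sample_html)
for row in rows:
    print(row)
```

With a real page, the result of urlopen("URL") could be passed to extract_rows in place of sample_html.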

Whatever you are working on, I hope this helps.
Good luck!

