簡體   English   中英

用美麗的湯解析跨度

[英]Parsing span with beautiful soup

我正在嘗試通過某些網站來解析,以查找div標簽和class_內的“ span”。 如果跨度等於字符串,例如“ line”,則跨度屬於特定類,則它將返回網站的實際鏈接。

我收到的錯誤消息:

第28行,在湯中= BeautifulSoup(url_html,“ html.parser”)第245行,在init elif len(markup)<= 256中,並且(TypeError:類型為'Response'的對象沒有len()

import csv
from bs4 import BeautifulSoup
import requests

contents = []

def condition_check():
    for sp in soup.find("div",class_='-vDIg'):
        check = sp.span
        if check in ['Line','LINE ID',]:
            return link


filename = 'link_business_filter.csv'

with(open(filename,'rt')) as f:
    data = csv.reader(f)

    for row in data:
        links = row[0]
        contents.append(links)



for link in contents:
    url_html = requests.get(link)
    soup = BeautifulSoup(url_html,"html.parser")
    con_fltr = condition_check()
    print(con_fltr)

您正在將Request對象傳遞給Beautiful Soup,您需要像這樣傳遞html內容:

for link in contents:
    url_html = requests.get(link)
    soup = BeautifulSoup(url_html.content,"html.parser")
    con_fltr = condition_check()
    print(con_fltr)

url_html-> url_html.content

暫無
暫無

聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

 
粵ICP備18138465號  © 2020-2024 STACKOOM.COM