繁体 English 中英

使用 BS4 抓取网站

[英]Scraping websites with BS4

原文 2020-06-10 09:26:11 6 1 python/ beautifulsoup

我有这个代码

import requests
from bs4 import BeautifulSoup


result = requests.get("http://www.cvbankas.lt/")
src = result.content
soup = BeautifulSoup(src, 'lxml')

urls = []
for article_tag in soup.find_all("article"):
        a_tag = article_tag.find('a')
        urls.append(a_tag.attrs['href'])
        div_tag = article_tag.find('span')
        urls.append(div_tag.attrs['class'])

print(urls)

谁能解释我如何获得红色标记的数据？

1 个解决方案

您可以使用 class label “salary_amount”获得跨度

salary_object = article_tag.find("span", class_= "salary_amount")

然后提取带有创建的object的.text属性的文本。

Python - 请求 -BS4 和抓取 JS 网站（ajax）

[英]Python -Requests -BS4 and scraping JS websites (ajax)

[英]scraping with BS4

使用 bs4 请求抓取

[英]scraping with request with bs4

使用 bs4 抓取数据

[英]Scraping data with bs4

[英]Web Scraping by BS4

[英]Web scraping with BS4

Python，刮板BS4

[英]Python, Scraping BS4

使用 Bs4 抓取 Python 3

[英]Python 3 scraping with Bs4

使用请求和 BS4 进行抓取

[英]Scraping with requests and BS4

使用 BS4 随时间抓取

[英]Scraping using BS4 with time

暂无

暂无

声明:本站的技术帖子网页，遵循CC BY-SA 4.0协议，如果您需要转载，请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

相关问题 Python - 请求 -BS4 和抓取 JS 网站（ajax）用 BS4 刮使用 bs4 请求抓取使用 bs4 抓取数据 Web BS4报废 Web用BS4刮 Python，刮板BS4 使用 Bs4 抓取 Python 3 使用请求和 BS4 进行抓取使用 BS4 随时间抓取

相关标签

粤ICP备18138465号 © 2020-2024 STACKOOM.COM