繁体   English   中英

如何使用BeautifulSoup4获得锚标签的“标题”?

[英]How do I get the “title” of an anchor tag using BeautifulSoup4?

我不知道如何获得标题上的标题。 这是我的代码:

from flask import Flask
import requests
from bs4 import BeautifulSoup

laptops = 'http://webscraper.io/test-sites/e-commerce/allinone/computers/laptops'


def scrape():
    page = requests.get('http://webscraper.io/test-sites/e-commerce/allinone/computers/laptops')
    soup = BeautifulSoup(page.content, "lxml")
    links = soup("a", {"class":"title"})

    for link in links:
        print(link.prettify())


scrape()

结果示例:

<a class="title" href="/test-sites/e-commerce/allinone/product/251" title="Asus VivoBook X441NA-GA190">
 Asus VivoBook X4...
</a>

<a class="title" href="/test-sites/e-commerce/allinone/product/252" title="Prestigio SmartBook 133S Dark Grey">
 Prestigio SmartB...
</a>

<a class="title" href="/test-sites/e-commerce/allinone/product/253" title="Prestigio SmartBook 133S Gold">
 Prestigio SmartB...
</a>

我如何获得“头衔”?

可以通过订阅或元素上的.attrs字典访问诸如title类的属性:

for link in links:
    print(link['title'])

请参阅AttributesBeautifulSoup文档

对于给定的URL,将产生:

Asus VivoBook X441NA-GA190
Prestigio SmartBook 133S Dark Grey
Prestigio SmartBook 133S Gold
Aspire E1-510
Lenovo V110-15IAP
Lenovo V110-15IAP
Hewlett Packard 250 G6 Dark Ash Silver
# ... etc

暂无
暂无

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM