Getting None in Beautiful Soup web scraping

My intent

I want to scrape the commit count of a repository from GitHub using Beautiful Soup with Python.

My issue

My script returns None.

My code

from bs4 import BeautifulSoup
import requests

html = requests.get('https://github.com/pnp/cli-microsoft365').text
soup = BeautifulSoup(html, 'html.parser')
commits = soup.find('strong', class_='repo-content-pjax-container > div > div.gutter-condensed.gutter-lg.flex-column.flex-md-row.d-flex > div.flex-shrink-0.col-12.col-md-9.mb-4.mb-md-0 > div.Box.mb-3 > div.Box-header.position-relative > div > div:nth-child(4) > ul > li > a > span > strong')
print(commits)

What happens?

You are using a "wild mix" in your find(): you pass a CSS selector path as the class_ argument, which will not lead to the element you expect to find. That is why you get None.
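
A minimal, self-contained illustration (the HTML below is made up for the example, not taken from GitHub): find() compares class_ against an element's class attribute instead of interpreting it as a selector path, so the original call matches nothing, while select_one() does accept a CSS selector:

from bs4 import BeautifulSoup

html = '<div class="box"><strong class="num">1,664</strong></div>'
soup = BeautifulSoup(html, 'html.parser')

# class_ is compared against the class attribute, not parsed as a CSS path
print(soup.find('strong', class_='div.box > strong.num'))  # None
print(soup.find('strong', class_='num'))                   # <strong class="num">1,664</strong>
print(soup.select_one('div.box > strong.num'))             # <strong class="num">1,664</strong>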

How to fix?

Use a CSS selector to chain the parts you are looking for; in this case it picks the <svg> in front of the commit count and its next <span> element, which contains the <strong>:

soup.select_one('svg.octicon.octicon-history + span strong').text 

Output (at the moment of my request)

1,664
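
Putting it together, a minimal sketch of the full script with the suggested selector replacing the original find() call; the selector reflects GitHub's markup at the time of the answer and may return nothing if the page structure changes or the counter is rendered client-side:

from bs4 import BeautifulSoup
import requests

# Fetch and parse the repository page
html = requests.get('https://github.com/pnp/cli-microsoft365').text
soup = BeautifulSoup(html, 'html.parser')

# The commit count sits in the <strong> inside the <span> right after the history icon
commits = soup.select_one('svg.octicon.octicon-history + span strong')
print(commits.text if commits else 'commit counter not found')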
