使用beautifulsoup从网站提取号码？

Question

以下python代码：

from bs4 import BeautifulSoup
div = '<div class="hm"><span class="xg1">查看:</span> 15660<span class="pipe">|</span><span class="xg1">回复:</span> 435</div>'
soup = BeautifulSoup(div, "lxml")
hm = soup.find("div", {"class": "hm"})
print(hm)

在这种情况下，我想要两个数字的输出：

15660
435

我想尝试使用beautifulsoup从网站上提取数字。 但是我不知道该怎么做？

Answer 1

使用正则表达式调用soup.find_all

>>> list(map(str.strip, soup.find_all(text=re.compile(r'\b\d+\b'))))

要么，

>>> [x.strip() for x in soup.find_all(text=re.compile(r'\b\d+\b'))]

['15660', '435']

如果您需要整数而不是字符串，请在列表int内调用int

>>> [int(x.strip()) for x in soup.find_all(text=re.compile(r'\b\d+\b'))]
[15660, 435]

使用beautifulsoup从网站提取号码？

问题描述

1 个解决方案

解决方案1
0 已采纳 2018-01-10 04:50:34

使用beautifulsoup从网站提取号码？

问题描述

1 个解决方案

解决方案1 0 已采纳 2018-01-10 04:50:34

解决方案1
0 已采纳 2018-01-10 04:50:34