Parsing numbers out of a Python string in a more robust way

Question

I got this string after scraping a website:

 '<p class="NewsItemContent" style="font-size: 18px;">;As of March 18, 1999, 
6 p.m. Pacific Daylight Time, there are a total of 70;events and 16;planned  
in this area. This total does not include adjacent cities.</p>'

How can I parse out 70 and 16? I just want a more robust way. The wording might change a little, but it will always contain "a total of {};events and {};planned". Thanks.

Answer 1

Not a very clean solution, but here we go:

import re

s = ('<p class="NewsItemContent" style="font-size: 18px;">;As of March 18, 1999, '
     '6 p.m. Pacific Daylight Time, there are a total of 70;events and 16;planned  '
     'in this area. This total does not include adjacent cities.</p>')

s = s.split('a total of ')[1]  # split by 'a total of' to get the second part

print(re.findall(r'\d+', s)[:2])  # raw string avoids an invalid-escape warning; keeps the first two numbers

['70', '16']
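
If the phrase around the numbers really is stable, a variant worth trying is to anchor the pattern on the words "events" and "planned" instead of taking the first two numbers after the split. This is only a sketch: `parse_counts` and `COUNTS_RE` are names made up for the example, and the pattern assumes the two numbers are always followed (after optional punctuation such as `;`) by "events" and "planned".

```python
import re

# Hypothetical pattern: assumes the text always contains
# "a total of <N>;events and <M>;planned", with any non-word
# separator (";", space, ...) between each number and its label.
COUNTS_RE = re.compile(
    r'a total of\s*(\d+)\W*events\s+and\s*(\d+)\W*planned',
    re.IGNORECASE,
)

def parse_counts(text):
    """Return (events, planned) as ints, or None if the phrase is missing."""
    match = COUNTS_RE.search(text)
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))

s = ('<p class="NewsItemContent" style="font-size: 18px;">;As of March 18, 1999, '
     '6 p.m. Pacific Daylight Time, there are a total of 70;events and 16;planned  '
     'in this area. This total does not include adjacent cities.</p>')

print(parse_counts(s))  # (70, 16)
```

Returning `None` when the phrase is absent means a change in the page wording shows up as a missing result rather than an `IndexError` from the split or a pair of unrelated numbers.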
