Split digit using regex python

Question

I am trying to scrape data from https://www.mygov.in/covid-19 using Selenium, but a new problem arises when I extract the digits. Each number shows the current value followed by how much it changed, e.g. 3,81,74,366⬆54,229.

When I scrape it, I get the text as 3,81,74,36654,229. How can I get only the current value using Selenium in Python?

eg:
3,81,74,36654,229 to 3,81,74,366
10,79,894198 to 10,79,894
22,40,7200 to 22,40,720

Answer 1

Here's an extract of an HTML fragment from that page:

<p class="mid-wrap">8,43,56,092
  <span class="data-up">39,477</span>
</p>

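If you get the text of the p element, the result also includes the content of the nested span, so the two numbers run together.

Consider removing the span before reading the paragraph text, for example with BeautifulSoup: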

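from bs4 import BeautifulSoup

# Assumption: `html` holds the page source, e.g. driver.page_source from Selenium
soup = BeautifulSoup(html, 'html.parser')

for p in soup.select('p.mid-wrap'):
    span = p.find('span')
    if span:
        # The span holds the change; print it, then remove it from the tree
        print(span.get_text(strip=True))
        span.extract()
    # Only the current value is left in the paragraph
    print(p.get_text(strip=True))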

Output:

39,477
8,43,56,092
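
Since the question asks about Selenium specifically, the same idea can be sketched without parsing the HTML a second time: read the text of the paragraph and of its span, then cut the span text off the end. This is only a sketch. The helper name strip_change is made up for illustration, and it assumes the span text always appears at the very end of the paragraph text.

```python
def strip_change(full_text, change_text):
    """Return full_text without the trailing change_text, if it ends with it."""
    full = full_text.strip()
    change = change_text.strip()
    if change and full.endswith(change):
        # Drop the change value and any whitespace left before it
        return full[: -len(change)].strip()
    return full


# With Selenium this might be called roughly like this
# (selector taken from the HTML fragment above):
#   p = driver.find_element(By.CSS_SELECTOR, "p.mid-wrap")
#   span = p.find_element(By.TAG_NAME, "span")
#   current = strip_change(p.text, span.text)
print(strip_change("3,81,74,36654,229", "54,229"))  # 3,81,74,366
print(strip_change("22,40,7200", "0"))              # 22,40,720
```

Note that find_element raises NoSuchElementException when a paragraph has no span, so that case would need its own handling.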

Answer 2

Assuming every number is at least one thousand and the current value comes first in the string, something like this should work:

^.*?,\d{3}
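
The pattern works because of Indian digit grouping: every group after a comma has two digits except the last one, which has three. The lazy `.*?` therefore stops at the first comma followed by three digits, which is the end of the current value. A minimal sketch of applying it with Python's re module follows; the function name current_value is made up for illustration.

```python
import re

# First comma followed by three digits marks the end of the current value
CURRENT_VALUE = re.compile(r"^.*?,\d{3}")


def current_value(text):
    """Return the leading current value, or None if the pattern does not match."""
    match = CURRENT_VALUE.match(text)
    return match.group(0) if match else None


for raw in ["3,81,74,36654,229", "10,79,894198", "22,40,7200"]:
    print(raw, "->", current_value(raw))
# 3,81,74,36654,229 -> 3,81,74,366
# 10,79,894198 -> 10,79,894
# 22,40,7200 -> 22,40,720
```

A current value below 1,000 contains no comma, so it is not handled: the function returns None, or a wrong match if the change value itself contains a comma.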
