Extract number from alpha-numeric column pandas

Question

Input df

Code               Value
USH0001108421891  -9999    
USH0001108421892  -9999 X3 
USH0001108421893  -77EX3 
USH0001108421894   483EQ3 
USH0001108421895   325EX3 
USH0001108421896   297ES3

As can be seen from the example, the column Value has both strings and integers. But I want the only the first set of integers before the alphabets.

Expected df

Code               Value
USH0001108421891  -9999    
USH0001108421892  -9999 
USH0001108421893  -77
USH0001108421894   483
USH0001108421895   325
USH0001108421896   297

I tried this, but it returned an error.

df1['Value'] = df1['Value'].astype(int)
ValueError: invalid literal for int() with base 10: '-77EX3'

Answer 1

You can use .str.extract with regex pattern containing capturing group:

df['Value'] = df['Value'].str.extract(r'^(-?\d+)', expand=False).astype(int)

              Code   Value
0  USH0001108421891  -9999
1  USH0001108421892  -9999
2  USH0001108421893    -77
3  USH0001108421894    483
4  USH0001108421895    325
5  USH0001108421896    297

Extract number from alpha-numeric column pandas

Question

1 answers

solution1
2 ACCPTED 2021-01-08 18:15:39

Extract number from alpha-numeric column pandas

Question

1 answers

solution1 2 ACCPTED 2021-01-08 18:15:39

solution1
2 ACCPTED 2021-01-08 18:15:39