How to remove only numbers from a string in Pandas columns

Question

I'm an environmental geologist and I'm just learning Python/Pandas. I have a dataframe of analytical data in Pandas similar to the example below:

I only want to remove numbers from Total_dl leaving the detection limits (numbers with <). This would be the final dataframe I'm looking for:

Since the column is strings I'm not sure how to parse the column. Any help would be appreciated.

Thanks

Answer 1

The following should do the trick:

import numpy as np


# Only applies if Total_dl is numeric: blanks out values below 1
mask = df.Total_dl < 1.
df.loc[mask, 'Total_dl'] = np.nan

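If Total_dl holds strings, you can try the following:

import numpy as np


# True for detection limits such as '<0.021'
mask = df.Total_dl.str.startswith('<')
# Replace everything that is not a detection limit with NaN
df.loc[~mask, 'Total_dl'] = np.nan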
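Real lab exports often contain missing cells or stray spaces, which can trip up a plain startswith test. The following is a minimal sketch of one way to guard against that, not a definitive solution. The sample rows are hypothetical (modelled on the two rows shown in Answer 2, plus a missing value), and the values are assumed to be stored as strings.

```python
import numpy as np
import pandas as pd

# Hypothetical sample data: a plain result, a detection limit with a
# leading space, and a missing value.
df = pd.DataFrame({
    "SampleID": ["A-1-0'", "A-1-0.5'", "A-2-0'"],
    "Total_dl": ["2.5", " <0.021", None],
})

# Strip whitespace before testing; na=False makes missing cells count
# as "not a detection limit" instead of propagating NaN into the mask.
is_limit = df["Total_dl"].str.strip().str.startswith("<", na=False)

# Blank out every row that is not a detection limit.
df.loc[~is_limit, "Total_dl"] = np.nan
print(df)
```

Note that the mask is computed on the stripped text, but the stored value is left as it was, so the leading space survives in the column.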

Answer 2

One way to do it. Not sure how good a solution it is:

import numpy as np
df['Total_dl'] = df['Total_dl'].apply(lambda o: o if '<' in str(o) else np.nan)

Using a function that does the same instead:

>>> df
   SampleID Total_dl
0    A-1-0'      2.5
1  A-1-0.5'   <0.021
>>> df.dtypes
SampleID    object
Total_dl    object
dtype: object
>>> def foo(o):
...     if '<' in str(o):
...         return o
...     else:
...         return np.nan
...         
>>> df['Total_dl'] = df['Total_dl'].apply(foo)
>>> df
   SampleID Total_dl
0    A-1-0'      NaN
1  A-1-0.5'   <0.021
>>>

Answer 3

Say your data frame is called df, then this will do the trick:

import numpy as np
# Boolean mask: True for rows that do NOT contain "<" (plain numbers)
nan_condition = ~df["Total_dl"].str.contains("<")
df.loc[nan_condition, "Total_dl"] = np.nan
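A possible variation on the same idea uses Series.where, which keeps values where a mask is True and puts NaN everywhere else. This is only a sketch under assumptions: the sample frame is hypothetical, and it deliberately mixes strings, a float, and a missing value to show why the astype(str) step is there.

```python
import pandas as pd

# Hypothetical sample: a plain result, a detection limit, a missing value,
# and a result stored as a float instead of a string.
df = pd.DataFrame({
    "SampleID": ["A-1-0'", "A-1-0.5'", "A-2-0'", "A-2-0.5'"],
    "Total_dl": ["2.5", "<0.021", None, 3.1],
})

# astype(str) lets the string test run on non-string entries;
# regex=False treats "<" as a literal character, and na=False
# makes missing cells count as "no detection limit".
has_limit = df["Total_dl"].astype(str).str.contains("<", regex=False, na=False)

# where() keeps the original value where the mask is True, NaN elsewhere.
df["Total_dl"] = df["Total_dl"].where(has_limit)
print(df)
```

Compared with assigning through df.loc, this avoids a separate import of numpy, at the cost of rebuilding the whole column.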

Answer 4

You can use this:


data = data.loc[data[column] > x]
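Boolean indexing of this kind drops whole rows rather than blanking individual cells, which is a different result from the one the question asks for. The sketch below illustrates the difference on the two rows from Answer 2's session, assuming the values are strings. The name only_limits is made up for the example.

```python
import pandas as pd

# Sample rows copied from Answer 2's session; values assumed to be strings.
df = pd.DataFrame({
    "SampleID": ["A-1-0'", "A-1-0.5'"],
    "Total_dl": ["2.5", "<0.021"],
})

# Boolean indexing keeps only the rows where the mask is True,
# so the row holding "2.5" disappears entirely.
only_limits = df.loc[df["Total_dl"].str.startswith("<")]
print(only_limits)
```

If the sample IDs of the blanked rows still matter, assigning NaN through a mask is likely the better fit.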

4 answers

solution1
0 2020-02-29 23:19:57

solution2
0 ACCEPTED 2020-02-29 23:27:47

solution3
0 2020-03-01 07:06:41

solution4
-1 2020-02-29 23:25:10
