I want to replace all strings that contain a specific substring. So for example if I have this dataframe:
import pandas as pd
df = pd.DataFrame({'name': ['Bob', 'Jane', 'Alice'],
'sport': ['tennis', 'football', 'basketball']})
You can use str.contains
to mask the rows that contain 'ball' and then overwrite with the new value:
In [71]:
df.loc[df['sport'].str.contains('ball'), 'sport'] = 'ball sport'
df
Out[71]:
name sport
0 Bob tennis
1 Jane ball sport
2 Alice ball sport
To make it case-insensitive pass `case=False:
df.loc[df['sport'].str.contains('ball', case=False), 'sport'] = 'ball sport'
You can use apply
with a lambda. The x
parameter of the lambda function will be each value in the 'sport' column:
df.sport = df.sport.apply(lambda x: 'ball sport' if 'ball' in x else x)
一个不同的str.contains
df['support'][df.name.str.contains('ball')] = 'ball support'
You can use a lambda function also:
data = {"number": [1, 2, 3, 4, 5], "function": ['IT', 'IT application',
'IT digital', 'other', 'Digital'] }
df = pd.DataFrame(data)
df.function = df.function.apply(lambda x: 'IT' if 'IT' in x else x)
The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.