如何用正则表达式替换 pandas dataframe 中的列值？

Question

我想用'ASUS'或'ACER'（大写）替换下面列中的值，即只要值中有单词（忽略大小写）'acer'，只需将其替换为'ACER' , 和单词 'asus *' 替换为 'ASUS'。 我使用下面来自 Pandas 文档的示例屏幕截图作为示例。 我应用了正则表达式 function，但它似乎不起作用 - output 没有任何反应。 我的代码：

dfx = pd.DataFrame({'Brands':['asus', 'ASUS ZEN', 'Acer','ACER Swift']})
dfx = dfx.replace([{'Brands': r'^asus.$'}, {'Brands': 'ASUS'}, {'Brands': r'^acer.$'}, {'Brands': 'ACER'}], regex=True)
dfx['Brands'].unique()

Jupyter笔记本中的Output：

数组（['华硕'，'华硕 ZEN'，'宏碁'，'ACER Swift']，dtype=object）

使用的 Pandas 文档示例：

Pandas 链接在这里

非常感谢任何有一点解释的帮助。

接受的解决方案：

dfx = pd.DataFrame({'Brands':['asus', 'ASUS ZEN', 'Acer','ACER Swift']})

dfx['Brands'] =  dfx['Brands'].str.lower().str.replace('.*asus.*', 'ASUS', regex=True).str.replace('.*acer.*', 'ACER', regex=True)
OR
dfx['Brands'] = dfx.Brands.apply(lambda x: re.sub(r".*(asus|acer).*", lambda m: m.group(1).upper(), x, flags=re.IGNORECASE))

dfx['Brands'].unique()

Output：

数组（['华硕'，'ACER']，dtype =对象）

Answer 1

dfx.Brands.apply(lambda x: re.sub(r".*(asus|acer).*", lambda m: m.group(1).upper(), x, flags=re.IGNORECASE))

Answer 2

请试试

dfx['Brands'] =  dfx['Brands'].str.lower().str.replace('.*asus.*', 'ASUS', regex=True).str.replace('.*acer.*', 'ACER', regex=True)

如何用正则表达式替换 pandas dataframe 中的列值？

问题描述

2 个解决方案

解决方案1
1 2021-04-19 07:58:49

解决方案2
0 已采纳 2021-04-19 07:51:24

如何用正则表达式替换 pandas dataframe 中的列值？

问题描述

2 个解决方案

解决方案1 1 2021-04-19 07:58:49

解决方案2 0 已采纳 2021-04-19 07:51:24

解决方案1
1 2021-04-19 07:58:49

解决方案2
0 已采纳 2021-04-19 07:51:24