如果 Pandas 中字符串的开头和结尾处可用，如何有效地删除字符？

Question

The idea is to remove full stop, commas, quotation if it is available at the beginning and last string in Pandas.如果在 Pandas 的开头和最后一个字符串中可用，则删除句号、逗号、引号。

Given a df as below给定一个df如下

data = {'Name': ['"Tom hola.', '"nick"', 'krish here .','oh my *']}

The expected output is预期的 output 是

Tom hola
nick
krish here
oh my

I tried the following code, but it did not work as intended我尝试了以下代码，但它没有按预期工作

import pandas as pd
df = pd.DataFrame(data)
df['Name'] = df['Name'].str[-1:].replace({"\. ": "Na"},regex=True)

May I know how this objective can be achieved?我可以知道如何实现这个目标吗？

Also, can the approach extended for it to be applied across different columns?此外，该方法是否可以扩展以应用于不同的列？

Answer 1

You can use pd.Series.str.replace if you want replace only colum else use df.replace .如果您只想替换列，则可以使用pd.Series.str.replace ，否则使用df.replace 。

# Using `pd.Series.str.replace`
df['Name'] = df['Name'].str.replace(r'\.$','')
df          Name
0     Tom hola
1   secondx //
2         nick
3  krish here

# Using `df.replace`
df.replace(r'\.$', '', regex=True)
          Name
0     Tom hola
1   secondx //
2         nick
3  krish here

About regex pattern used in the answer click here regex101关于答案中使用的正则表达式模式单击此处regex101

EDIT:编辑：

You can use pd.Series.str.strip to strip " , . and *您可以使用pd.Series.str.strip剥离" 、 .和*

df['Name'].str.strip(r'\"\.\*')

0       Tom hola
1           nick
2    krish here
3         oh my
Name: Name, dtype: object

# OR
df.Name.str.replace(r'^\W+|(.*?)\W+$',r'\1') # Replaces only values in `Name`
# df.replace(r'^\W+|(.*?)\W+$',r'\1',regex=True) Replaces for whole df

More about regex pattern used in second case here更多关于在第二种情况下使用的正则表达式模式here

Answer 2

use (\W)*$ if you want to match all specials characters at the end of the string如果要匹配字符串末尾的所有特殊字符，请使用(\W)*$

df = pd.DataFrame({'Name': ['Tom hola.', 'secondx //', 'nick', 'krish here .']})
df['Name'] = df['Name'].replace({r'(\W)*$': ""}, regex=True)

Output: Output：

         Name
0     Tom hola
1    secondx 
2        nick
3  krish here

You can use https://regex101.com to test and better understand what your regex is doing您可以使用https://regex101.com来测试并更好地了解您的正则表达式在做什么

如果 Pandas 中字符串的开头和结尾处可用，如何有效地删除字符？

问题描述

2 个解决方案

解决方案1
2 已采纳 2020-07-18 07:59:55

解决方案2
1 2020-07-18 08:22:14

如果 Pandas 中字符串的开头和结尾处可用，如何有效地删除字符？

问题描述

2 个解决方案

解决方案1 2 已采纳 2020-07-18 07:59:55

解决方案2 1 2020-07-18 08:22:14

解决方案1
2 已采纳 2020-07-18 07:59:55

解决方案2
1 2020-07-18 08:22:14