为什么替换子字符串在 Pandas 数据框中不起作用？

Question

I try to replace everywhere the symbols " - in the start line and end line:我尝试在起始行和结束行中替换所有符号" - ：

dtnew.applymap(lambda x: x.replace('^-', ''))
dtnew.applymap(lambda x: x.replace('^"', ''))

But the output dataframe has these symbols但是输出数据框有这些符号

Answer 1

well, if performance is NOT an issue you can iterate over columns and rows and use a simple replace (see below).好吧，如果性能不是问题，您可以遍历列和行并使用简单的替换（见下文）。 Again, I would only use this if the dataframe is not enormous and you have no concern for performance.同样，如果数据框不是很大并且您不关心性能，我只会使用它。

for column in df.columns:
    for i in df.index:    
        df[column][i] = df[column][i].replace('-','').replace('"','')

Answer 2

Assuming this example and that you only want to replace the leading character(s):假设此示例并且您只想替换前导字符：

df = pd.DataFrame([['- abc', 'def -'], ['" ghi-', '--jkl']])

        0      1
0   - abc  def -
1  " ghi-  --jkl

Use str.lstrip .使用str.lstrip 。

df2 = df.apply(lambda c: c.str.lstrip('- "'))

output:输出：

      0      1
0   abc  def -
1  ghi-    jkl

# as list: [['abc', 'def -'], ['ghi-', 'jkl']]

For only the first character, use str.replace :仅对于第一个字符，使用str.replace ：

df2 = df.apply(lambda c: c.str.replace('^[- "]', '', regex=True))

output:输出：

       0      1
0    abc  def -
1   ghi-   -jkl

# as list: [[' abc', 'def -'], [' ghi-', '-jkl']]

generalization:概括：

to strip both start and end, use str.strip剥离开始和结束，使用str.strip
to remove all characters (anywhere): df.apply(lambda c: c.str.replace('[- "]', '', regex=True))删除所有字符（任何地方）： df.apply(lambda c: c.str.replace('[- "]', '', regex=True))
to remove first or last matching character: df.apply(lambda c: c.str.replace('(^[- "]|[- "]$)', '', regex=True))删除第一个或最后一个匹配字符： df.apply(lambda c: c.str.replace('(^[- "]|[- "]$)', '', regex=True))

为什么替换子字符串在 Pandas 数据框中不起作用？

问题描述

2 个解决方案

解决方案1
2 2022-05-26 19:38:06

解决方案2
1 2022-05-27 07:43:38

generalization:概括：

为什么替换子字符串在 Pandas 数据框中不起作用？

问题描述

2 个解决方案

解决方案1 2 2022-05-26 19:38:06

解决方案2 1 2022-05-27 07:43:38

generalization:概括：

解决方案1
2 2022-05-26 19:38:06

解决方案2
1 2022-05-27 07:43:38