Pandas column index string

Question

I want to take only the first three characters of a pandas column and match them. This is what I came up with, but the implementation is incorrect:

df.loc[df[0:2] == 'x, y] = 'x'

Answer 1

You are close. You need the .str accessor, and if df is a DataFrame you also need to specify the column to replace. Also note that 'x, y' is 4 characters long, including the whitespace:

df.loc[df['col'].str[:4] == 'x, y', 'col'] = 'x'

#another solution 
#df.loc[df['col'].str.startswith('x, y'), 'col'] = 'x'
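
The two conditions above can be compared side by side. This is a minimal sketch on the sample data used later in this answer; the variable names (prefix, mask_slice, mask_starts, mask_too_short) are made up for illustration. It also shows why a 3-character slice never matches the 4-character string 'x, y'.

```python
import pandas as pd

df = pd.DataFrame({'col': ['x, y temp', 'sx, y', 'x, y', 's']})
prefix = 'x, y'  # 4 characters, counting the space

# Derive the slice length from the prefix so the two always agree
mask_slice = df['col'].str[:len(prefix)] == prefix
mask_starts = df['col'].str.startswith(prefix)

# A 3-character slice can never equal a 4-character string
mask_too_short = df['col'].str[:3] == prefix

print(mask_slice.tolist())
print(mask_starts.tolist())
print(mask_too_short.tolist())
```

Taking the slice length from len(prefix) avoids having to count the characters by hand.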

If working with a Series:

s[s.str[:4] == 'x, y'] = 'x'
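
For a self-contained check of the Series variant, here is a small sketch; the Series s is built from the same sample values as the DataFrame example and is only an assumption for illustration.

```python
import pandas as pd

s = pd.Series(['x, y temp', 'sx, y', 'x, y', 's'])

# Overwrite every element whose first 4 characters are 'x, y'
s[s.str[:4] == 'x, y'] = 'x'

print(s.tolist())
```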

Sample:

import pandas as pd

df = pd.DataFrame({'col':['x, y temp', 'sx, y', 'x, y', 's']})
print (df)
         col
0  x, y temp
1      sx, y
2       x, y
3          s

#replace only the matching prefix (regex anchored at the start of the string)
df['col1'] = df['col'].str.replace(r'^x, y', 'x', regex=True)

#set a new value for every row where the condition holds
df.loc[df['col'].str[:4] == 'x, y', 'col'] = 'x'
print (df)
     col    col1
0      x  x temp <-col1 replace only substring
1  sx, y   sx, y
2      x       x
3      s       s
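
The substring replacement relies on '^' being read as a regular expression anchor. The default of the regex parameter of str.replace differs between pandas versions, so this sketch passes it explicitly both ways to show the difference; the names as_regex and as_literal are made up for illustration.

```python
import pandas as pd

df = pd.DataFrame({'col': ['x, y temp', 'sx, y', 'x, y', 's']})

# regex=True: '^' anchors the pattern at the start of each string
as_regex = df['col'].str.replace(r'^x, y', 'x', regex=True)

# regex=False: the pattern is the literal text '^x, y', which never occurs here
as_literal = df['col'].str.replace('^x, y', 'x', regex=False)

print(as_regex.tolist())
print(as_literal.tolist())
```

With regex=False nothing is replaced, which is why spelling out regex=True is the safer choice when the pattern contains an anchor.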

Solution 1
2 Accepted 2018-08-02 15:12:28
