如果数据框中的另一列使用pandas匹配某个值，则从数据框中的列中减去值

Question

说我有两个矩阵原始和参考

import pandas as pa
print "Original Data Frame"
# Create a dataframe
oldcols = {'col1':['a','a','b','b'], 'col2':['c','d','c','d'], 'col3':[1,2,3,4]}
a = pa.DataFrame(oldcols)
print "Original Table:"
print a

print "Reference Table:"
b = pa.DataFrame({'col1':['x','x'], 'col2':['c','d'], 'col3':[10,20]})
print b

现在我想从原始表（a）的第三列（col3）中减去两个表的第二列匹配的行中引用表（c）中的值。 因此，表2的第一行应该将值10添加到第三列，因为列为col2的表b的行为'c'，col3中的值为10。 说得通？ 这是一些代码：

col3 = []
for ix, row in a.iterrows():
    col3 += [row[2] + b[b['col2'] == row[1]]['col3']]

a['col3'] = col3
print "Output Table:"
print a

并希望使它看起来像这样：

Output Table:
  col1 col2  col3
0    a    c   11
1    a    d   22
2    b    c   13
3    b    d   24

问题是col3采用Name：和数组中的dtype

>>print col3
[0    11
Name: col3, dtype: int64, 1    22
Name: col3, dtype: int64, 0    13
Name: col3, dtype: int64, 1    24
Name: col3, dtype: int64]

你能帮忙吗？

Answer 1

这应该工作：

a['col3'] + a['col2'].map(b.set_index('col2')['col3'])
Out[94]: 
0    11
1    22
2    13
3    24
dtype: int64

或这个：

a.merge(b, on='col2', how='left')[['col3_x', 'col3_y']].sum(axis=1)
Out[110]: 
0    11
1    22
2    13
3    24
dtype: int64

您可以根据要求通过以下方式将其存储在原始文件中：

a['col3'] = a.merge(b, on='col2', how='left')[['col3_x', 'col3_y']].sum(axis=1)

如果数据框中的另一列使用pandas匹配某个值，则从数据框中的列中减去值

问题描述

1 个解决方案

解决方案1
3 已采纳 2016-07-19 13:24:56

如果数据框中的另一列使用pandas匹配某个值，则从数据框中的列中减去值

问题描述

1 个解决方案

解决方案1 3 已采纳 2016-07-19 13:24:56

解决方案1
3 已采纳 2016-07-19 13:24:56