[英]Pandas join 2 dataframes
我有2个数据框:
>>> result
id order_nr
0 3131334 334756912
0 3131312 386517432
0 3131309 356299432
0 3131205 397514312
...
>>> OMSdf
order_nr status refund_amount
0 377766482 already_canceled 0
1 348722582 already_canceled 0
2 395287472 on_hold 0
3 345883652 already_canceled 0
...
这两个数据帧在“ order_nr”字段中具有一些通用值:
>>> result[result['order_nr'] == '377766482']
id order_nr
0 3129153 377766482
>>> OMSdf[OMSdf['order_nr'] == '377766482']
order_nr status refund_amount
0 377766482 already_canceled 0
但我尝试将他们加入一起:
resultfinal = result.join(OMSdf, on='order_nr', how='inner', lsuffix='', rsuffix='_oms', sort=False)
结果为空:
>>> resultfinal
Empty DataFrame
Columns: [id, order_nr, order_nr_oms, status, refund_amount]
Index: []
使用以下代码
df_merged = pd.merge(result, OMSdf, how='inner')
在您的情况下,可以在连接之前使order_nr
作为两个数据order_nr
的索引,如下所示
result.index = result['order_nr']
OMSdf.index = OMSdf['order_nr']
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.