根据另一个数据框中的值从DataFrame中选择行，并根据第二个DataFrame使用值更新其中一个列

Question

I have two Dataframes df and df1. 我有两个数据帧df和df1。

Main DataFrame is as follows: 主DataFrame如下：
DF: DF：

    start   end price
0   A   Z   1
1   B   Y   2
2   C   X   3
3   A   Z   4
4   D   W   5

Second DataFrame: 第二个DataFrame：
DF1: DF1：

start   end price
    0   A   Z   100
    1   B   Y   200

I want the main dataframe df to update the values in 'price' columns based on the start and end in df1. 我希望主数据帧df根据df1中的开头和结尾更新'price'列中的值。 it should update column value for all the rows having the same start and end as in df1. 它应该更新具有与df1相同的开始和结束的所有行的列值。 DF: DF：

start   end price
0   A   Z   100
1   B   Y   200
2   C   X   3
3   A   Z   100
4   D   W   5

(all AZ and BY in df should get updated). （df中的所有AZ和BY都应该更新）。 Is there anyway I can get this output ? 无论如何我能得到这个输出吗？ In reality the datframes have more columns but I want to update only one column(eg.'Price'). 实际上，数据帧有更多列，但我想只更新一列（例如''价格'）。

Answer 1

First, you can merge: 首先，您可以合并：

s = df1.merge(df2, left_on=['start', 'end'], right_on=['start', 'end'], how='left')

Then you can fillna and index your desired columns: 然后，您可以fillna并索引所需的列：

s.assign(price=s.price_y.fillna(s.price_x))[['start', 'end', 'price']]

  start end  price
0     A   Z  100.0
1     B   Y  200.0
2     C   X    3.0
3     A   Z  100.0
4     D   W    5.0

Answer 2

Using update 使用update

df=df.set_index(['start','end'])
df.update(df1.set_index(['start','end']))
df.reset_index()
Out[99]: 
  start end  price
0     A   Z  100.0
1     B   Y  200.0
2     C   X    3.0
3     A   Z  100.0
4     D   W    5.0

Answer 3

`merge`

df.drop('price', 1).merge(df1, 'left').fillna(df)

  start end  price
0     A   Z  100.0
1     B   Y  200.0
2     C   X    3.0
3     A   Z  100.0
4     D   W    5.0

I'm going to merge on ['start', 'end'] and that pesky price is going to get in my way. 我要在['start', 'end']上合并，那个讨厌的price会妨碍我。 So, I drop it. 所以，我放弃它。
I need to preserve df index because I have that repeat of 'A' and 'Z' . 我需要保留df索引，因为我重复了'A'和'Z' 。 So, I use a 'left' merge 所以，我使用'left' merge
Now my missing elements can be filled back in with df 现在我的遗失元素可以用df填充

根据另一个数据框中的值从DataFrame中选择行，并根据第二个DataFrame使用值更新其中一个列

问题描述

3 个解决方案

解决方案1
2 2018-09-21 02:02:39

解决方案2
2 2018-09-21 02:07:57

解决方案3
1 2018-09-21 05:23:48

`merge`

根据另一个数据框中的值从DataFrame中选择行，并根据第二个DataFrame使用值更新其中一个列

问题描述

3 个解决方案

解决方案1 2 2018-09-21 02:02:39

解决方案2 2 2018-09-21 02:07:57

解决方案3 1 2018-09-21 05:23:48

merge

解决方案1
2 2018-09-21 02:02:39

解决方案2
2 2018-09-21 02:07:57

解决方案3
1 2018-09-21 05:23:48

`merge`