pandas 在基于另一个 dataframe 列的列中设置值

Question

Imagine I have two pandas data frame as:想象一下，我有两个 pandas 数据框：

import pandas as pd

df1 = {'y1': [1, 2, 3, 4]}
df2 = {'y2': [3, 1, 2, 6]}

What I want is if a value in y2 is greater than the value in y1, I want to set df2['y2'] to the corresponding df['y1'] .我想要的是如果 y2 中的值大于 y1 中的值，我想将df2['y2']设置为相应的df['y1'] 。 When I try selecting the corresponding columns like:当我尝试选择相应的列时，例如：

df2[df2['y2'] > df1['y1']]

This is returns True rather than the index.这是返回True而不是索引。 I was hoping to do something like:我希望做类似的事情：

df2[df2['y2'] > df1['y1']]['y2'] = df1['y1']

Answer 1

If same index in both DataFrames:如果两个 DataFrame 中的索引相同：

Use DataFrame.loc :使用DataFrame.loc ：

df2.loc[df2['y2'] > df1['y1'], 'y2'] = df1['y1'] 
print (df2)
   y2
0   1
1   1
2   2
3   4

Or Series.where , Series.mask :或Series.where ， Series.mask ：

df2['y2'] = df1['y1'].where(df2['y2'].gt(df1['y1']), df2['y2'])
df2['y2'] = df2['y2'].mask(df2['y2'].gt(df1['y1']), df1['y1'])
print (df2)
   y2
0   1
1   1
2   2
3   4

Answer 2

Use numpy.where :使用numpy.where ：

In [233]: import numpy as np

In [234]: df1 = pd.DataFrame({'y1': [1, 2, 3, 4]})
In [236]: df2 = pd.DataFrame({'y2': [3, 1, 2, 6]})

In [242]: df2['y2'] = np.where(df2.y2.gt(df1.y1), df1.y1, df2.y2)

In [243]: df2
Out[243]: 
   y2
0   1
1   1
2   2
3   4

Answer 3

`np.minimum`

Maintain all of existing df2 but with updated column values in 'y2'维护所有现有的df2 ，但在'y2'中更新列值

df2.assign(y2=np.minimum(df1.y1, df2.y2))

   y2
0   1
1   1
2   2
3   4

Or just a new dataframe with one column或者只是一个带有一列的新 dataframe

pd.DataFrame({'y2': np.minimum(df1.y1, df2.y2)})

   y2
0   1
1   1
2   2
3   4

pandas 在基于另一个 dataframe 列的列中设置值

问题描述

3 个解决方案

解决方案1
2 2021-03-23 05:27:27

解决方案2
2 已采纳 2021-03-23 05:29:32

解决方案3
2 2021-03-23 05:32:29

`np.minimum`

pandas 在基于另一个 dataframe 列的列中设置值

问题描述

3 个解决方案

解决方案1 2 2021-03-23 05:27:27

解决方案2 2 已采纳 2021-03-23 05:29:32

解决方案3 2 2021-03-23 05:32:29

np.minimum

解决方案1
2 2021-03-23 05:27:27

解决方案2
2 已采纳 2021-03-23 05:29:32

解决方案3
2 2021-03-23 05:32:29

`np.minimum`