How to replace pandas dataframe column values based on another dataframe?

Question

I have two dataframes, df1 and df2. This is the content of df1:

  col1  col2  col3
0    1    12   100
1    2    34   200
2    3    56   300
3    4    78   400

This is the content of df2:

  col1  col2  col3
0    2  1984   500
1    3  4891   600

I want to have this final data frame:

  col1  col2  col3
0    1    12   100
1    2  1984   200
2    3  4891   300
3    4    78   400

Note that col1 is the primary key in df1 and df2. I tried to do it via mapping values, but I could not make it work.

Here is an MCVE for checking those data frames easily:

import pandas as pd

# df1: the frame to update; note that col1 holds string keys
d = {'col1': ['1', '2', '3', '4'], 'col2': [12, 34, 56, 78], 'col3': [100, 200, 300, 400]}
df1 = pd.DataFrame(data=d)
# df2: replacement col2 values for keys '2' and '3'
d = {'col1': ['2', '3'], 'col2': [1984, 4891], 'col3': [500, 600]}
df2 = pd.DataFrame(data=d)
print(df1)
print(df2)
# df_final: the expected result
d = {'col1': ['1', '2', '3', '4'], 'col2': [12, 1984, 4891, 78], 'col3': [100, 200, 300, 400]}
df_final = pd.DataFrame(data=d)
print(df_final)

Answer 1

You can use map and fillna:

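# Look up each col1 key in df2; keys missing from df2 become NaN
# and are then filled back in from df1's original col2.
df1['col2'] = (df1['col1']
               .map(df2.set_index('col1')['col2'])
               .fillna(df1['col2'], downcast='infer')
              )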

Output:

  col1  col2  col3
0    1    12   100
1    2  1984   200
2    3  4891   300
3    4    78   400
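
On recent pandas versions the downcast keyword of fillna is deprecated, so the snippet above may emit a warning or fail there. A minimal sketch of the same map-then-fillna idea without that keyword follows; the names lookup and mapped are introduced here for illustration only, and restoring the dtype with astype is an assumption about what the downcast was meant to achieve.

```python
import pandas as pd

df1 = pd.DataFrame({'col1': ['1', '2', '3', '4'],
                    'col2': [12, 34, 56, 78],
                    'col3': [100, 200, 300, 400]})
df2 = pd.DataFrame({'col1': ['2', '3'],
                    'col2': [1984, 4891],
                    'col3': [500, 600]})

# Series indexed by the key, holding the replacement values (illustrative name).
lookup = df2.set_index('col1')['col2']

# Keys of df1 that are absent from df2 map to NaN, which turns the column into floats.
mapped = df1['col1'].map(lookup)

# Fill the gaps with the original values, then cast back to the original dtype.
df1['col2'] = mapped.fillna(df1['col2']).astype(df1['col2'].dtype)

print(df1)
```

The cast back to the original dtype is safe here because every NaN has been filled before astype runs.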

Answer 2

If col1 is unique, combine_first is an option, too:

>>> (df2.drop("col3", axis=1)
        .set_index("col1")
        .combine_first(df1.set_index("col1"))
        .reset_index()
    )
  col1  col2  col3
0    1    12   100
1    2  1984   200
2    3  4891   300
3    4    78   400
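
The following sketch spells out the two conditions this approach relies on: that col1 is unique in both frames, and that col3 is dropped from df2 first. Without the drop, combine_first would also prefer df2's col3 values for the matching keys. The variable names with_drop and without_drop are illustrative, not part of the original answer.

```python
import pandas as pd

df1 = pd.DataFrame({'col1': ['1', '2', '3', '4'],
                    'col2': [12, 34, 56, 78],
                    'col3': [100, 200, 300, 400]})
df2 = pd.DataFrame({'col1': ['2', '3'],
                    'col2': [1984, 4891],
                    'col3': [500, 600]})

# The approach assumes the key column is unique in both frames.
assert df1['col1'].is_unique and df2['col1'].is_unique

# With col3 dropped from df2, only col2 is taken from df2.
with_drop = (df2.drop('col3', axis=1)
                .set_index('col1')
                .combine_first(df1.set_index('col1'))
                .reset_index())

# Without the drop, col3 is also overwritten for keys '2' and '3'.
without_drop = (df2.set_index('col1')
                   .combine_first(df1.set_index('col1'))
                   .reset_index())

print(with_drop)
print(without_drop)
```

Depending on the pandas version, the numeric columns of the result may come back as floats rather than integers, so an explicit astype may be needed if the integer dtype matters.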

2 answers

Answer 1
2 Accepted 2022-09-16 15:51:26

Answer 2
2 2022-09-16 15:58:16
