如何根据公共列将 dataframe 的列值替换为另一个 dataframe 的值？

Question

I have two dataframes, one that looks like this:我有两个数据框，一个看起来像这样：

hec_df: hec_df:

accident year事故年	factor因素	age年龄
2007 2007年	1.5 1.5	13 13
2008 2008年	1.6 1.6	11 11
2009 2009	1.7 1.7	15 15

and hec_ldfs:和 hec_ldfs：

accident year事故年	factor因素
2007 2007年	1.6 1.6
2008 2008年	1.64 1.64
2009 2009	1.7 1.7

My goal is to replace the factor value of df1 with the factor value of df2.我的目标是用 df2 的因子值替换 df1 的因子值。 My code for this is我的代码是

hec_df['factor'] = hec_df['factor'].map(hec_ldfs.set_index('accident year')['factor'])

But it returns NaN on the factor column.但它在因子列上返回 NaN。 Does anyone know why this is happening?有谁知道为什么会这样？

EDIT: I'm not sure why my first dataframe is formatted like that, does anyone know how to fix it?编辑：我不确定为什么我的第一个 dataframe 是这样格式化的，有人知道如何解决吗？

Answer 1

you're mapping factor to the accident_year, instead of hec_df.accident_year to the hec_df.accident year您将 factor 映射到 accident_year，而不是 hec_df.accident_year 到 hec_df.accident 年份

hec_df['factor'] = hec_df['accident year'].map(hec_ldfs.set_index('accident year')['factor']).fillna(hec_df['factor'])
hec_df

accident year   factor  age
0   2007    1.60    13
1   2008    1.64    11
2   2009    1.70    15

如何根据公共列将 dataframe 的列值替换为另一个 dataframe 的值？

问题描述

1 个解决方案

解决方案1
3 2022-11-30 02:06:07

如何根据公共列将 dataframe 的列值替换为另一个 dataframe 的值？

问题描述

1 个解决方案

解决方案1 3 2022-11-30 02:06:07

解决方案1
3 2022-11-30 02:06:07