根据 Date 列将 df1 中的列添加到 df2 中，如果缺少 df1 [Date] 条目，则填写 na

Question

I have dataframe df1 with 850 rows and column names ['Date', 'A'] .我有df1具有 850 行和列名['Date', 'A']数据框。 I also have df2 with 900 rows and column names ['Date', 'B', 'C', 'D'] .我也有df2 900 行和列名['Date', 'B', 'C', 'D'] 。

The difference in their number of rows is because df1 has some missing 'Date' entries.它们的行数不同是因为df1缺少一些“日期”条目。 But, all entries in df1['Date'] are in df2['Date'].但是，df1['Date'] 中的所有条目都在 df2['Date'] 中。

Question: I would like to merge df1['A'] to df2 on basis of same ['Date'] rows.问题：我想基于相同的['Date']行将df1['A']合并到df2 。 After merging, I would like the resultant df2['A'] to reflect a 'na' for all those rows whose ['Dates'] are missing in df1 .合并后，我希望生成的df2['A']为df1中缺少['Dates']的所有行反映一个 'na' 。

I tried df2=pd.merge(df2, df1, on="Date") but I get resultant df2 to have 850 rows which seems that the dates which don't match between df1 and df2 are being deleted.我尝试df2=pd.merge(df2, df1, on="Date")但我得到的结果df2有 850 行，这似乎是删除了 df1 和 df2 之间不匹配的日期。 Instead, I would want the post-merged resultant df2 to be 900 rows and the unmatched date rows should show 'na' in df2['A']`.相反，我希望合并后的结果df2为 900 行，并且不匹配的日期行应在 df2['A']` 中显示“na”。

How to achieve this?如何做到这一点？

Answer 1

Use left join instead of inner join (default behavior)使用left连接而不是inner连接（默认行为）

ie, IE，

new_df = pd.merge(df2, df1, on="Date", how='left')

To fill NA (as asked by OP in comments) with zero,用零填充NA （如 OP 在评论中要求的那样），

new_df.fillna(0, inplace=True)
# new_df['column'] = new_df['column'].astype(np.float64) # to convert column to float

根据 Date 列将 df1 中的列添加到 df2 中，如果缺少 df1 [Date] 条目，则填写 na

问题描述

1 个解决方案

解决方案1
1 已采纳 2022-05-19 06:10:11

根据 Date 列将 df1 中的列添加到 df2 中，如果缺少 df1 [Date] 条目，则填写 na

问题描述

1 个解决方案

解决方案1 1 已采纳 2022-05-19 06:10:11

解决方案1
1 已采纳 2022-05-19 06:10:11