如何按索引组合两个数据帧（熊猫）

Question

I have two dataframes, with the same date field, but different other fields.我有两个数据框，具有相同的date字段，但其他字段不同。 I need to add a column pneumonia_ARVI from dataframe pneumonia_ARVI to dataframe Result_data .我需要从数据Result_data pneumonia_ARVI Result_data到数据Result_data添加一列pneumonia_ARVI Result_data 。

They initially differ in the number of dates, in Result_data dataframe there are significantly more dates than in pneumonia_ARVI它们最初的日期数量不同，在Result_data数据Result_data ，日期明显多于pneumonia_ARVI Result_data

I need a concatenation with a date match, but if the records in the dataframe pneumonia_ARVI than in the dataframe Result_data , then the preference would have the dates specified in the dataset Result_data .我需要一个日期匹配的连接，但是如果数据帧pneumonia_ARVI Result_data的记录比数据帧Result_data的记录多，那么首选项将具有数据集Result_data指定的日期。 And the data that is missing in the dataset pneumonia_ARVI replaced with empty values.并将数据集pneumonia_ARVI缺失的数据替换为空值。

I have tried doing我试过做

Result_data = Result_data.set_index('date')
pneumonia_ARVI = pneumonia_ARVI.set_index('date')
End = pd.merge(Result_data, pneumonia_ARVI, left_index=True, right_index=True)

But this led to the fact that the data was adjusted to each other, and the field infected_city do not leave all their original values by date.但这导致数据相互调整，并且字段infected_city并没有按日期保留所有原始值。

How to combine this data correctly so that there are no problems with reducing the total number of dates?如何正确组合这些数据，以便减少日期总数没有问题？

Answer 1

Use join :使用join ：

#convert to datetime if needed
Result_data["date"] = pd.to_datetime(Result_data["date"])
pneumonia_ARVI["date"] = pd.to_datetime(pneumonia_ARVI["date"])

#set index as you have done
Result_data = Result_data.set_index('date')
pneumonia_ARVI = pneumonia_ARVI.set_index('date')

#perform a left join
End = Result_data.join(pneumonia_ARVI, how="left")

如何按索引组合两个数据帧（熊猫）

问题描述

1 个解决方案

解决方案1
1 2021-11-04 17:20:18

如何按索引组合两个数据帧（熊猫）

问题描述

1 个解决方案

解决方案1 1 2021-11-04 17:20:18

解决方案1
1 2021-11-04 17:20:18