按索引合並兩個 pandas 數據幀並替換 Python 中的列值

Question

我有兩個 pandas 數據幀：

DF1

index = np.arange('2020-01-01 00:00', '2020-01-01 00:04', dtype='datetime64[m]')
df = np.random.randint(100,500, size=(4,4))
columns =['Open','High','Low','Close']
df = pd.DataFrame(df, index=index, columns = columns)
df.index.name = 'Time'

                     Open  High  Low  Close
Time                                       
2020-01-01 00:00:00   266   397  177    475
2020-01-01 00:01:00   362   135  456    235
2020-01-01 00:02:00   315   298  296    493
2020-01-01 00:03:00   324   411  198    101

DF2

index = np.arange('2020-01-01 00:02', '2020-01-01 00:05', dtype='datetime64[m]')
df2 = np.random.randint(100,500, size=(3,4))
columns =['Open','High','Low','Close']
df2 = pd.DataFrame(df2, index=index, columns = columns)
df2.index.name = 'Time'

                     Open  High  Low  Close
Time                                       
2020-01-01 00:02:00   430   394  131    490
2020-01-01 00:03:00   190   211  394    359
2020-01-01 00:04:00   192   291  143    350

我需要按索引（時間）合並兩個數據幀，並將 DF1 的列值替換為 DF2 的列值。

這是我預期的 output：

                     Open  High  Low  Close
Time                                       
2020-01-01 00:00:00   266   397  177    475 ->>>> Correspond to DF1
2020-01-01 00:01:00   362   135  456    235 ->>>> Correspond to DF1
2020-01-01 00:02:00   430   394  131    490 ->>>> Correspond to DF2
2020-01-01 00:03:00   190   211  394    359 ->>>> Correspond to DF2
2020-01-01 00:04:00   192   291  143    350 ->>>> Correspond to DF2

我嘗試了幾個函數，包括合並或 concat (concat([df1, df2], join="inner")) 但沒有成功。 任何幫助將不勝感激。 謝謝！

Answer 1

嘗試這個：

df2.combine_first(df)

                     Open  High  Low  Close
Time                                       
2020-01-01 00:00:00   266   397  177    475
2020-01-01 00:01:00   362   135  456    235
2020-01-01 00:02:00   430   394  131    490
2020-01-01 00:03:00   190   211  394    359
2020-01-01 00:04:00   192   291  143    350

因為您提到pd.concat ，所以您可以這樣做。

out = pd.concat([df, df2])
out = out[~out.index.duplicated(keep='last')]
print(out)

                     Open  High  Low  Close
Time                                       
2020-01-01 00:00:00   266   397  177    475
2020-01-01 00:01:00   362   135  456    235
2020-01-01 00:02:00   430   394  131    490
2020-01-01 00:03:00   190   211  394    359
2020-01-01 00:04:00   192   291  143    350

按索引合並兩個 pandas 數據幀並替換 Python 中的列值

問題描述

1 個解決方案

解決方案1
2 已采納 2022-08-30 14:46:09

按索引合並兩個 pandas 數據幀並替換 Python 中的列值

問題描述

1 個解決方案

解決方案1 2 已采納 2022-08-30 14:46:09

解決方案1
2 已采納 2022-08-30 14:46:09