簡體   English   中英

通過 cols 合並/連接兩個 dataframe

[英]Merge/concat two dataframe by cols

我有兩個數據框:

import pandas as pd
from numpy import nan
df1 = pd.DataFrame({'key':[1,2,3,4],
                    'only_at_df1':['a','b','c','d'],
                    'col2':['e','f','g','h'],})

df2 = pd.DataFrame({'key':[1,9],
                    'only_at_df2':[nan,'x'],
                    'col2':['e','z'],})

如何獲得這個:

df3 = pd.DataFrame({'key':[1,2,3,4,9],
                    'only_at_df1':['a','b','c','d',nan],
                    'only_at_df2':[nan,nan,nan,nan,'x'],
                    'col2':['e','f','g','h','z'],})

任何幫助表示贊賞。

最好的方法可能是在臨時將“key”設置為索引后使用combine_first

df1.set_index('key').combine_first(df2.set_index('key')).reset_index()

output:

   key col2 only_at_df1 only_at_df2
0    1    e           a         NaN
1    2    f           b         NaN
2    3    g           c         NaN
3    4    h           d         NaN
4    9    z         NaN           x

這似乎是mergehow="outer"的直接使用:

df1.merge(df2, how="outer")

Output:

   key only_at_df1 col2 only_at_df2
0    1           a    e         NaN
1    2           b    f         NaN
2    3           c    g         NaN
3    4           d    h         NaN
4    9         NaN    z           x

暫無
暫無

聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

 
粵ICP備18138465號  © 2020-2024 STACKOOM.COM