简体   繁体   中英

Pandas merge two dataframes with one to many relationship

I am trying to merge two pandas DataFrames with one of many relationship.

import pandas as pd

df1 = pd.DataFrame({'name': ['AA', 'BB', 'CC'],
                    'col1': [1, 2, 3],
                    'col2': [1, 2, 3] })

df2 = pd.DataFrame({'name': ['AA', 'AA', 'BB'],
                    'col1': [1, 2, 3],
                    'col2': [1, 2, 3] })

df_merged = pd.merge(
                     df1, 
                     df2, 
                     left_on = 'name',
                     right_on = 'name',
                     how = "inner"
                    )

使用左连接并删除重复项

df1.merge(df2, how='left', on='name').drop_duplicates(subset='name',keep='first')

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM