merge two pandas data frame and skip common columns of right

Question

I am using pandas DataFrame as a lightweight dataset to maintain some status and need to dynamically/continuously merge new DataFrames into existing table. Say I have two datasets as below:

df1:

df2:

I want to merge df2 to df1 (on index), and for columns in common (in this case, it is 'b'), simply discard the common column of df2.

   a  b   c
0  0  1  11
1  2  3  13
2  4  5  15
3  6  7  17
4  8  9  19

My code was checking common part between df1 and df2 by using SET， so that I manually drop common part in df2. I wonder is there any much efficient way to do this?

Answer 1

First identify the columns in df2 not in df1

cols = df2.columns.difference(df1.columns)

Then pd.DataFrame.join

df1.join(df2[cols])

   a  b   c
0  0  1  11
1  2  3  13
2  4  5  15
3  6  7  17
4  8  9  19

Or pd.concat will also work

pd.concat([df1, df2[cols]], axis=1)

   a  b   c
0  0  1  11
1  2  3  13
2  4  5  15
3  6  7  17
4  8  9  19

Answer 2

Pandas merge function will also work wonders. You can do it as:

pd.merge(left=df1, right=df2, how='inner')

   a  b   c
0  0  1  11
1  2  3  13
2  4  5  15
3  6  7  17
4  8  9  19

by eliminating the 'on' attribute of merge function it will consider the columns which are in-common in both of the dataframes.

merge two pandas data frame and skip common columns of right

Question

2 answers

solution1
4 2017-11-15 05:49:39

solution2
4 2017-11-15 07:29:39

merge two pandas data frame and skip common columns of right

Question

2 answers

solution1 4 2017-11-15 05:49:39

solution2 4 2017-11-15 07:29:39

solution1
4 2017-11-15 05:49:39

solution2
4 2017-11-15 07:29:39