
Pandas: drop rows in one dataframe that share a common column value with rows in another dataframe

I have a data frame called x1:

 FID  g1    g2    g3
  0   19    20    13
  1   16    11    14
  2   15    20    11

and a data frame called x2:

 FID  g1   
  1   16  

I want to alter x1 so that it excludes the row whose FID appears in x2:

 FID  g1    g2    g3
  0   19    20    13
  2   15    20    11

I've tried:

x1 = pd.concat([x1,x2]).drop_duplicates(keep=False)

but I think this only works if the data frames have matching schemas, since drop_duplicates only removes rows that are identical across every column. Can I instead just keep the rows in x1 whose FID values don't appear in x2?

You can use pd.Series.isin to create a boolean series indicating which values in the FID column of x1 appear in the FID column of x2.

Then use pd.DataFrame.loc with the ~ operator to invert that boolean series and select the rows of x1 whose FID values don't appear in the FID column of x2:

# True where a row's FID also appears in x2
cond = x1.FID.isin(x2.FID)
# invert the mask to keep only the rows whose FID is absent from x2
x1.loc[~cond]
# output:
   FID  g1  g2  g3
0    0  19  20  13
2    2  15  20  11
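If the frames are larger, an anti-join via merge expresses the same filter; a minimal sketch, assuming only the FID column should decide which rows match:

# left-join on FID and keep only the rows of x1 that found no match in x2
merged = x1.merge(x2[['FID']], on='FID', how='left', indicator=True)
merged.loc[merged['_merge'] == 'left_only', x1.columns]

This returns the same two rows as the isin approach.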
x1.drop(x2['FID'])

output

    FID  g1  g2  g3
 0    0  19  20  13
 2    2  15  20  11
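Note that x1.drop(x2['FID']) drops by index label, so it only gives this result because x1's default integer index happens to coincide with its FID values. A sketch of a more explicit variant, assuming the FID values are unique:

# move FID into the index, drop the labels listed in x2, then restore the column
x1.set_index('FID').drop(x2['FID']).reset_index()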
