Check if one column value is in another column and create column to indicate in Pandas

Question

I want to check if A in Table1 is found in A1 in Table2 and create an indicator column in Table1 to show this result.

Table1 :

A           B       C       G
Harry       pig     NY      Year1
Kasey       cat     AL      Year2
Ron         b       AK      Year1
Amy         axe     KY      Year4
...

Table2 :

A1          Value       
John        X     
Mark        Y   
Harry       Z       
Ron         Z  
...

Expected output:

A           B       C       G        exists
Harry       pig     NY      Year1    1
Kasey       cat     AL      Year2    0
Ron         b       AK      Year1    1
Amy         axe     KY      Year4    0
...

Answer 1

Here is a solution you can try out, using np.where

import numpy as np

df1['exists'] = np.where(df1['A'].isin(df2['A1'].unique()), 1, 0)

       A    B   C      G  exists
0  Harry  pig  NY  Year1       1
1  Kasey  cat  AL  Year2       0
2    Ron    b  AK  Year1       1
3    Amy  axe  KY  Year4       0

Answer 2

Another option is merge :

out = pd.merge(df1, df2['A1'].drop_duplicates(),
               left_on='A', right_on='A1', how='left') \
    .rename(columns={'A1': 'exists'})

out['exists'] = out['exists'].notnull().astype(int)

print(out)

Output:

       A    B   C      G  exists
0  Harry  pig  NY  Year1       1
1  Kasey  cat  AL  Year2       0
2    Ron    b  AK  Year1       1
3    Amy  axe  KY  Year4       0

Check if one column value is in another column and create column to indicate in Pandas

Question

2 answers

solution1
2 ACCPTED 2021-05-06 02:25:11

solution2
1 2021-05-06 02:29:03

Check if one column value is in another column and create column to indicate in Pandas

Question

2 answers

solution1 2 ACCPTED 2021-05-06 02:25:11

solution2 1 2021-05-06 02:29:03

solution1
2 ACCPTED 2021-05-06 02:25:11

solution2
1 2021-05-06 02:29:03