Select rows in Pandas where the value in one column is a substring of the value in another column

Question

I have the dataframe below:

>import pandas as pd
>df = pd.DataFrame({'A': ['apple', 'orange', 'grape', 'pear', 'banana'],
                    'B': ['She likes apples', 'I hate oranges', 'This is a random sentence',
                          'This one too', 'Bananas are yellow']})

>print(df)

    A       B
0   apple   She likes apples
1   orange  I hate oranges
2   grape   This is a random sentence
3   pear    This one too
4   banana  Bananas are yellow

I'm trying to fetch all rows where column B contains the value in column A.

Expected result:

    A       B
0   apple   She likes apples
1   orange  I hate oranges
4   banana  Bananas are yellow

I'm able to fetch only one row using:

>df[df['B'].str.contains(df.iloc[0,0])]

    A       B
0   apple   She likes apples

How can I fetch all such rows?

Answer 1

Use DataFrame.apply with axis=1 to compare the two columns row by row: lowercase the value in B so the match is case-insensitive (the values in A are already lowercase in this data), test containment with in, and filter by boolean indexing:

df = df[df.apply(lambda x: x.A in x.B.lower(), axis=1)]

Or a list comprehension solution:

df = df[[a in b.lower() for a, b in zip(df.A, df.B)]]

print (df)
        A                   B
0   apple    She likes apples
1  orange      I hate oranges
4  banana  Bananas are yellow
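
Both snippets above lowercase only column B and assume every cell is a string. As a minimal sketch, the same row-wise test can be written to lowercase both sides and to treat missing values as non-matches. The helper name contains_ignore_case is hypothetical and not part of pandas, and whether a missing value should count as "no match" is an assumption about the desired behavior.

```python
import pandas as pd

df = pd.DataFrame({
    'A': ['apple', 'orange', 'grape', 'pear', 'banana'],
    'B': ['She likes apples', 'I hate oranges', 'This is a random sentence',
          'This one too', 'Bananas are yellow'],
})

def contains_ignore_case(needle, haystack):
    # Hypothetical helper: non-string cells (for example NaN) count as "no match".
    if not isinstance(needle, str) or not isinstance(haystack, str):
        return False
    # Lowercase both sides so the comparison ignores case in either column.
    return needle.lower() in haystack.lower()

# Build a boolean mask row by row, then filter with boolean indexing.
mask = [contains_ignore_case(a, b) for a, b in zip(df['A'], df['B'])]
result = df[mask]
print(result)
```

Keeping the mask in its own variable makes it easy to inspect which rows matched before filtering.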

3 2019-12-06 07:33:12
