使用条件的子集 df - pandas

Question

I'm aiming to subset a df using two conditions.我的目标是使用两个条件对 df 进行子集化。 Those being, return rows only when string values in L2 are after by a value in L1 and are followed by a value in L1 .这些之中，返回行仅在字符串值L2是通过在值之后L1 ，并且随后在值L1 。

df = pd.DataFrame({  
    'col_1' : ['a','m','x','b','n','c','c','o','y','a','m','c'],                             
    })


L1 = ['a','b','c']

L2 = ['m','n','o']

L3 = ['x','y','z']

m1 = df['col_1'].isin(L1) & df['col_1'].shift(-1).isin(L2)
m2 = df['col_1'].isin(L2) & df['col_1'].shift().isin(L1)

df = df[m1 | m2 ].reset_index(drop = True)

intended output:预期输出：

   col_1
4      n
10     m

Answer 1

you can try:你可以试试：

df[(df['col_1'].isin(L1).shift() & df['col_1'].isin(L2)) & \
    (df['col_1'].isin(L1).shift(-1) & df['col_1'].isin(L2))]

Output:输出：

   col_1
4      n
10     m

使用条件的子集 df - pandas

问题描述

1 个解决方案

解决方案1
3 已采纳 2021-06-22 10:57:32

使用条件的子集 df - pandas

问题描述

1 个解决方案

解决方案1 3 已采纳 2021-06-22 10:57:32

解决方案1
3 已采纳 2021-06-22 10:57:32