删除包含特定值的行之后的 Pandas DataFrame 行

Question

我正在尝试删除在'Ammend'列中为yes的行之后的所有行

df：

  Ammend
 0  no
 1  yes
 2  no
 3  no
 4  yes
 5  no

所需的输出 df：

  Ammend
 0  no
 1  yes
 3  no
 4  yes

看下面的代码：

df = df.drop(df[df['Amended' == 'yes']], inplace=True)

返回KeyError: False错误消息

我已经使用.index.tolist()和.loc等不同方法尝试了许多不同的变体，但我似乎无法弄清楚。

我也试过截断：

filings_df.truncate(after=filings_df.loc[filings_df['Filings'] == '10-K/A'].index[0], before = filings_df.loc[filings_df['Filings'] == '10-K/A'].index[1])

这将返回：

索引错误：索引 1 超出轴 0 的范围，大小为 1

Answer 1

尝试这个

import pandas as pd
import numpy as np

np.random.seed(525)
df = pd.DataFrame({'Other': np.random.rand(10), 'Ammend': np.random.choice(['yes', 'no'], 10)})
df

      Other Ammend
0  0.750282     no
1  0.379455     no
2  0.766467    yes
3  0.351025     no
4  0.965993     no
5  0.709159     no
6  0.838831    yes
7  0.218321     no
8  0.573360    yes
9  0.738974     no

输出：

df.drop(index=df[df['Ammend'].shift() == 'yes'].index)

      Other Ammend
0  0.750282     no
1  0.379455     no
2  0.766467    yes
4  0.965993     no
5  0.709159     no
6  0.838831    yes
8  0.573360    yes

Answer 2

使用带有shift技巧的pandas.Series.ne一种方法：

s = df["Ammend"]
new_df = df[~s.ne(s.shift()).cumsum().duplicated(keep="first")]
print(new_df)

输出：

  Ammend
0     no
1    yes
2     no
4    yes
5     no

删除包含特定值的行之后的 Pandas DataFrame 行

问题描述

2 个解决方案

解决方案1
0 已采纳 2020-09-11 23:06:39

解决方案2
0 2020-09-12 00:06:49

删除包含特定值的行之后的 Pandas DataFrame 行

问题描述

2 个解决方案

解决方案1 0 已采纳 2020-09-11 23:06:39

解决方案2 0 2020-09-12 00:06:49

解决方案1
0 已采纳 2020-09-11 23:06:39

解决方案2
0 2020-09-12 00:06:49