如何在仅某些列等于上一行时删除下一个熊猫数据框行

Question

I have created a dataframe called df with this code:我使用以下代码创建了一个名为df的数据框：

# initialize list of lists
data = {'ID': [1,2,3,4,5,6,7],
        'feature1': [100,32,100,100,100,93,100],
        'feature2': [100,32,100,100,100,93,100],
        'feature3': [100,32,100,100,100,93,100],
        }
 
# Create DataFrame
df = pd.DataFrame(data)

The dataframe looks like this:数据框如下所示：

print(df)

   ID  feature1  feature2  feature3
0   1       100       100       100
1   2        32        32        32
2   3       100       100       100
3   4       100       100       100
4   5       100       100       100
5   6        93        93        93
6   7       100       100       100

I want to remove the rows in which the values of columns:我想删除列值所在的行：

feature1 and feature1和
feature2 and feature2和
feature3 are exactly the same as the previous row. feature3与上一行完全相同。 In the example above, I need to remove rows 3 and 4 , so that the resulting dataframe will look like this:在上面的示例中，我需要删除3行和4行，以便生成的数据框如下所示：

Answer 1

Filter the feature like columns then calculate difference between previous and current row and check whether the difference is 0 for all the feature columns Filter feature列，然后计算前一行和当前行之间的差异，并检查所有feature列的差异是否为0

df[~df.filter(like='feature').diff().eq(0).all(1)]

   ID  feature1  feature2  feature3
0   1       100       100       100
1   2        32        32        32
2   3       100       100       100
5   6        93        93        93
6   7       100       100       100

如何在仅某些列等于上一行时删除下一个熊猫数据框行

问题描述

1 个解决方案

解决方案1
1 已采纳 2022-06-11 16:20:57

如何在仅某些列等于上一行时删除下一个熊猫数据框行

问题描述

1 个解决方案

解决方案1 1 已采纳 2022-06-11 16:20:57

解决方案1
1 已采纳 2022-06-11 16:20:57