Python Pandas：如何使用 df.drop 和 df.loc 删除行？

Question

Suppose I have the following dataframe:假设我有以下 dataframe：

import numpy as np
import pandas as pd

df = pd.DataFrame(
    {
        'user': ['Adam', 'Barry', 'Cindy', 'Dirk', 'Ella'],
        'income': [50000, 0, 100000, 30000, 0],
        'net worth': [250000, 1000000, 2000000, 50000, 0]
    }
)

So far, I've been removing rows based on conditions using the following:到目前为止，我一直在使用以下条件根据条件删除行：

df2 = df[df.income != 0]

And using multiple conditions like so:并像这样使用多个条件：

df3 = df[(df['income'] != 0) & (df['net worth'] > 100000)]

Question: Is this the preferred way to drop rows?问题：这是删除行的首选方式吗？ If not, what is?如果不是，那是什么？ Is it possible to do this via df.drop and df.loc ?是否可以通过df.drop和df.loc做到这一点？ What would the syntax be?语法是什么？

Answer 1

.loc creates a subset of the rows you want to keep rather than .drop filter rows you want to remove. .loc创建您要保留的行的子集，而不是.drop过滤您要删除的行。 drop need the row label (index name). drop需要行 label（索引名称）。

The equivalent of your last filter with drop is:最后一个带drop的过滤器的等价物是：

>>> df.drop(df[~((df['income'] != 0) & (df['net worth'] > 100000))].index)

    user  income  net worth
0   Adam   50000     250000
2  Cindy  100000    2000000

# OR a bit smart:
>>> df.drop(df[(df['income'] == 0) | (df['net worth'] <= 100000)].index)

    user  income  net worth
0   Adam   50000     250000
2  Cindy  100000    2000000

Which syntax do you prefer?您更喜欢哪种语法？

Python Pandas：如何使用 df.drop 和 df.loc 删除行？

问题描述

1 个解决方案

解决方案1
3 已采纳 2022-02-08 13:23:56

Python Pandas：如何使用 df.drop 和 df.loc 删除行？

问题描述

1 个解决方案

解决方案1 3 已采纳 2022-02-08 13:23:56

解决方案1
3 已采纳 2022-02-08 13:23:56