如何删除熊猫数据框中的唯一行

Question

index                                            SUBJECT
1                                                   test
2                                                  Hello
3                                                  Hello
4                               PRC review - phone calls

AFTER REMOVING 拆卸后

index                                            SUBJECT
2                                                  Hello
3                                                  Hello

I want to delete rows based on only the "SUBJECT" column. 我只想删除基于“ SUBJECT”列的行。 How to do this? 这个怎么做？

Answer 1

Use duplicated 使用duplicated

Ex: 例如：

import pandas as pd

df = pd.DataFrame({"SUBJECT": ["test", "Hello", "Hello", "PRC review - phone calls"]})
df = df[df.duplicated(subset=["SUBJECT"], keep=False)]
print(df)

Output: 输出：

  SUBJECT
1   Hello
2   Hello

Answer 2

You could do: 您可以这样做：

# get count for each value
s = df.SUBJECT.value_counts()

# get only those that appear more than once
repeated = set(s[s > 1].index.values)

# filter the data-frame base
result = df[df.SUBJECT.isin(repeated)]

print(result)

Output 输出量

   index SUBJECT
1      2   Hello
2      3   Hello

Answer 3

检查一下：

df.loc[(df.groupby('SUBJECT').count()>1).sum(axis=1),:]

Answer 4

Solution 1: 解决方案1：

using loc.. 使用loc ..

>>> df.loc[df.duplicated(keep=False), :]
  SUBJECT
1   Hello
2   Hello

Solution 2: 解决方案2：

Another way with groupby + transform .. groupby + 转换的另一种方法..

>>> df[df.groupby('SUBJECT')['SUBJECT'].transform('size') > 1]
  SUBJECT
1   Hello
2   Hello

如何删除熊猫数据框中的唯一行

问题描述

4 个解决方案

解决方案1
4 2019-02-13 14:23:45

解决方案2
1 2019-02-13 14:25:34

解决方案3
1 已采纳 2019-02-13 14:27:02

解决方案4
1 2019-02-13 14:56:55

Solution 1: 解决方案1：

Solution 2: 解决方案2：

如何删除熊猫数据框中的唯一行

问题描述

4 个解决方案

解决方案1 4 2019-02-13 14:23:45

解决方案2 1 2019-02-13 14:25:34

解决方案3 1 已采纳 2019-02-13 14:27:02

解决方案4 1 2019-02-13 14:56:55

Solution 1: 解决方案1：

Solution 2: 解决方案2：

解决方案1
4 2019-02-13 14:23:45

解决方案2
1 2019-02-13 14:25:34

解决方案3
1 已采纳 2019-02-13 14:27:02

解决方案4
1 2019-02-13 14:56:55