Drop duplicates in one column based on duplicates in another column, keeping the other column's duplicates, in pandas

Question

I want to keep the duplicate rows in the name column but drop the duplicate values in the Count column, so that Count is kept only on the first occurrence of each name.

Here is an example df:

Count	name
yes	jhon
yes	marry
yes	marry
yes	ishita
yes	ishita
yes	ishita

The result I want is:

Count	name
yes	jhon
yes	marry
	marry
yes	ishita
	ishita
	ishita

#pandas #python

Answer 1

The logic is:

groupby() on name, then cumcount() to number each occurrence of a name within its group
For the 0th occurrence keep Count; otherwise set it to NaN
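To make the two steps above concrete, here is a minimal sketch of the intermediate values. It assumes the example frame from the question, rebuilt here with a plain dict; the variable names occurrence and is_first are only illustrative.

```python
import pandas as pd

# Example frame from the question, rebuilt by hand
df = pd.DataFrame({
    "Count": ["yes"] * 6,
    "name": ["jhon", "marry", "marry", "ishita", "ishita", "ishita"],
})

# Position of each row within its name group, starting at 0
occurrence = df.groupby("name").cumcount()
print(occurrence.tolist())  # [0, 0, 1, 0, 1, 2]

# True only for the first row of each name
is_first = occurrence == 0
print(is_first.tolist())  # [True, True, False, True, False, False]
```

The boolean mask is what decides which Count values survive.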

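import io

import numpy as np
import pandas as pd

# Build the example frame; the columns are separated by whitespace
df = pd.read_csv(io.StringIO("""Count   name
yes jhon
yes marry
yes marry
yes ishita
yes ishita
yes ishita"""), sep=r"\s+")

# Keep Count on the first occurrence of each name, set it to NaN on the repeats
df["Count"] = np.where(df.groupby("name").cumcount() == 0, df["Count"], np.nan)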

	Count	name
0	yes	jhon
1	yes	marry
2	nan	marry
3	yes	ishita
4	nan	ishita
5	nan	ishita
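If blank cells are preferred over NaN, as in the desired output in the question, one possible variation is sketched below. It is not part of the original answer: it swaps groupby()/cumcount() for Series.duplicated(), which flags the same rows here, and the empty-string placeholder is an assumption.

```python
import pandas as pd

# Example frame from the question, rebuilt by hand
df = pd.DataFrame({
    "Count": ["yes"] * 6,
    "name": ["jhon", "marry", "marry", "ishita", "ishita", "ishita"],
})

# duplicated() is True for every repeat of a name after its first occurrence
repeats = df["name"].duplicated()

# mask() replaces Count where the flag is True; "" is an assumed placeholder
result = df.assign(Count=df["Count"].mask(repeats, ""))
print(result)
```

Using assign() leaves the original df untouched, which may be convenient if the full Count column is still needed later.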

Solution 1
0 Accepted 2021-03-05 07:59:54
