简体繁体中英

How to delete duplicates, but keep the first instance and a blank cell for the duplicates in Pandas?

原文 2016-09-27 15:41:46 8 1 python/ pandas/ dataframe

I have a pandas DataFrame, and I'm doing a groupby(['target']).count(). This works fine. However, one of the things I want, for each group, is the number of unique elements in the ID column.

What I'd like to do is, for the ID column, null out all but the first copy of any ID value (IDs are unique to groups, so I don't have to worry about that issue). Then, the groupby().count() will give me the number of unique IDs in each group... But I'm not sure how to do that.

1 answers

The DataFrame.duplicated() method is applicable here if you want to do it the way you described. It can return a Series with the first occurrence of an ID being False and the rest being True. You can then use this as a mask to set the duplicated IDs to null.

See: http://pandas.pydata.org/pandas-docs/stable/generated/pandas.DataFrame.duplicated.html

How to keep first two duplicates in a pandas dataframe?

Pandas: delete consecutive duplicates but keep the first and last value

How to delete duplicates pandas

How do I drop duplicates and keep the first value on pandas?

pandas: how to select first or last by column in keep with drop_duplicates

How to drop duplicates in pandas but keep more than the first

Pandas - Opposite of drop duplicates, keep first

Keep first occurrence while removing duplicates in pandas

how to delete duplicates from a cell of a column in csv

how to drop duplicates but keep first in pyspark dataframe?

暂无

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

Related Question How to keep first two duplicates in a pandas dataframe? Pandas: delete consecutive duplicates but keep the first and last value How to delete duplicates pandas How do I drop duplicates and keep the first value on pandas? pandas: how to select first or last by column in keep with drop_duplicates How to drop duplicates in pandas but keep more than the first Pandas - Opposite of drop duplicates, keep first Keep first occurrence while removing duplicates in pandas how to delete duplicates from a cell of a column in csv how to drop duplicates but keep first in pyspark dataframe?

Related Tags

How to delete duplicates, but keep the first instance and a blank cell for the duplicates in Pandas?

Question

1 answers

solution1 0 2016-09-27 19:23:57

solution1
0 2016-09-27 19:23:57