Pandas DataFrame - delete rows that have same value at a particular column as a previous row

Question

I have a pandas DataFrame. For each row, I want to check whether it has the same value in a particular column (let's call it product_type) as the previous row, and if it does, delete it. In other words, out of a group of consecutive rows with the same value in that column, I want to keep only one.

For example, if column A is the one in which we don't want consecutive duplicates:

Answer 1

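It's a little tricky, but you could compare each value in A with the one in the previous row, use the cumulative sum of those "changed" flags to label each run of consecutive equal values, and then keep the first row of each run. Something like: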

>>> df.groupby((df["A"] != df["A"].shift()).cumsum().values).first()
   A   B    C
1  0   1    1
2  2   1   10
3  0  11  100
4  5   2  200
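Since the question's sample data isn't shown, here is a self-contained sketch of the same idea. The input frame below is hypothetical (made up so that the result matches the output above), and the boolean-mask variant at the end is an alternative I'm adding, not part of the original answer.

```python
import pandas as pd

# Hypothetical sample data; the asker's original frame is not shown.
df = pd.DataFrame({
    "A": [0, 0, 2, 2, 2, 0, 5, 5],
    "B": [1, 3, 1, 4, 6, 11, 2, 7],
    "C": [1, 2, 10, 20, 30, 100, 200, 300],
})

# True wherever A differs from the previous row. The first row is compared
# against NaN, so it always counts as the start of a new run.
starts = df["A"] != df["A"].shift()

# Approach from the answer: the running sum of the flags gives every run of
# consecutive equal values its own label (1, 2, 3, ...); keep one row per label.
run_id = starts.cumsum()
by_group = df.groupby(run_id.values).first()

# Alternative (assumption, not from the answer): keep only the rows that
# start a run. This preserves the original index labels.
by_mask = df[starts]

print(by_group)
print(by_mask)
```

One difference to be aware of: groupby(...).first() returns the first non-null value per column within each run, so if the frame contains NaNs the result may not be a literal copy of each run's first row. The mask version always keeps the actual first row.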

solution1
4 ACCEPTED 2014-07-24 21:52:15