Aggregate Function to dataframe while retaining rows in Pandas

Question

I want to aggregate my data based off a field known as COLLISION_ID and a count of each COLLISION_ID .

I want to remove repeating COLLISION_IDs since they have the same Coordinates, but retain a count of occurrences in original data-set.

My code is below

df2 = df1.groupby(['COLLISION_ID'])[['COLLISION_ID']].count()

This returns such:

I would like my data returned as the COLLISION_ID numbers, the count, and the remaining columns of my data which are not shown here(~40 additional columns that will be filtered later)

Answer 1

If you are talking about filter, we should do transform

df1['count_col']=df1.groupby(['COLLISION_ID'])['COLLISION_ID'].transform('count')

Then you can filter the df1 with column count

Aggregate Function to dataframe while retaining rows in Pandas

Question

1 answers

solution1
1 ACCPTED 2020-04-13 21:37:54

Aggregate Function to dataframe while retaining rows in Pandas

Question

1 answers

solution1 1 ACCPTED 2020-04-13 21:37:54

solution1
1 ACCPTED 2020-04-13 21:37:54