Pandas: groupby then count based on condition gives NaN

Question

I have the following dataset:

+----+------+
| ID | Type |
+----+------+
| a  | New  |
+----+------+
| b  | Old  |
+----+------+
| b  | Old  |
+----+------+
| b  | New  |
+----+------+
| c  | Old  |
+----+------+

I'm trying to group by ID, and then count the number of New occurences for each group. So for example I would have a=1 , b=2 , and c=0 .

Here's what I've tried:

df['NewAmount'] = df.groupby('ID')['Type'].apply(
    lambda x: x[x == 'New'].count())

And I get this:

+----+------+----------+
| ID | Type | NewAmount|
+----+------+----------+
| a  | New  | NaN      |
+----+------+----------+
| b  | Old  | NaN      |
+----+------+----------+
| b  | Old  | NaN      |
+----+------+----------+
| b  | New  | NaN      |
+----+------+----------+
| c  | Old  | NaN      |
+----+------+----------+

Answer 1

You should try with transform

df['out'] = df['Type'].eq('New').groupby(df['ID']).transform('sum')

Pandas: groupby then count based on condition gives NaN

Question

1 answers

solution1
1 ACCPTED 2021-03-30 14:19:13

Pandas: groupby then count based on condition gives NaN

Question

1 answers

solution1 1 ACCPTED 2021-03-30 14:19:13

solution1
1 ACCPTED 2021-03-30 14:19:13