Cumulative count for consecutive rows of a specific value in a pandas DataFrame column

Question

I have this dataframe and want to add another column that cumcounts until it doesn't equal the star symbol * , and then continue again from 1 when the star symbol reappears.

    Star
0   *
1   *
2   *
3   *
4   s
5   s
6   *
7   *

Output Expect:

    Star  Number
0   *     1
1   *     2
2   *     3
3   *     4
4   s     NaN
5   s     NaN
6   *     1
7   *     2

Answer 1

This is a simple groupby and masking operation.

m = df.Star.ne('*')
# Big thanks to @W-B for the bug fix!
df['Number'] = df[~m].groupby(m.cumsum()).cumcount().add(1)


df
  Star  Number
0    *     1.0
1    *     2.0
2    *     3.0
3    *     4.0
4    s     NaN
5    s     NaN
6    *     1.0
7    *     2.0

Answer 2

From itertools groupby

import itertools
df['New']=sum([list(range(len(list(y)))) for _ , y in itertools.groupby(df.Star.tolist())],[])
df.loc[df.Star.ne('*'),'New']=np.nan
df.New+=1
df
Out[1152]: 
  Star  New
0    *  1.0
1    *  2.0
2    *  3.0
3    *  4.0
4    s  NaN
5    s  NaN
6    *  1.0
7    *  2.0

Cumulative count for consecutive rows of a specific value in a pandas DataFrame column

Question

2 answers

solution1
5 ACCPTED 2018-12-06 19:31:09

solution2
3 2018-12-06 19:46:49

Cumulative count for consecutive rows of a specific value in a pandas DataFrame column

Question

2 answers

solution1 5 ACCPTED 2018-12-06 19:31:09

solution2 3 2018-12-06 19:46:49

solution1
5 ACCPTED 2018-12-06 19:31:09

solution2
3 2018-12-06 19:46:49