Enumerate values in a column

Minimal reproducible example:

import numpy as np
import pandas as pd

# Six deliveries of a single over, all with the same length
df = pd.DataFrame({'event_name': ['fulham','fulham','fulham','fulham','fulham','fulham'],
                      'batfast_id': ['bfs1', 'bfs1', 'bfs1', 'bfs1', 'bfs1', 'bfs1'],
                      'session_no': [1,1,1,1,1,1],
                      'overs': [0,0,0,0,0,0],
                      'deliveries_faced': [0,1,2,3,4,5],
                      'delivery_type': ['Extra Slow Leg Spin','Extra Slow Leg Spin','Slow Straight','Extra Slow Off Spin','Extra Slow Leg Spin','Extra Slow Leg Spin'],
                      'length': ['Yorker','Yorker','Yorker','Yorker','Yorker','Yorker']}, columns=['event_name', 'batfast_id','session_no','overs', 'deliveries_faced','delivery_type','length'])
# Move the identifying columns into a MultiIndex
df = df.set_index(['event_name', 'batfast_id','session_no','overs', 'deliveries_faced'],drop=True)
print(df)

I then produce a length/type column that combines length and delivery_type, using this code:

conditions = [
    (df['delivery_type'] == 'Extra Slow Off Spin') & (df['length'] == 'Yorker'),
    (df['delivery_type'] == 'Extra Slow Leg Spin') & (df['length'] == 'Yorker'),
    (df['delivery_type'] == 'Slow Straight') & (df['length'] == 'Yorker'),
    ]

values = ['ES_OS_Y', 'ES_LS_Y','S_S_Y']

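# An explicit string default avoids mixing the implicit integer default 0 with the string choices
df['length/type'] = np.select(conditions, values, default='')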
print(df)
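For readers unfamiliar with np.select, here is a minimal sketch of how it behaves: each row takes the value paired with the first condition that is true, and rows matching no condition take the default. The 'Fast Seam' row and the 'UNKNOWN' label are made up for illustration and are not part of the question's data.

```python
import numpy as np
import pandas as pd

# Hypothetical sample: the last value matches none of the conditions
s = pd.Series(['Extra Slow Leg Spin', 'Slow Straight', 'Fast Seam'])

conditions = [
    s == 'Extra Slow Leg Spin',
    s == 'Slow Straight',
]
values = ['ES_LS', 'S_S']

# Rows that match no condition fall back to the default
codes = np.select(conditions, values, default='UNKNOWN')
print(codes.tolist())
```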

The problem is that I want to number the deliveries within each over from 0 to 5 and append that number to the code, so that the result looks like this:

                                                               delivery_type  length length/type
event_name batfast_id session_no overs deliveries_faced
fulham     bfs1       1          0     0                 Extra Slow Leg Spin  Yorker   ES_LS_Y_0
                                       1                 Extra Slow Leg Spin  Yorker   ES_LS_Y_1
                                       2                       Slow Straight  Yorker     S_S_Y_2
                                       3                 Extra Slow Off Spin  Yorker   ES_OS_Y_3
                                       4                 Extra Slow Leg Spin  Yorker   ES_LS_Y_4
                                       5                 Extra Slow Leg Spin  Yorker   ES_LS_Y_5

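Try grouping by every index level except the last one (deliveries_faced) and appending each row's position within its group, which cumcount returns starting at 0: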

df['length/type'] = df['length/type'] + '_' \
                    + df.groupby(df.index.names[:-1]).cumcount().astype(str)
print(df)

# Output:
                                                               delivery_type  length length/type
event_name batfast_id session_no overs deliveries_faced                                         
fulham     bfs1       1          0     0                 Extra Slow Leg Spin  Yorker   ES_LS_Y_0
                                       1                 Extra Slow Leg Spin  Yorker   ES_LS_Y_1
                                       2                       Slow Straight  Yorker     S_S_Y_2
                                       3                 Extra Slow Off Spin  Yorker   ES_OS_Y_3
                                       4                 Extra Slow Leg Spin  Yorker   ES_LS_Y_4
                                       5                 Extra Slow Leg Spin  Yorker   ES_LS_Y_5
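The sample data contains only one over, so it does not show the counter restarting. The sketch below uses a made-up frame with two overs, and assumes deliveries_faced keeps counting across overs (6, 7 in the second over), to illustrate that cumcount numbers rows within each group rather than reusing the index value. The frame name demo and the column name numbered are hypothetical.

```python
import pandas as pd

# Hypothetical data: two overs with two deliveries each
idx = pd.MultiIndex.from_tuples(
    [
        ('fulham', 'bfs1', 1, 0, 0),
        ('fulham', 'bfs1', 1, 0, 1),
        ('fulham', 'bfs1', 1, 1, 6),
        ('fulham', 'bfs1', 1, 1, 7),
    ],
    names=['event_name', 'batfast_id', 'session_no', 'overs', 'deliveries_faced'],
)
demo = pd.DataFrame(
    {'length/type': ['ES_LS_Y', 'S_S_Y', 'ES_OS_Y', 'ES_LS_Y']},
    index=idx,
)

# Group by every index level except the last, then number rows within each group
group_levels = list(demo.index.names[:-1])
counter = demo.groupby(level=group_levels).cumcount()

demo['numbered'] = demo['length/type'] + '_' + counter.astype(str)
print(demo)
```

With this data the suffixes run 0, 1 in the first over and start again at 0, 1 in the second, even though deliveries_faced reads 6 and 7 there.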
