从每个客户 ID 中识别零值，然后从前一行下一行 Pandas 中获取值

Question

Input:输入：

df = pd.DataFrame([[101, 1, 'reg'],
               [101, 1, '1098'],
               [101, 0, 'Reg'],
               [102, 1, 'Paymode'],
               [102, 0, 'Reg'],
               [103, 1, 'reg'],
               [103, 0.0, 'reg'],
               [103, 0.0, 'reg']
              ]
              , columns=['cus_ID', 'Paperlessmode', 'types of paper'])

output:输出：

df=pd.DataFrame([[101, 1, 'reg','1098'],
               [101, 1, '1098','1098'],
               [101, 0, 'Reg','1098'],
               [102, 1, 'Paymode','Paymode'],
               [102, 0, 'Reg','Paymode'],
               [103, 1, 'reg','reg'],
               [103, 0.0, 'reg','reg'],
               [103, 0.0, 'reg','reg']
              ]
              , columns=['cus_ID', 'Paperlessmode', 'types of paper','last occurance_paper'])

I want to identify the types of paper which is presence before zero in Paperlessmode for each customer id in Python 3.6我想为 Python 3.6 中的每个客户 ID 识别在 Paperlessmode 中出现在零之前的纸张类型

Answer 1

You can use Series.map by shifted values with cumulative sum and compared by 1 :您可以使用Series.map通过具有累积总和的移位值并按1进行比较：

s = df[df['Paperlessmode'].eq(0).groupby(df['cus_ID']).transform(lambda x: x.shift(-1).cumsum().eq(1))].set_index('cus_ID')['types of paper']
df['last occurance_paper'] = df['cus_ID'].map(s)
print (df)
   cus_ID  Paperlessmode types of paper last occurance_paper
0     101            1.0            reg                 1098
1     101            1.0           1098                 1098
2     101            0.0            Reg                 1098
3     102            1.0        Paymode              Paymode
4     102            0.0            Reg              Paymode
5     103            1.0            reg                  reg
6     103            0.0            reg                  reg
7     103            0.0            reg                  reg

Alternative:选择：

d = df[df['Paperlessmode'].eq(0).groupby(df['cus_ID']).shift(-1, fill_value=False)].set_index('cus_ID')['types of paper'].to_dict()
df['last occurance_paper'] = df['cus_ID'].map(d)

从每个客户 ID 中识别零值，然后从前一行下一行 Pandas 中获取值

问题描述

1 个解决方案

解决方案1
2 已采纳 2020-11-19 11:40:51

从每个客户 ID 中识别零值，然后从前一行下一行 Pandas 中获取值

问题描述

1 个解决方案

解决方案1 2 已采纳 2020-11-19 11:40:51

解决方案1
2 已采纳 2020-11-19 11:40:51