从每个客户 ID 的下一列中识别最后一个是值和 0，然后从前一行下一列熊猫中获取值

Question

Input:输入：

df = pd.DataFrame([[101,'y', 1, 'reg'],
                [101,'y', 1, '1098'],
                [101, 'y',0, 'Reg'],
                [101, 'y',0, 'sed'],
                [101,'n',0,'VA'],
                [102, 'y',1, 'Paymode'],
                [102, 'y',0, 'roy'],
                [102, 'n',0, 'Reg'],
                [103, 'y',1, 'reg'],
                [103, 'n',0, 'PCA'],
                [103, 'n',0, 'FXD']
              ]
              , columns=['cus_ID', 'emailflg','Paperlessmode', 'types of paper'])

output:输出：

df= pd.DataFrame([[101,'y', 1, 'reg','1098'],
                [101,'y', 1, '1098','1098'],
                [101, 'y',0, 'Reg','1098'],
                [101, 'y',0, 'sed','1098'],
                [101,'n',0,'VA','1098'],
                [102, 'y',1, 'Paymode','roy'],
                [102, 'y',0, 'roy','roy'],
                [102, 'n',0, 'Reg','roy'],
                [103, 'y',1, 'reg','reg'],
                [103, 'n',0, 'PCA','reg'],
                [103, 'n',0, 'FXD','reg']
              ]
              , columns=['cus_ID','emailflg', 'Paperlessmode', 'types of paper','last occurance_paper'])

condtion 1= df['Emailflg' =='y'] & df['Paperlessmode'].eq(0) & df['types of paper!='Reg','PCA']

if Condition 1 exits then take same row value from type of paper如果条件 1 退出，则从纸张类型中获取相同的行值
OR或者
identify the types of paper which is presence before zero in Paperlessmode and 'n' in Email flag for each customer id in Python 3.6在 Python 3.6 中为每个客户 ID 在 Paperlessmode 和 Email 标志中的“n”之前识别纸张类型
Kindly let me know if any concern and plz don't devote如果有任何问题，请让我知道，请不要投入

Answer 1

I don't really understand how your conditions are done, but for the solution that you should follow, it doesn't rally matter.我真的不明白你的条件是如何完成的，但对于你应该遵循的解决方案，它并不重要。 I have listed several pseudo-conditions which you can adapt to your own real conditions.我列出了几个伪条件，您可以根据自己的实际情况进行调整。 The idea is to list the conditions and choices for what the values of the new column should be.这个想法是列出新列的值应该是什么的条件和选择。 In this example, 4 conditions, 4 potential values.在本例中，4 个条件，4 个潜在值。 Then use np.select in the following way:然后按以下方式使用np.select ：

conditions = [((df['emailflg'] =='y') & (df['Paperlessmode']==0)), 
               ((df['types of paper'] =='reg') & (df['Paperlessmode']==0)),
             ((df['types of paper'] =='Reg') & (df['Paperlessmode']==0)),
              ((df['types of paper'] =='VA') & (df['Paperlessmode']==0))]
choices = ['REG','reg','Paymode', 'VA']


import numpy as np

df['last occurance_paper'] = np.select(conditions, choices, default='XXX')

which gives这使

   cus_ID emailflg  Paperlessmode types of paper last occurance_paper
0     101        y              1            reg                  XXX
1     101        y              1           1098                  XXX
2     101        y              0            Reg                  REG
3     101        n              0             VA                   VA
4     102        y              1        Paymode                  XXX
5     102        n              0            Reg              Paymode
6     103        y              1            reg                  XXX
7     103        n              0            PCA                  XXX
8     103        n              0            FXD                  XXX

Answer 2

I think you need:我认为你需要：

m1 = df['Paperlessmode'].eq(0)
m2 = df['emailflg'].eq('y')
m3 = df['emailflg'].eq('n') 
 
d = df[m1 & m2].set_index('cus_ID')['types of paper'].to_dict()
s = df[(m1 & m3).groupby(df['cus_ID']).transform(lambda x: x.shift(-1).cumsum().eq(1))].set_index('cus_ID')['types of paper']

df['last occurance_paper'] = df['cus_ID'].map(d).fillna(df['cus_ID'].map(s))
print (df)
       cus_ID emailflg  Paperlessmode types of paper last occurance_paper
0     101        y              1            reg                  Reg
1     101        y              1           1098                  Reg
2     101        y              0            Reg                  Reg
3     101        n              0             VA                  Reg
4     102        y              1        Paymode              Paymode
5     102        n              0            Reg              Paymode
6     103        y              1            reg                  reg
7     103        n              0            PCA                  reg
8     103        n              0            FXD                  reg

Answer 3

s = df[df['Paperlessmode'].eq(0).groupby(df['cus_ID']).transform(lambda x: x.shift(-1).cumsum().eq(1))].set_index('cus_ID')['types of paper']
df['last occurance_paper'] = df['cus_ID'].map(s)
print (df)

从每个客户 ID 的下一列中识别最后一个是值和 0，然后从前一行下一列熊猫中获取值

问题描述

3 个解决方案

解决方案1
1 2020-11-24 10:53:44

解决方案2
1 2020-11-24 11:07:27

解决方案3
1 已采纳 2021-04-08 11:56:36

从每个客户 ID 的下一列中识别最后一个是值和 0，然后从前一行下一列熊猫中获取值

问题描述

3 个解决方案

解决方案1 1 2020-11-24 10:53:44

解决方案2 1 2020-11-24 11:07:27

解决方案3 1 已采纳 2021-04-08 11:56:36

解决方案1
1 2020-11-24 10:53:44

解决方案2
1 2020-11-24 11:07:27

解决方案3
1 已采纳 2021-04-08 11:56:36