根据另一列中的值创建新列

Question

I have a data frame:我有一个数据框：

 id|concept    |description
 12|           |rewards member
 12|tier one   |
 12|not avail  |rewards member

GOAL: Create a new column final_desc with the content in either the concept or description column目标：使用concept或description列中的内容创建一个新列final_desc

There are 4 possible scenarios:有4种可能的情况：

There is a value in concept column and not in description , in which final_desc is the value in the concept在concept列有值，在description中没有，其中final_desc是concept中的值
There is a value in description column and not in concept , in which final_desc is the value in the description description列中有值， concept中没有，其中final_desc是description中的值
The value in concept column is not avail , in which final_desc is the value in the description concept列中的值无效，其中final_desc是description中的值
Both the concept and description column are empty, in which final_desc is empty concept和description栏均为空，其中final_desc为空

I tried using a where statement but that does not account for scenario 3.我尝试使用 where 语句，但这不考虑场景 3。

df['final_desc'] = np.where(df['concept'].isnull(), df['description'], df['concept'])

I think I need a custom function but am not sure how to write to work across columns我想我需要一个自定义 function 但不知道如何编写跨列工作

Answer 1

You can combine a replace and ffill/bfill :您可以结合使用replace和ffill/bfill ：

df['final_desc'] = (df[['concept','description']].replace('not avail',np.nan)
                     .bfill(1)['concept']
                   )

Output: Output：

   id    concept     description      final_desc
0  12        NaN  rewards member  rewards member
1  12   tier one             NaN        tier one
2  12  not avail  rewards member  rewards member

Answer 2

This might do the trick:这可能会奏效：

df['final_desc'] = df.concept.replace('not avail',np.nan).fillna(df.description).fillna(df.concept.replace('not avail',np.nan))

根据另一列中的值创建新列

问题描述

2 个解决方案

解决方案1
0 2020-06-18 20:10:06

解决方案2
0 2020-06-18 20:15:06

根据另一列中的值创建新列

问题描述

2 个解决方案

解决方案1 0 2020-06-18 20:10:06

解决方案2 0 2020-06-18 20:15:06

解决方案1
0 2020-06-18 20:10:06

解决方案2
0 2020-06-18 20:15:06