How to add a new column with a repeated value per group based on a condition in a Pandas DataFrame?

Question

This is an example DataFrame.

RootProduct | Product | Value
    A           A        1  
    A           B        2   
    A           C        3
    D           D        4
    D           E        5

How can I add a fourth column that, within each RootProduct group, repeats the value of the Value column taken from the row where RootProduct == Product?

This would result in the following DataFrame:

RootProduct | Product | Value  | RootValue
    A           A        1          1
    A           B        2          1
    A           C        3          1 
    D           D        4          4 
    D           E        5          4

Answer 1

The idea is to compare the two columns with Series.eq and keep the matching rows by boolean indexing, then build a Series indexed by Product with DataFrame.set_index, so that Series.map can be applied to the RootProduct column:

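import pandas as pd

# Example data from the question
df = pd.DataFrame({'RootProduct': ['A', 'A', 'A', 'D', 'D'],
                   'Product': ['A', 'B', 'C', 'D', 'E'],
                   'Value': [1, 2, 3, 4, 5]})

# Rows where the product is its own root, indexed by Product
s = df[df['RootProduct'].eq(df['Product'])].set_index('Product')['Value']
# Look up each row's root value through its RootProduct
df['RootValue'] = df['RootProduct'].map(s)
print(df)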
  RootProduct Product  Value  RootValue
0           A       A      1          1
1           A       B      2          1
2           A       C      3          1
3           D       D      4          4
4           D       E      5          4
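One edge case is worth a quick illustration: Series.map returns NaN for any key that is missing from the lookup Series, so a group without a row where RootProduct == Product ends up with a missing RootValue, and the column becomes float. The sketch below uses hypothetical data (the group 'F' and the names demo and root_values are not from the question) to show this:

```python
import pandas as pd

# Hypothetical data: group 'F' has no row where RootProduct == Product
demo = pd.DataFrame({
    'RootProduct': ['A', 'A', 'F'],
    'Product': ['A', 'B', 'G'],
    'Value': [1, 2, 7],
})

# Same technique as in the answer: lookup Series indexed by Product
root_values = demo[demo['RootProduct'].eq(demo['Product'])].set_index('Product')['Value']

# 'F' is not in root_values.index, so its row gets NaN
demo['RootValue'] = demo['RootProduct'].map(root_values)
print(demo)
```

If such groups can occur, one option is to follow the map call with fillna and a default of your choosing.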

Detail of the Series:

print(s)
Product
A    1
D    4
Name: Value, dtype: int64
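As a possible alternative to the map lookup, the same column can be sketched with Series.where and a grouped transform. This is not the method from the answer, only a variation under the assumption that each RootProduct group has at most one row where RootProduct == Product; the name root_only is introduced here for illustration.

```python
import pandas as pd

df = pd.DataFrame({
    'RootProduct': ['A', 'A', 'A', 'D', 'D'],
    'Product': ['A', 'B', 'C', 'D', 'E'],
    'Value': [1, 2, 3, 4, 5],
})

# Keep Value only on rows where the product is its own root; others become NaN
root_only = df['Value'].where(df['RootProduct'].eq(df['Product']))

# 'first' picks the first non-null value per group and transform
# broadcasts it back to every row of that group
df['RootValue'] = root_only.groupby(df['RootProduct']).transform('first')
print(df)
```

Because where introduces NaN, RootValue comes out as float here; cast it back to an integer type if every group is guaranteed to have a root row.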

Answer 1
4, accepted, 2020-02-22 10:05:25