Add values in a new column of a df based on a condition

Question

I have the following df, sorted by date and then by name:

         date name  valor
2  2018-03-01  ACC     75
0  2018-03-01  ACE     50
0  2018-03-20  ACE     50
1  2018-03-01  BBV     20
1  2018-03-14  BBV     20
5  2018-04-16  BBV     58
6  2018-04-20  BBV    -58

I would like to generate a new column (called result) in the df where, if a value in name is the same as the one in the next row, the corresponding valor values are added together in the new column.

The desired output would look something like this:

         date name  valor  result
2  2018-03-01  ACC     75      75
0  2018-03-01  ACE     50      50
0  2018-03-20  ACE     50     100
1  2018-03-01  BBV     20      20
1  2018-03-14  BBV     20      40
5  2018-04-16  BBV     58      98
6  2018-04-20  BBV    -58      40

This is what I am trying:

for index,row in df.iterrows():
    for i in range(1,len(df)+1):
        if (row['name'][i]==row['name'][i+1]) and ( row['name'][i-1]!=row['name'][i]):
            df["result"]=df["valor"][i]+df["valor"][i+1]
        elif (row['name'][i]==row['name'][i+1]) and (row['name'][i-1]==row['name'][i]):
            df["result"]=df["result"][i]+df["valor"][i+1]

This raises an indexing error, string index out of range. In any case, I am sure there should be a more efficient way to obtain the desired output.

Thank you for reading my post.

Answer 1

You should use groupby.cumsum for this. Using the vectorised functionality that comes with pandas is usually more efficient and cleaner than iterating over rows.

df['result'] = df.groupby('name')['valor'].cumsum()

print(df)

         date name  valor  result
2  2018-03-01  ACC     75      75
0  2018-03-01  ACE     50      50
0  2018-03-20  ACE     50     100
1  2018-03-01  BBV     20      20
1  2018-03-14  BBV     20      40
5  2018-04-16  BBV     58      98
6  2018-04-20  BBV    -58      40
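
For anyone who wants to try this without the original data, here is a minimal reproducible sketch. The frame is rebuilt by hand from the table in the question (including its repeated index labels), so the construction step is an assumption about how the df was created, not the asker's actual code.

```python
import pandas as pd

# Sample frame rebuilt from the question; the index labels are copied as shown there.
df = pd.DataFrame(
    {
        "date": ["2018-03-01", "2018-03-01", "2018-03-20", "2018-03-01",
                 "2018-03-14", "2018-04-16", "2018-04-20"],
        "name": ["ACC", "ACE", "ACE", "BBV", "BBV", "BBV", "BBV"],
        "valor": [75, 50, 50, 20, 20, 58, -58],
    },
    index=[2, 0, 0, 1, 1, 5, 6],
)

# Running total of valor within each name, following the current row order.
df["result"] = df.groupby("name")["valor"].cumsum()

print(df)
```

The cumulative sum follows whatever order the rows are in, so if the df is not already ordered by date within each name, sorting it first (for example with df.sort_values(['name', 'date'])) is probably what you want. As for the original error: row['name'] is a single string such as 'ACC', so row['name'][i] indexes its characters rather than the rows of the df, which is why the loop fails with string index out of range.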

Answer 1: 2, accepted, 2018-04-28 09:32:22
