如何比较 DataFrame 中的两列并根据该比较更改第三列的值？

Question

I have following table in Pandas:我在 Pandas 中有下表：

index | project | category | period | update | amount
0     | 100130  | labour   | 202201 | 202203 | 1000
1     | 100130  | labour   | 202202 | 202203 | 1000
2     | 100130  | labour   | 202203 | 202203 | 1000
3     | 100130  | labour   | 202204 | 202203 | 1000
4     | 100130  | labour   | 202205 | 202203 | 1000

And my final goal is to get table grouped by project and category with summary of amount column but only from month of update until now.我的最终目标是让表格按项目和类别分组，并包含金额列的摘要，但仅限于从更新月份到现在。 So for example above I will get summary from 202203 until 202205 which is 3000 for project 100130 and category labour.因此，例如上面的例子，我将获得从 202203 到 202205 的摘要，对于项目 100130 和类别劳动力，这是 3000。

As a first step I tried following condition:作为第一步，我尝试了以下条件：

for index, row in table.iterrows():
    if row["period"] < row["update"]
        row["amount"] = 0

But:但：

this iteration is not working此迭代不起作用
is there some simple and not so time consuming way how to do it?有没有一些简单又不那么耗时的方法呢？ As my table has over 60.000 rows, so iteration not so good idea probably.因为我的表有超过 60.000 行，所以迭代可能不是一个好主意。

Answer 1

table["amount"] = 0 if table["period"] < table["update"] else None

Answer 2

I did some more research and this code seems to solve my problem:我做了更多研究，这段代码似乎解决了我的问题：

def check_update(row):
    if row["period"] < row["update"]:
        return 0
    else:
        return row["amount"]

table["amount2"] = table.apply(check_update, axis=1)

如何比较 DataFrame 中的两列并根据该比较更改第三列的值？

问题描述

2 个解决方案

解决方案1
0 2022-12-03 20:53:11

解决方案2
0 已采纳 2022-12-03 22:43:54

如何比较 DataFrame 中的两列并根据该比较更改第三列的值？

问题描述

2 个解决方案

解决方案1 0 2022-12-03 20:53:11

解决方案2 0 已采纳 2022-12-03 22:43:54

解决方案1
0 2022-12-03 20:53:11

解决方案2
0 已采纳 2022-12-03 22:43:54