如何从 dataframe 中的另一列按条件创建新组？

Question

So I have this kind of data所以我有这样的数据

data = [['A', 0], ['A', 1], ['A', 2], ['A', 15], ['A', 2], ['A', 12],['B',1],['B',3]]
df = pd.DataFrame(data, columns = ['name', 'interval'])

    name    interval
0   A       0
1   A       1
2   A       2
3   A       15
4   A       2
5   A       12
6   B       1
7   B       3

so I want to create a new name based on the interval (if the interval>10 then the new name is generated) but still using the previous name like this (this is just an example name)所以我想根据间隔创建一个新名称（如果间隔>10，则生成新名称）但仍然使用以前的名称（这只是一个示例名称）

    name    interval    new_name
0   A       0           A_0
1   A       1           A_0
2   A       2           A_0
3   A       15          A_1
4   A       2           A_1
5   A       12          A_2
6   B       1           B_0
7   B       3           B_0

My current code is accessing every row using for, any other idea to process it?我当前的代码正在使用 for 访问每一行，还有其他想法来处理它吗？ Thank you谢谢

###################### ######################

Credits to Rutger for his idea.感谢 Rutger 的想法。 This is the flow how to do it这是流程怎么做

    name    interval    condition  cumsum   new_name(name+"_"+cumsum)
0   A       0           False      0        A_0
1   A       1           False      0        A_0
2   A       2           False      0        A_0
3   A       15          True       1        A_1
4   A       2           False      1        A_1
5   A       12          True       2        A_2
6   B       1           False      0        B_0
7   B       3           False      0        B_0

Details of the code is in the Rutger's answer代码的详细信息在 Rutger 的回答中

Answer 1

I think the easiest is to start with creating a bool series and then create your new field like this:我认为最简单的方法是从创建 bool 系列开始，然后像这样创建新字段：

df['large_interval'] = 10 < df['interval']
df['new_name'] = df['name'] + '_' + df.groupby('name')['large'].cumsum().astype(str)

On the second line it counts how many large intervals have passed per group.在第二行，它计算每组经过了多少大间隔。 That value is used as a string and added after then name and _.该值用作字符串并在名称和_之后添加。

如何从 dataframe 中的另一列按条件创建新组？

问题描述

1 个解决方案

解决方案1
1 已采纳 2021-06-14 10:13:44

如何从 dataframe 中的另一列按条件创建新组？

问题描述

1 个解决方案

解决方案1 1 已采纳 2021-06-14 10:13:44

解决方案1
1 已采纳 2021-06-14 10:13:44