如何实现原dataframe每一行的组键？（'by' 是 pandas 石斑鱼）

Question

I would like to materialize for each row of a dataframe the corresponding group key it would get if I was using a groupby operation with a pandas Grouper .如果我将groupby操作与 pandas Grouper一起使用，我想为 dataframe 的每一行实现相应的组密钥。

import pandas as pd

# Test data
ts = [pd.Timestamp('2022/03/01 09:00'),
      pd.Timestamp('2022/03/01 10:00'),
      pd.Timestamp('2022/03/01 10:30'),
      pd.Timestamp('2022/03/01 15:00')]
df = pd.DataFrame({'a':range(len(ts)), 'ts': ts})

grouper = pd.Grouper(key='ts', freq='2H', sort=False, origin='start_day')

Is there any way to get for each row the corresponding groupkey?有没有办法为每一行获取相应的组键？ The result I am looking for could be either a list, or a pandas Series or Index, or numpy array, the same length as the initial dataframe, and would then contain following values.我正在寻找的结果可能是一个列表，或者一个 pandas 系列或索引，或者 numpy 数组，与初始 dataframe 的长度相同，然后将包含以下值。

result = pd.Series([pd.Timestamp('2022-03-01 08:00:00'),
                    pd.Timestamp('2022-03-01 10:00:00'),
                    pd.Timestamp('2022-03-01 10:00:00'),
                    pd.Timestamp('2022-03-01 14:00:00')])

Thanks for your help!谢谢你的帮助！ Bests最好的

Answer 1

Similar idea to @Andrej, just creates a table with a new column与@Andrej 类似的想法，只是创建一个带有新列的表

pd.concat(g.assign(grouper_val = i) for i,g in df.groupby(grouper))

Answer 2

Not directly using the groupby but you can use:不直接使用groupby但您可以使用：

df['ts'].dt.floor('2H')

With the groupby :使用groupby ：

df.groupby(grouper)['ts'].transform(lambda g: g.name)

Output: Output：

0   2022-03-01 08:00:00
1   2022-03-01 10:00:00
2   2022-03-01 10:00:00
3   2022-03-01 14:00:00
Name: ts, dtype: datetime64[ns]

Answer 3

Given:鉴于：

   a                  ts
0  0 2022-03-01 09:00:00
1  1 2022-03-01 10:00:00
2  2 2022-03-01 10:30:00
3  3 2022-03-01 15:00:00

Doing:正在做：

pd.Series(df.resample('2H', origin='start_day', on='ts').groups)

Output: Output：

2022-03-01 08:00:00    1
2022-03-01 10:00:00    3
2022-03-01 12:00:00    3
2022-03-01 14:00:00    4
dtype: int64

如何实现原dataframe每一行的组键？（'by' 是 pandas 石斑鱼）

问题描述

3 个解决方案

解决方案1
2 2022-08-12 20:30:54

解决方案2
2 2022-08-12 20:32:03

解决方案3
0 2022-08-12 21:06:07

如何实现原dataframe每一行的组键？ （'by' 是 pandas 石斑鱼）

问题描述

3 个解决方案

解决方案1 2 2022-08-12 20:30:54

解决方案2 2 2022-08-12 20:32:03

解决方案3 0 2022-08-12 21:06:07

如何实现原dataframe每一行的组键？（'by' 是 pandas 石斑鱼）

解决方案1
2 2022-08-12 20:30:54

解决方案2
2 2022-08-12 20:32:03

解决方案3
0 2022-08-12 21:06:07