使用平均值從 pandas.DataFrame.groupby(['arg1', 'arg2']) 創建新列

Question

我在 pandas.DataFrame 中有類似以下的數據：

df = pd.DataFrame({
    'Year' : [2001, 2001, 2001, 2001, 2002, 2002, 2002, 2002],
    'Month' : ['Aug', 'Aug', 'Sep', 'Sep', 'Aug', 'Aug', 'Sep', 'Sep'],
    'Day' : [1, 2, 1, 2, 1, 2, 1, 2],
    'Value' : [1, 2, 3, 4, 5, 6, 7, 8]  })

現在我按“月”和“年”分組，並計算平均值：

print(df.groupby(['Month', 'Year'])['Value'].mean())

output 看起來像：

月	年
八月	2001年	1.5
	2002年	5.5
九月	2001年	3.5
	2002年	7.5

現在我想創建一個新的數據框，如下所示：

年	八月	九月
2001年	1.5	3.5
2002年	5.5	7.5

pandas 模塊中是否有任何功能可以幫助我解決這個問題？ 提前致謝！

Answer 1

您可以使用 pivot_table 這樣做：

table = pd.pivot_table(df, values='Value', index=['Year'],
                columns=['Month'], aggfunc=np.mean)

問候，傑霍娜。

Answer 2

OP離預期的目標不遠了。 因為一個人正在使用pandas.DataFrame.groupby和pandas.Series.mean ，所以一個人所要做的就是使用pandas.DataFrame.unstack如下

df_new = df.groupby(['Year', 'Month'])['Value'].mean().unstack()

[Out]:

Month  Aug  Sep
Year           
2001   1.5  3.5
2002   5.5  7.5

使用平均值從 pandas.DataFrame.groupby(['arg1', 'arg2']) 創建新列

問題描述

2 個解決方案

解決方案1
0 2022-12-11 15:22:49

解決方案2
0 已采納 2022-12-11 15:25:48

使用平均值從 pandas.DataFrame.groupby(['arg1', 'arg2']) 創建新列

問題描述

2 個解決方案

解決方案1 0 2022-12-11 15:22:49

解決方案2 0 已采納 2022-12-11 15:25:48

解決方案1
0 2022-12-11 15:22:49

解決方案2
0 已采納 2022-12-11 15:25:48