簡體 English 中英

Pandas Groupby對不同的列使用不同的agg方法

[英]Pandas Groupby using different agg methods for different columns

原文 2019-05-03 14:47:05 9 1 python/ pandas/ group-by

這是場景：

我有一個大型有序數據集，包含314列和超過300.000行的ML問題。
我想通過X列（供應商）按數據集進行分組。
一列是日期時間類型，一些列本質上是數字的，而另一列是從一些分類列中進行的一次熱編碼。

期望的輸出：

我想從列X中分組，並將數字列聚合為“均值”，將某些列聚合為“最后”，將一個熱編碼的列按“總和”聚合。 全部采用相同的agg方法。

由於我們討論的是314列數據集，因此我不能僅創建包含每列的dict。

df_train.groupby('Supplier').agg({<some columns> : 'last', <some columns>: 'sum', <some columns>: 'mean' })

PS：我使用我想要應用不同聚合的序列來排序列。

1 個解決方案

您可以使用select_dtypes來獲取數字列，並在字典理解中使用它們。

numeric_cols = df_train.select_dtypes('numeric').columns

agg_dict = {c: 'sum' if c in numeric_cols else 'last' for c in df_train.columns}

grouped = df_train.groupby('Supplier').agg(agg_dict)

關於您的單熱編碼列，您需要提供有關如何識別它們的更多信息。

使用交叉表在 Pandas 中聚合具有不同聚合函數的多個列

[英]Aggregate Multiple columns with different agg functions in Pandas using Crosstab

groupby 和 agg 多列 pandas

[英]groupby and agg with multiple columns pandas

當對 GroupBy object 使用 apply 和 agg 時，pandas 給出不同的數值結果

[英]pandas gives different numerical results when using apply and agg for a GroupBy object

Groupby 2 個不同的列 Python Pandas

[英]Groupby 2 different columns Python Pandas

保留一列但在Pandas Groupby和Agg中使用其他列

[英]Keep One Column but Using Other Columns in Pandas Groupby and Agg

如何使用pandas為不同列聚合不同條件的數據？

[英]How to use pandas to agg data with different condition for different columns?

在熊貓中對不同列使用不同功能的groupby

[英]groupby in pandas with different functions for different columns

Pandas 列上的 groupby() 和 agg() 方法混淆

[英]Pandas groupby() and agg() method confusion on columns

與agg的pandas groupby無法在多列上使用

[英]pandas groupby with agg not working on multiple columns

熊貓中不同列的填充方法不同

[英]Different fill methods for different columns in pandas

暫無

暫無

聲明:本站的技術帖子網頁，遵循CC BY-SA 4.0協議，如果您需要轉載，請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

相關問題 使用交叉表在 Pandas 中聚合具有不同聚合函數的多個列 groupby 和 agg 多列 pandas 當對 GroupBy object 使用 apply 和 agg 時，pandas 給出不同的數值結果 Groupby 2 個不同的列 Python Pandas 保留一列但在Pandas Groupby和Agg中使用其他列如何使用pandas為不同列聚合不同條件的數據？在熊貓中對不同列使用不同功能的groupby Pandas 列上的 groupby() 和 agg() 方法混淆與agg的pandas groupby無法在多列上使用熊貓中不同列的填充方法不同

相關標簽

粵ICP備18138465號 © 2020-2024 STACKOOM.COM