Pandas pivot_table: preserving the original order

Question

>>> df
   A   B   C      D
0  foo one small  1
1  foo one large  2
2  foo one large  2
3  foo two small  3
4  foo two small  3
5  bar one large  4
6  bar one small  5
7  bar two small  6
8  bar two large  7
>>> table = pivot_table(df, values='D', index=['A', 'B'],
...                     columns=['C'], aggfunc=np.sum)
>>> table
          small  large
foo  one  1      4
     two  6      NaN
bar  one  5      4
     two  6      7

I want the output to look like the table above, but what I actually get is sorted: bar comes before foo, and so on.
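A minimal reproduction of the behaviour described may help here. It assumes pandas is imported as pd and rebuilds the frame from the listing above; the names row_labels and column_labels are only illustrative.

```python
import pandas as pd

df = pd.DataFrame({
    "A": ["foo", "foo", "foo", "foo", "foo", "bar", "bar", "bar", "bar"],
    "B": ["one", "one", "one", "two", "two", "one", "one", "two", "two"],
    "C": ["small", "large", "large", "small", "small", "large", "small", "small", "large"],
    "D": [1, 2, 2, 3, 3, 4, 5, 6, 7],
})

# Default call: pivot_table sorts both the row and the column labels.
table = pd.pivot_table(df, values="D", index=["A", "B"], columns=["C"], aggfunc="sum")

row_labels = list(table.index)       # the bar rows come before the foo rows
column_labels = list(table.columns)  # large comes before small
print(table)
```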

Answer 1

I don't think pivot_table has a sort option, but groupby does:

df.groupby(['A', 'B', 'C'], sort=False)['D'].sum().unstack('C')
Out: 
C        small  large
A   B                
foo one    1.0    4.0
    two    6.0    NaN
bar one    5.0    4.0
    two    6.0    7.0

You pass the grouping columns to groupby, and then call unstack on the ones you want to appear as column labels.

If you don't want the index names, rename them to None:

df.groupby(['A', 'B', 'C'], sort=False)['D'].sum().rename_axis([None, None, None]).unstack(level=2)
Out: 
         small  large
foo one    1.0    4.0
    two    6.0    NaN
bar one    5.0    4.0
    two    6.0    7.0
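For reuse, the groupby/unstack pattern above could be wrapped in a small function. This is only a sketch: pivot_keep_order is a hypothetical helper name, not part of pandas, and it assumes index and columns are lists of column names.

```python
import pandas as pd


def pivot_keep_order(df, index, columns, values, aggfunc="sum"):
    """Hypothetical helper: pivot through groupby(sort=False) and unstack."""
    keys = list(index) + list(columns)
    # sort=False keeps the groups in order of first appearance.
    grouped = df.groupby(keys, sort=False)[values].agg(aggfunc)
    # Move the `columns` levels from the row index to the column axis.
    return grouped.unstack(list(columns))


df = pd.DataFrame({
    "A": ["foo", "foo", "foo", "foo", "foo", "bar", "bar", "bar", "bar"],
    "B": ["one", "one", "one", "two", "two", "one", "one", "two", "two"],
    "C": ["small", "large", "large", "small", "small", "large", "small", "small", "large"],
    "D": [1, 2, 2, 3, 3, 4, 5, 6, 7],
})

result = pivot_keep_order(df, index=["A", "B"], columns=["C"], values="D")
print(result)
```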

Answer 2

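When a pivot_table is created, the index is automatically sorted alphabetically. This applies not only to foo and bar; you may notice that small and large are sorted as well. If you want foo on top, you need to sort the result again with sortlevel (sort_index in current pandas). To get the output shown in the question, sort on both A and C.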

# sortlevel was removed in pandas 1.0; sort_index is its replacement
table.sort_index(level=["A", "B"], ascending=[False, True], sort_remaining=False, inplace=True)
table.sort_index(axis=1, ascending=False, inplace=True)
print(table)

Output:

C        small  large
A   B                
foo one  1.0    4.0  
    two  6.0    NaN   
bar one  5.0    4.0  
    two  6.0    7.0

Update:

To remove the index names A, B and C:

table.columns.name = None
table.index.names = (None, None)
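Note that sorting in descending order restores the original layout here only because reverse alphabetical order happens to match the order in df (foo before bar, small before large). For data where that does not hold, one possible sketch is to reindex the pivoted table by order of first appearance; row_order, column_order and ordered are illustrative names.

```python
import pandas as pd

df = pd.DataFrame({
    "A": ["foo", "foo", "foo", "foo", "foo", "bar", "bar", "bar", "bar"],
    "B": ["one", "one", "one", "two", "two", "one", "one", "two", "two"],
    "C": ["small", "large", "large", "small", "small", "large", "small", "small", "large"],
    "D": [1, 2, 2, 3, 3, 4, 5, 6, 7],
})

table = pd.pivot_table(df, values="D", index=["A", "B"], columns=["C"], aggfunc="sum")

# Row keys and column labels in the order they first appear in df.
row_order = pd.MultiIndex.from_frame(df[["A", "B"]].drop_duplicates())
column_order = pd.Index(df["C"].unique(), name="C")

# Reindexing rearranges the sorted pivot without recomputing the aggregation.
ordered = table.reindex(index=row_order, columns=column_order)
print(ordered)
```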

Answer 3

Starting with pandas 1.3.0, you can pass sort=False to pd.pivot_table:

>>> import pandas as pd
>>> df = pd.DataFrame({"A": ["foo", "foo", "foo", "foo", "foo", "bar", "bar", "bar", "bar"],
...                    "B": ["one", "one", "one", "two", "two", "one", "one", "two", "two"],
...                    "C": ["small", "large", "large", "small","small", "large", "small", "small", "large"],
...                    "D": [1, 2, 2, 3, 3, 4, 5, 6, 7],
...                    "E": [2, 4, 5, 5, 6, 6, 8, 9, 9]})
>>> pd.pivot_table(df, values='D', index=['A', 'B'], columns=['C'],
...                aggfunc='sum', sort=False)
C        large  small
A   B                
foo one    4.0    1.0
    two    NaN    6.0
bar one    4.0    5.0
    two    7.0    6.0
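In the output above the rows keep their original order, but the columns still come out as large, small. If the column order matters too, one option is to reindex the columns explicitly after pivoting. This is a sketch that assumes pandas 1.3.0 or later and the same df as above, without the unused column E.

```python
import pandas as pd

df = pd.DataFrame({
    "A": ["foo", "foo", "foo", "foo", "foo", "bar", "bar", "bar", "bar"],
    "B": ["one", "one", "one", "two", "two", "one", "one", "two", "two"],
    "C": ["small", "large", "large", "small", "small", "large", "small", "small", "large"],
    "D": [1, 2, 2, 3, 3, 4, 5, 6, 7],
})

# sort=False (pandas >= 1.3.0) keeps the row groups in order of appearance.
table = pd.pivot_table(df, values="D", index=["A", "B"], columns=["C"],
                       aggfunc="sum", sort=False)

# Put the columns in first-appearance order regardless of what pivot_table returned.
table = table.reindex(columns=pd.Index(df["C"].unique(), name="C"))
print(table)
```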

3 answers

Answer 1
7 2017-07-08 17:07:14

Answer 2
1 accepted 2017-07-08 17:12:44

Answer 3
0 2021-11-15 15:57:39
