根據唯一的列組合將數據框拆分為多個數據框

Question

我有以下數據框：

import pandas as pd

units = [1, 1, 1, 5, 5, 5]
locations = [30, 30, 30, 32, 32, 32]
timestamps = [1, 2, 3, 1, 2, 3]
quantities = [1, 5, 3, 10, 35, 39]
data = {'units': units, 'locations': locations, 'timestamps': timestamps,
        'quantities': quantities}
df = pd.DataFrame(data=data)

看起來像這樣：

🐍 >>> df
   units  locations  timestamps  quantities
0      1         30           1           1
1      1         30           2           5
2      1         30           3           3
3      5         32           1          10
4      5         32           2          35
5      5         32           3          39

我需要從單位和位置的所有獨特組合中獲取數據框列表，即使用df.groupby(['units', 'locations']) 。 最終結果應該是這樣的：

(1, 30)
   timestamps  quantities
0           1           1
1           2           5
2           3           3

(5, 32)
   timestamps  quantities
3           1          10
4           2          35
5           3          39

請問這可能嗎？

Answer 1

通過 groupby 運行字典理解。 您可以在 Pandas doc for groupby:split-apply-combine頁面上閱讀更多相關信息：

d = {name:group.filter(['timestamps','quantities']) 
     for name, group in df.groupby(['units','locations'])}

#print(d.keys())
#dict_keys([(1, 30), (5, 32)])

print(d[(1,30)])

    timestamps  quantities
0       1           1
1       2           5
2       3           3

 print(d[(5,32)])

  timestamps    quantities
3       1          10
4       2          35
5       3          39

Answer 2

另一種方法是將 dict comp 與groupby和concat

d = pd.concat(({combo : data for combo,data in df.groupby(['units','locations'])}))

print(d)

        units  locations  timestamps  quantities
1 30 0      1         30           1           1
     1      1         30           2           5
     2      1         30           3           3
5 32 3      5         32           1          10
     4      5         32           2          35
     5      5         32           3          39

Answer 3

你是對的，它只是 groupby：

cols = ['units','locations']
for k, d in df.drop(cols, axis=1).groupby([df[c] for c in cols]):
    print(k)
    print(d)

輸出：

(1, 30)
   timestamps  quantities
0           1           1
1           2           5
2           3           3
(5, 32)
   timestamps  quantities
3           1          10
4           2          35
5           3          39

根據唯一的列組合將數據框拆分為多個數據框

問題描述

3 個解決方案

解決方案1
2 2020-03-29 00:59:59

解決方案2
1 2020-03-29 01:35:51

解決方案3
0 2020-03-29 01:59:11

根據唯一的列組合將數據框拆分為多個數據框

問題描述

3 個解決方案

解決方案1 2 2020-03-29 00:59:59

解決方案2 1 2020-03-29 01:35:51

解決方案3 0 2020-03-29 01:59:11

解決方案1
2 2020-03-29 00:59:59

解決方案2
1 2020-03-29 01:35:51

解決方案3
0 2020-03-29 01:59:11