Python：對組中心值 n SD 內的數字進行分組

Question

我有一個看起來像這樣的多個浮動列表

mylist = [10, 10.2, 10.5, 11, 15, 15.3, 15.4, 16, 27, 27.4, 28, 28.1, 28.2]

我想對彼此接近的值進行分組。 例如。 我想將 10 到 11 的值分組為 4 個值的平均值。 我很難確定中心值，然后選擇屬於該組的左右值。 我怎么能這樣做？

Answer 1

使用defaultdict怎么樣：

In [1]: from collections import defaultdict

In [2]: group = defaultdict(list)

In [3]: mylist = [10, 10.2, 10.5, 11, 15, 15.3, 15.4, 16, 27, 27.4, 28, 28.1, 28
   ...: .2]

In [4]: for val in mylist:
   ...:     group[int(val)].append(val)
   ...:     

In [5]: group
Out[5]: 
defaultdict(list,
            {10: [10, 10.2, 10.5],
             11: [11],
             15: [15, 15.3, 15.4],
             16: [16],
             27: [27, 27.4],
             28: [28, 28.1, 28.2]})

它不需要排序輸入。 此外，它保留了相關值的順序

假設，我正確理解您的要求。

Answer 2

我聽起來你想要一個通用的方法，可能是這樣的：

from scipy.stats import binned_statistic

data = [10, 10.2, 10.5, 11, 15, 15.3, 15.4, 16, 27, 27.4, 28, 28.1, 28.2]
stats, edges, binarray = binned_statistic(data,data,bins=4)

edges    # Is the boundary values that split the data evenly into 4 bins. 
binarray # Shows which numbers in your original array belong to which equal sized bin. 
         # Note that nothing belongs to bin-3 because the gap is too wide.

Python：對組中心值 n SD 內的數字進行分組

問題描述

2 個解決方案

解決方案1
2 2020-05-11 08:54:13

解決方案2
0 2020-05-11 09:09:10

Python：對組中心值 n SD 內的數字進行分組

問題描述

2 個解決方案

解決方案1 2 2020-05-11 08:54:13

解決方案2 0 2020-05-11 09:09:10

解決方案1
2 2020-05-11 08:54:13

解決方案2
0 2020-05-11 09:09:10