Saving ordered pairs from a list of numbers using Python

Question

I have an array of numbers:

q1a = [1,2,2,2,4,3,1,3,3,4,0,0]

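I want to store them in an array as (number, proportion of that number) pairs, using Python.

For example: [[0 0.1667], [1 0.1667], [2 0.25], [3 0.25], [4 0.1667]].

This is essential for computing the distribution of the numbers. How can I do this?

I wrote code that stores (number, number of times it appears in the list), but I can't figure out how to get the proportion of each number. Thanks.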

sorted_sample_values_of_x = unique, counts = np.unique(q1a, return_counts=True)
np.asarray((unique, counts)).T
np.put(q1a, [0], [0])

sorted_x = np.matrix(sorted_sample_values_of_x)
sorted_x = np.transpose(sorted_x)
print('\n' 'Values of x (sorted):' '\n')
print(sorted_x)

Answer 1

>>> q1a = [1,2,2,2,4,3,1,3,3,4,0,0]
>>> from collections import Counter
>>> sorted([[x, float(y)/len(q1a)] for (x, y) in Counter(q1a).items()],
...        key=lambda x: x[0])
[[0, 0.16666666666666666],
 [1, 0.16666666666666666],
 [2, 0.25],
 [3, 0.25],
 [4, 0.16666666666666666]]

Answer 2

You will need to do two things:

1. Convert the sorted_x array to a float array.
2. Then divide its counts column by the sum of the counts array.

Example:

In [34]: sorted_x = np.matrix(sorted_sample_values_of_x)

In [35]: sorted_x = np.transpose(sorted_x).astype(float)

In [36]: sorted_x
Out[36]:
matrix([[ 0.,  2.],
        [ 1.,  2.],
        [ 2.,  3.],
        [ 3.,  3.],
        [ 4.,  2.]])

In [37]: sorted_x[:,1] = sorted_x[:,1]/counts.sum()

In [38]: sorted_x
Out[38]:
matrix([[ 0.        ,  0.16666667],
        [ 1.        ,  0.16666667],
        [ 2.        ,  0.25      ],
        [ 3.        ,  0.25      ],
        [ 4.        ,  0.16666667]])

To store the numbers with their proportions in a new array, do:

In [41]: sorted_x = np.matrix(sorted_sample_values_of_x)

In [42]: sorted_x = np.transpose(sorted_x).astype(float)

In [43]: ns = sorted_x/np.array([1,counts.sum()])

In [44]: ns
Out[44]:
matrix([[ 0.        ,  0.16666667],
        [ 1.        ,  0.16666667],
        [ 2.        ,  0.25      ],
        [ 3.        ,  0.25      ],
        [ 4.        ,  0.16666667]])
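
The same two steps can also be sketched with a plain ndarray instead of np.matrix, which NumPy's documentation no longer recommends for new code. This is only an illustration under that assumption; the name pairs is made up here and is not part of the original answer.

```python
import numpy as np

q1a = [1, 2, 2, 2, 4, 3, 1, 3, 3, 4, 0, 0]
unique, counts = np.unique(q1a, return_counts=True)

# Step 1: put values and counts side by side as columns of a float array.
pairs = np.column_stack((unique, counts)).astype(float)

# Step 2: divide only the counts column by the total number of items.
pairs[:, 1] /= counts.sum()

print(pairs)
```

The second column should sum to 1, which is a quick sanity check that the counts were turned into proportions.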

Answer 3

In [12]: from collections import Counter

In [13]: a = [1,2,2,2,4,3,1,3,3,4,0,0]

In [14]: counter = Counter(a)

In [15]: sorted( [ [key, float(counter[key])/len(a)]  for key in counter ] )
Out[15]:
[[0, 0.16666666666666666],
 [1, 0.16666666666666666],
 [2, 0.25],
 [3, 0.25],
 [4, 0.16666666666666666]]

Answer 4

#!/usr/bin/env python
import numpy as np
q1a = [1,2,2,2,4,3,1,3,3,4,0,0]

unique, counts = np.unique(q1a, return_counts=True)
counts = counts.astype(float) # convert to float
counts /= counts.sum()        # counts -> proportion
print(np.c_[unique, counts])

Output

[[ 0.          0.16666667]
 [ 1.          0.16666667]
 [ 2.          0.25      ]
 [ 3.          0.25      ]
 [ 4.          0.16666667]]

Answer 5

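As an alternative to collections.Counter, try collections.defaultdict. It lets you accumulate the total frequency as you go through the input (so it should be more efficient), and it is more readable (IMO).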

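from collections import defaultdict

q1a = [1,2,2,2,4,3,1,3,3,4,0,0]
n = float(len(q1a))
frequencies = defaultdict(float)  # missing keys start at 0.0
for i in q1a:
    frequencies[i] += 1/n  # each occurrence adds 1/n to that number's proportion

print(sorted(frequencies.items()))  # sort by number; dict order is not sorted by key
[(0, 0.16666666666666666), (1, 0.16666666666666666), (2, 0.25), (3, 0.25), (4, 0.16666666666666666)]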
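
A possible variation, shown only as a sketch: tally exact integer counts first and divide once at the end, so rounding error cannot build up across many small float additions. It assumes Python 3, where / is true division; the names counts and proportions are illustrative and not from the original answer.

```python
from collections import defaultdict

q1a = [1, 2, 2, 2, 4, 3, 1, 3, 3, 4, 0, 0]

counts = defaultdict(int)
for value in q1a:
    counts[value] += 1  # exact integer tally, no float rounding yet

n = len(q1a)
# One division per distinct number, sorted by the number itself.
proportions = sorted((value, count / n) for value, count in counts.items())
print(proportions)
```

For a list this short the difference is negligible; the point is that each proportion is computed with a single division.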

Answer 6

An interesting alternative using numpy:

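import numpy as np

q1a = np.array([1,2,2,2,4,3,1,3,3,4,0,0])  # an array, so q1a==val compares element-wise
print([(int(val), float(1.*np.sum(q1a==val)/len(q1a))) for val in np.unique(q1a)])
#[(0, 0.16666666666666666),
#(1, 0.16666666666666666),
#(2, 0.25),
#(3, 0.25),
#(4, 0.16666666666666666)]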

The leading 1.* forces float division (needed in Python 2; in Python 3, / is already true division).
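
The comparison above rescans the whole array once per distinct value. When the data are small non-negative integers, as they are here, np.bincount can produce all the counts in one pass. This is a different technique from the one in the answer, sketched under that assumption; the names counts, values and pairs are illustrative.

```python
import numpy as np

q1a = np.array([1, 2, 2, 2, 4, 3, 1, 3, 3, 4, 0, 0])

counts = np.bincount(q1a)        # counts[v] = number of times v occurs
values = np.nonzero(counts)[0]   # skip integers that never occur

# Convert to plain Python numbers so the pairs print cleanly.
pairs = [(int(v), float(counts[v] / q1a.size)) for v in values]
print(pairs)
```

np.bincount only accepts non-negative integers, so np.unique with return_counts=True remains the more general choice.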
