Python，集群两列计数

Question

I have the following list: 我有以下清单：

a = [['A','R.1',1],['B','R.2',1],['B','R.2',2],['C','R.2',3],
     ['C','C.1',4],['C','C.1',5],['A','C.1',8],['B','C.1',9],
     ['B','C.1',1],['A','R.3',2],['C','R.1',3],['A','R.2',4],
     ['C','R.1',5],['A','R.1',1],['C','R.2',5],['A','R.1',8]]

I need to somehow group it to generate the following result: 我需要以某种方式对其进行分组以生成以下结果：

[['A', 'C.1', 1],
 ['A', 'R.1', 3],
 ['A', 'R.2', 1],
 ['A', 'R.3', 1],
 ['B', 'C.1', 2],
 ['B', 'R.2', 2],
 ['C', 'C.1', 2],
 ['C', 'R.1', 2],
 ['C', 'R.2', 2]]

Where the third column is count of rows where the first and second columns match. 第三列是第一列和第二列匹配的行数。 From the original list the value of the third column is negligible. 从原始列表中，第三列的值可以忽略不计。

I have already tried via "for" nested and "list comprehension", but I have not been able to come up with any results. 我已经尝试过通过“用于”嵌套和“列表理解”，但是我还无法得出任何结果。

Does anyone have any clue how I can resolve this? 有谁知道如何解决这个问题？

Answer 1

With collections.defaultdict object: 使用collections.defaultdict对象：

import collections

a = [['A','R.1',1],['B','R.2',1],['B','R.2',2],['C','R.2',3],
     ['C','C.1',4],['C','C.1',5],['A','C.1',8],['B','C.1',9],
     ['B','C.1',1],['A','R.3',2],['C','R.1',3],['A','R.2',4],
     ['C','R.1',5],['A','R.1',1],['C','R.2',5],['A','R.1',8]]

d = collections.defaultdict(int)
for l in a:
    d[(l[0],l[1])] += 1

result = [list(k)+[v] for k,v in sorted(d.items())]
print(result)

The output: 输出：

[['A', 'C.1', 1], ['A', 'R.1', 3], ['A', 'R.2', 1], ['A', 'R.3', 1], ['B', 'C.1', 2], ['B', 'R.2', 2], ['C', 'C.1', 2], ['C', 'R.1', 2], ['C', 'R.2', 2]]

Just for "pretty" print: 仅用于“漂亮”打印：

import pprint
...
pprint.pprint(result)

The output: 输出：

[['A', 'C.1', 1],
 ['A', 'R.1', 3],
 ['A', 'R.2', 1],
 ['A', 'R.3', 1],
 ['B', 'C.1', 2],
 ['B', 'R.2', 2],
 ['C', 'C.1', 2],
 ['C', 'R.1', 2],
 ['C', 'R.2', 2]]

Answer 2

Similar to @RomanPerekhrest, I used a Counter : 与@RomanPerekhrest相似，我使用了一个Counter ：

from collections import Counter

a = [['A','R.1',1],['B','R.2',1],['B','R.2',2],['C','R.2',3],
     ['C','C.1',4],['C','C.1',5],['A','C.1',8],['B','C.1',9],
     ['B','C.1',1],['A','R.3',2],['C','R.1',3],['A','R.2',4],
     ['C','R.1',5],['A','R.1',1],['C','R.2',5],['A','R.1',8]]

def transform(table):
    c = Counter(map(lambda c: tuple(c[:-1]), table))
    return sorted(map(lambda p: list(p[0]) + [p[1]], c.items()))

print(transform(a))

Python，集群两列计数

问题描述

2 个解决方案

解决方案1
1 2017-10-06 21:20:19

解决方案2
0 2017-10-06 21:33:21

Python，集群两列计数

问题描述

2 个解决方案

解决方案1 1 2017-10-06 21:20:19

解决方案2 0 2017-10-06 21:33:21

解决方案1
1 2017-10-06 21:20:19

解决方案2
0 2017-10-06 21:33:21