[英]How do I sort a dictionary by key length, then by an element in the value list?
我想首先按键的长度对字典进行排序,然后按值列表中的一个元素对字典进行排序。
到目前为止,我已经通过使用 lambda 排序接近了,但我遇到了问题,因为我的值是列表。 文档不清楚如何处理这个问题(或者至少据我所知)。
# What I have so far is:
d = {'ATCG': [1,2,3], 'ATTT':[1,3,5], 'GCT':[2,4,8], 'TCC': [4,5,6], 'AC':
[7,8,9]}
value_sort = sorted(d.items(), key=lambda item: item[0])
# this only sorts by the first item in my list-- how do I sort by
the third item in my list?
key_sort = sorted(value_sort, key=len, reverse=True)
# sorts by largest length key
for k in key_sort:
v = d[k[0]]
print(k[0) + ":" + str(v))
我希望 output 看起来像:
ATCG:[1,2,3]
ATTT:[1,3,5]
TCC:[4,5,6]
GCT:[2,4,8]
AC:[7,8,9]
排序是按最大键,然后按我的值列表中最小的第三个值。
编辑:我使键更清晰
这是一个按值的长度然后按值对字典进行排序的示例
示例 1:
d = {'ATCG':[1,2,3], 'ATTT':[1,3,5], 'TCC':[4,5,6], 'GCT':[2,4,8], 'AC':[7,8,9]}
# Use tuple to specify multiple criteria to sort upon
# i.e. first by length, then by key length,
# then value (i.e. t[0] is key, t[1] is value)
value_sort = sorted(d.items(), key=lambda t: (len(t[0]), t[1]))
for k, v in value_sort:
print(k, v)
Output
AC [7, 8, 9]
GCT [2, 4, 8]
TCC [4, 5, 6]
ATCG [1, 2, 3]
ATTT [1, 3, 5]
示例 2:
d = {"one": [(1,3),(1,4)],
"two": [(1,2),(1,2),(1,3)],
"three": [(1,1)]}
# Use tuple to specify multiple criteria to sort upon
# i.e. first by length, then by value
# t[1] is value of dictionary entry
value_sort = sorted(d.items(), key=lambda t: (len(t[1]), t[1]))
for k, v in value_sort:
print(k, v)
Output
three [(1, 1)]
one [(1, 3), (1, 4)]
two [(1, 2), (1, 2), (1, 3)]
如果您将其分解为自己的 function 而不是使用 lambda,则准备您的密钥 function 会更容易。
此外,您可能希望在排序后使用OrderedDict
来维护顺序:
from collections import OrderedDict
def get_keylength(v):
key, values = v
return len(key), values[0]
d = {'ATCG': [1,2,3], 'ATTT':[1,3,5], 'GCT':[2,4,8], 'TCC': [4,5,6], 'AC': [7,8,9]}
od = OrderedDict(sorted(d.items(), key=get_keylength, reverse=True))
OrderedDict([('ATCG', [1, 2, 3]), ('ATTT', [1, 3, 5]), ('TCC', [4, 5, 6]), ('GCT', [2, 4, 8]), ('AC', [7, 8, 9])])
如果你想要更清洁的 output 你可以使用pprint
(漂亮的打印):
import pprint
pprint.pprint(od)
OrderedDict([('ATCG', [1, 2, 3]),
('ATTT', [1, 3, 5]),
('TCC', [4, 5, 6]),
('GCT', [2, 4, 8]),
('AC', [7, 8, 9])])
# To get rid of OrderedDict you can convert back to a standard dictionary with `dict()`
pprint.pprint(dict(od))
{'AC': [7, 8, 9],
'ATCG': [1, 2, 3],
'ATTT': [1, 3, 5],
'GCT': [2, 4, 8],
'TCC': [4, 5, 6]}
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.