[英]Max values for partial matching keys in python dictionary
我有以下字典,其中的鍵是“ month,country:ID”,值僅是總計:
ID_dict = {'11,United Kingdom:14416': 129.22, '11,United Kingdom:17001': 357.6,
'12,United States:14035': 90000.0, '12,United Kingdom:17850': 241.16,'12,United
States:14099': 90000.0, '12,France:12583': 252.0, '12,United Kingdom:13047':
215.13, '01,Germany:12662': 78.0, '01,Germany:12600': 14000}
實際的字典將比這本大得多。
我試圖返回包含最高總數的每個“月,國家”的密鑰。 如果有平局,則ID用逗號分隔。 基於以上字典的示例輸出:
'11,United Kingdom:17001'
'12,United Kingdom:17850'
'12,United States:14035, 14099'
'12,France:12583'
'01,Germany:12600'
我可以使用以下代碼獲取最高值的字符串:
highest = max(ID_dict.values())
print([k for k, v in ID_dict.items() if v == highest])
但是,真正要努力克服這一點。 我當時正在嘗試使用re.match和re.search,但並沒有走得太遠。
您可以找到month, country
對的最大值,並將此關系存儲在字典中。 然后創建一個字典,該字典具有對(month, country)
和作為值的IDs
列表,這些IDs
值等於該對(month, country)
對的最大值:
import re
ID_dict = {'11,United Kingdom:14416': 129.22, '11,United Kingdom:17001': 357.6,
'12,United States:14035': 90000.0, '12,United Kingdom:17850': 241.16, '12,United States:14099': 90000.0,
'12,France:12583': 252.0, '12,United Kingdom:13047': 215.13, '01,Germany:12662': 78.0,
'01,Germany:12600': 14000}
table = {tuple(re.split(',|:', key)[:2]): value for key, value in sorted(ID_dict.items(), key=lambda e: e[1])}
result = {}
for key, value in ID_dict.items():
splits = re.split(',|:', key)
if value == table[tuple(splits[:2])]:
result.setdefault(tuple(splits[:2]), []).append(splits[2])
for key, value in result.items():
print('{}:{}'.format(','.join(key), ', '.join(value)))
輸出量
01,Germany:12600
12,United States:14099, 14035
12,United Kingdom:17850
11,United Kingdom:17001
12,France:12583
上面的方法是O(nlogn),因為它使用sorted
,使其變為O(n),您可以通過以下循環更改字典理解:
table = {}
for s, v in ID_dict.items():
key = tuple(re.split(',|:', s)[:2])
table[key] = max(table.get(key, v), v)
以下代碼使用“ month,country”鍵和(值,IDnum)列表作為值創建一個新字典。 然后,它對每個列表進行排序,並收集與最大值對應的所有IDnum。
ID_dict = {
'11,United Kingdom:14416': 129.22, '11,United Kingdom:17001': 357.6,
'12,United States:14035': 90000.0, '12,United Kingdom:17850': 241.16,
'12,United States:14099': 90000.0, '12,France:12583': 252.0,
'12,United Kingdom:13047': 215.13, '01,Germany:12662': 78.0,
'01,Germany:12600': 14000
}
# Create a new dict with 'month,country' keys
# and lists of (value, IDnum) as the values
new_data = {}
for key, val in ID_dict.items():
newkey, idnum = key.split(':')
new_data.setdefault(newkey, []).append((val, idnum))
# Sort the values for each 'month,country' key,
# and get the IDnums corresponding to the highest values
for key, val in new_data.items():
val = sorted(val, reverse=True)
highest = val[0][0]
# Collect all IDnums that have the highest value
ids = []
for v, idnum in val:
if v != highest:
break
ids.append(idnum)
print(key + ':' + ', '.join(ids))
輸出
11,United Kingdom:17001
12,United States:14099, 14035
12,United Kingdom:17850
12,France:12583
01,Germany:12600
聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.