使用来自同一行但不同列的值填充字典

Question

Lately I've been trying to map some values, so I'm trying to create a dictionary to do so. 最近我一直试图映射一些值，所以我试图创建一个字典来做到这一点。 The odd thing is my DataFrame has a column made of lists, and DataFrames are always a bit awkward with lists. 奇怪的是我的DataFrame有一个由列组成的列，而DataFrame对列表总是有点尴尬。 The DataFrame has the following structure: DataFrame具有以下结构：

    rules          procedure
['10','11','12']       1
['13','14']            2
['20','21','22','24']  3

So I want to create a dictionary that maps '10' to 1, '14' to 2, and so on. 所以我想创建一个字典，将'10'映射到1，'14'映射到2，依此类推。 I tried the following: 我尝试了以下方法：

dicc=dict()
for j in df['rules']:
    for i,k in zip(j,df.procedure):
        dicc[i]=k

But that isn't making it. 但那不是成功的。 Probably something to do with indexes. 可能与索引有关。 What am I missing? 我错过了什么？

Edit: I'm trying to create a dictionary that maps the values '10', '11', '12' to 1; 编辑：我正在尝试创建一个将值'10'，'11'，'12'映射到1的字典; '13','14' to 2; '13'，'14'到2; '20','21','22','24' to 3, so if I type dicc['10'] I get 1 , if I type dicc['22'] I get 3 . “20”，“21”，“22”，“24”到3，所以如果我输入dicc['10']我得到1 ，如果我输入dicc['22']我得到3 。 Obviously, the actual DataFrame is quite bigger and I can't do it manually. 显然，实际的DataFrame非常大，我无法手动完成。

Answer 1

You can do it like this: 你可以这样做：

import pandas as pd

data = [[['10', '11', '12'], 1],
        [['13', '14'], 2],
        [['20', '21', '22', '24'], 3]]

df = pd.DataFrame(data=data, columns=['rules', 'procedure'])

d = {r : p for rs, p in df[['rules', 'procedure']].values for r in rs}
print(d)

Output 产量

{'20': 3, '10': 1, '11': 1, '24': 3, '14': 2, '22': 3, '13': 2, '12': 1, '21': 3}

Notes: 笔记：

The code {r : p for rs, p in df[['rules', 'procedure']].values for r in rs} is a dictionary comprehension, the dictionary counterpart of list. 代码{r : p for rs, p in df[['rules', 'procedure']].values for r in rs}是字典理解，是列表的字典对应物。
The df[['rules', 'procedure']].values is equivalent to zip(df.rules, df.procedure) it outputs a pair of list, int. df[['rules', 'procedure']].values相当于zip(df.rules, df.procedure)它输出一对list，int。 So the rs variable is a list and p is an integer. 所以rs变量是一个列表， p是一个整数。
Finally you iterate over the values of rs using the second for loop 最后，使用第二个for循环迭代rs的值

UPDATE UPDATE

As suggested for @piRSquared you can use zip: 正如@piRSquared的建议你可以使用zip：

d = {r : p for rs, p in zip(df.rules, df.procedure) for r in rs}

Answer 2

Help from `cytoolz` 来自`cytoolz`帮助

from cytoolz.dicttoolz import merge

merge(*map(dict.fromkeys, df.rules, df.procedure))

{'10': 1,
 '11': 1,
 '12': 1,
 '13': 2,
 '14': 2,
 '20': 3,
 '21': 3,
 '22': 3,
 '24': 3}

Note 注意

I updated my post to mimic how @jpp passed multiple iterables to map . 我更新了我的帖子以模仿@jpp如何通过多个迭代来map 。 @jpp's answer is very good . @jpp的答案非常好。 Though I'd advocate for upvoting all useful answers, I wish I could upvote their answer again (-: 虽然我主张提出所有有用的答案，但我希望我能再次提出他们的答案（ - ：

Answer 3

Using collections.ChainMap : 使用collections.ChainMap ：

from collections import ChainMap

res = dict(ChainMap(*map(dict.fromkeys, df['rules'], df['procedure'])))

print(res)

{'10': 1, '11': 1, '12': 1, '13': 2, '14': 2,
 '20': 3, '21': 3, '22': 3, '24': 3}

For many uses, the final dict conversion is not necessary: 对于许多用途，最终的dict转换不是必需的：

A ChainMap class is provided for quickly linking a number of mappings so they can be treated as a single unit. 提供了一个ChainMap类，用于快速链接多个映射，以便将它们视为一个单元。 It is often much faster than creating a new dictionary and running multiple update() calls. 它通常比创建新字典和运行多个update()调用快得多。

See also What is the purpose of collections.ChainMap? 另请参见collections.ChainMap的目的是什么？

Answer 4

You may check flatten the list 您可以检查展平列表

dict(zip(sum(df.rules.tolist(),[]),df.procedure.repeat(df.rules.str.len())))
Out[60]: 
{'10': 1,
 '11': 1,
 '12': 1,
 '13': 2,
 '14': 2,
 '20': 3,
 '21': 3,
 '22': 3,
 '24': 3}

Answer 5

using itertools.chain and DataFrame.itertuples : 使用itertools.chain和DataFrame.itertuples ：

dict(
    chain.from_iterable(
        ((rule, row.procedure) for rule in row.rules) for row in df.itertuples()
    )
)

使用来自同一行但不同列的值填充字典

问题描述

5 个解决方案

解决方案1
8 已采纳 2018-10-09 15:06:48

解决方案2
5 2018-10-09 15:11:21

Help from `cytoolz` 来自`cytoolz`帮助

Note 注意

解决方案3
4 2018-10-09 15:16:45

解决方案4
2 2018-10-09 15:03:25

解决方案5
1 2018-10-09 15:23:09

使用来自同一行但不同列的值填充字典

问题描述

5 个解决方案

解决方案1 8 已采纳 2018-10-09 15:06:48

解决方案2 5 2018-10-09 15:11:21

Help from cytoolz 来自cytoolz帮助

Note 注意

解决方案3 4 2018-10-09 15:16:45

解决方案4 2 2018-10-09 15:03:25

解决方案5 1 2018-10-09 15:23:09

解决方案1
8 已采纳 2018-10-09 15:06:48

解决方案2
5 2018-10-09 15:11:21

Help from `cytoolz` 来自`cytoolz`帮助

解决方案3
4 2018-10-09 15:16:45

解决方案4
2 2018-10-09 15:03:25

解决方案5
1 2018-10-09 15:23:09