如何合并两个字符串列表中的重复项？

Question

I am bit new with python (2.7) and I am having a hard time doing this. 我对python（2.7）有点陌生，我很难做到这一点。

I have the following lists: 我有以下列表：

animal = ['cat', 'cat', 'dog', 'dog', 'dog', 'horse']
names = ['cat_01', 'cat_02', 'dog_01', 'dog_02', 'dog_03', 'horse_01']

And I would like to have the following (it could be a list of tuples or a dict) 我想要以下内容（可能是元组列表或字典）

new = {"cat":('cat_01','cat_02'), "dog":('dog_01','dog_02', 'dog_03'), "horse":('horse_01')}

How best to do this? 如何做到最好？

Answer 1

Short solution using list comprehension: 使用列表理解的简短解决方案：

animal = ['cat', 'cat', 'dog', 'dog', 'dog', 'horse']
names = ['cat_01', 'cat_02', 'dog_01', 'dog_02', 'dog_03', 'horse_01']
result = {a:tuple([n for n in names if a in n]) for a in animal}

print result

The output: 输出：

{'cat': ('cat_01', 'cat_02'), 'horse': ('horse_01',), 'dog': ('dog_01', 'dog_02', 'dog_03')}

Answer 2

You can also use groupby from itertools 您也可以从itertools使用groupby

from itertools import groupby
my_dict = {}
for key, groups in groupby(zip(animal, names), lambda x: x[0]):
    my_dict[key] = tuple(g[1] for g in groups)

This might be a little faster when your list grows. 当您的列表增加时，这可能会更快一些。

Answer 3

Assuming your lists are sorted as they are in the example: 假设您的列表按示例中的顺序排序：

Code: 码：

my_dict = {}
for animal, name in zip(animals, names):
    my_dict.setdefault(animal, []).append(name)
print(my_dict)

Gives: 给出：

{'horse': ['horse_01'], 'dog': ['dog_01', 'dog_02', 'dog_03'], 'cat': ['cat_01', 'cat_02']}

And if you need tuples not lists: 如果您需要元组，则不列出：

my_dict = {k: tuple(v) for k, v in my_dict.items()}

如何合并两个字符串列表中的重复项？

问题描述

3 个解决方案

解决方案1
1 2017-02-19 17:53:37

解决方案2
1 2017-02-19 19:16:30

解决方案3
0 已采纳 2017-02-19 17:57:29

如何合并两个字符串列表中的重复项？

问题描述

3 个解决方案

解决方案1 1 2017-02-19 17:53:37

解决方案2 1 2017-02-19 19:16:30

解决方案3 0 已采纳 2017-02-19 17:57:29

解决方案1
1 2017-02-19 17:53:37

解决方案2
1 2017-02-19 19:16:30

解决方案3
0 已采纳 2017-02-19 17:57:29