如何合并两个数组列表（x，y，z，m），排除仅基于（x，y，z）的重复项

Question

我有两个表格形式

list1 = list(zip(SGXm, SGYm, SGZm, Lm))
list2 = list(zip(SGXmm, SGYmm, SGZmm, Lmm))

我想合并它们，同时排除重复的（x，y，z）条目，并忽略L中的差异。

list1.extend(x for x in list2 if x not in list1)

仅对我的x，y，z执行此工作，但是我想保留L（在可以选择的情况下，在第一个列表中）。

Answer 1

您必须提取需要进行比较的三元组。

seen = set(item[:3] for item in list1)
list1.extend(item for item in list2 if item[:3] not in seen)

Answer 2

如果您想对输出进行排序（特别是如果您已经对输入进行排序），则itertools.groupby和heapq.merge很好地结合使用。 如果输入尚未排序，则需要这样做。 可以一次连接并排序所有内容：

from operator import itemgetter

commonkey = itemgetter(0, 1, 2)
combined = sorted(list1 + list2, key=commonkey)

或者，如果它们已经被排序，或者您想独立地排序，请使用heapq.merge并避免对输入内容进行浅表复制：

# Explicit sort calls only necessary if inputs not already sorted
list1.sort(key=commonkey)
list2.sort(key=commonkey)

# Combine already sorted inputs with heapq.merge, avoiding intermediate lists
combined = heapq.merge(list1, list2, key=commonkey)

无论选择哪种方法，都可以通过对groupby的简单理解来跟进它，只需获取每个唯一组中的第一个条目，就仅保留每个唯一键的一个副本：

# Groups neighboring entries with the same key, and we keep only the first one
uniq = [next(g) for _, g in itertools.groupby(combined, key=commonkey)]

如何合并两个数组列表（x，y，z，m），排除仅基于（x，y，z）的重复项

问题描述

2 个解决方案

解决方案1
0 已采纳 2016-02-23 01:19:01

解决方案2
0 2016-02-23 02:40:36

如何合并两个数组列表（x，y，z，m），排除仅基于（x，y，z）的重复项

问题描述

2 个解决方案

解决方案1 0 已采纳 2016-02-23 01:19:01

解决方案2 0 2016-02-23 02:40:36

解决方案1
0 已采纳 2016-02-23 01:19:01

解决方案2
0 2016-02-23 02:40:36