在不导入库和使用集合的情况下删除列表中重复项的最快方法

Question

我试图使用以下代码从列表中删除重复项：

a = [1,2,3,4,2,6,1,1,5,2]
res = []
[res.append(i) for i in a if i not in res]

但我想这样做而不将我想要的列表定义为空列表（即省略行res = [] ），例如：

a = [1,2,3,4,2,6,1,1,5,2]
#Either:
res = [i for i in a if i not in res]
#Or:
[i for i in a if i not in 'this list'] # this list is not a string. I meant it as the list being comprehensed

我想避免库导入和set()

Answer 1

我想可能对你有用。 它在保持顺序的同时从列表中删除重复项。

newlist=[i for n,i in enumerate(L) if i not in L[:n]]

Answer 2

对于 Python3.6+，您可以使用dict.fromkeys() ：

>>> a = [1, 2, 3, 4, 2, 6, 1, 1, 5, 2]
>>> list(dict.fromkeys(a))
[1, 2, 3, 4, 6, 5]

从文档：

创建一个新字典，其中的键来自可迭代对象，值设置为值。

如果您使用的是较低的 Python 版本，则需要使用collections.OrderedDict来维护订单：

>>> from collections import OrderedDict
>>> a = [1, 2, 3, 4, 2, 6, 1, 1, 5, 2]
>>> list(OrderedDict.fromkeys(a))
[1, 2, 3, 4, 6, 5]

Answer 3

这是建议的解决方案的简单基准，

它表明dict.fromkeys将表现最好

from simple_benchmark import BenchmarkBuilder
import random


b = BenchmarkBuilder()

@b.add_function()
def AmitDavidson(a):
    return [i for n,i in enumerate(a) if i not in a[:n]]

@b.add_function()
def RoadRunner(a):
    return list(dict.fromkeys(a))

@b.add_function()
def DaniMesejo(a):
    return  list({k: '' for k in a})


@b.add_function()
def rdas(a):
    return  sorted(list(set(a)), key=lambda x: a.index(x))


@b.add_function()
def unwanted_set(a):
    return  list(set(a))


@b.add_arguments('List lenght')
def argument_provider():
    for exp in range(2, 18):
        size = 2**exp
        yield size, [random.randint(0, 10) for _ in range(size)]

r = b.run()
r.plot()

Answer 4

这是一个使用set的解决方案，它确实保留了顺序：

a = [1,2,3,4,2,6,1,1,5,2]
a_uniq = sorted(list(set(a)), key=lambda x: a.index(x))
print(a_uniq)

Answer 5

单行，理解， O(n) ，保留 Python 3.6+ 中的顺序：

a = [1, 2, 3, 4, 2, 6, 1, 1, 5, 2]

res = list({k: '' for k in a})
print(res)

在不导入库和使用集合的情况下删除列表中重复项的最快方法

问题描述

5 个解决方案

解决方案1
6 2020-04-18 13:45:50

解决方案2
5 已采纳 2020-04-18 14:04:30

解决方案3
4 2020-04-18 15:06:59

解决方案4
3 2020-04-18 13:48:31

解决方案5
2 2020-04-18 13:45:07

在不导入库和使用集合的情况下删除列表中重复项的最快方法

问题描述

5 个解决方案

解决方案1 6 2020-04-18 13:45:50

解决方案2 5 已采纳 2020-04-18 14:04:30

解决方案3 4 2020-04-18 15:06:59

解决方案4 3 2020-04-18 13:48:31

解决方案5 2 2020-04-18 13:45:07

解决方案1
6 2020-04-18 13:45:50

解决方案2
5 已采纳 2020-04-18 14:04:30

解决方案3
4 2020-04-18 15:06:59

解决方案4
3 2020-04-18 13:48:31

解决方案5
2 2020-04-18 13:45:07