
Fastest way to remove duplicates in list of lists in Python?

I have a list of lists in Python3, where the data looks like this:

['Type1', ['123', '22'], ['456', '80']]
['Type2', ['456', '80'], ['123', '22']]

The list is quite large, but the above is an example of duplicate data I need to remove. Below is an example of data that is NOT duplicated and does not need to be removed:

['Type1', ['789', '45'], ['456', '80']]
['Type2', ['456', '80'], ['123', '22']]

I've already removed all the exact identical duplicates. What is the fastest way to accomplish this "reversed duplicate" removal in Python3?

Two possibilities:

  1. Convert each sublist to a tuple and insert into a set. Do the same for the compare candidate and compare sets to determine equality.

  2. Establish a sorting method for the sublists, then sort each list of sublists. This will enable easy comparison.

Both approaches work around the real problem, which is that sublist order should not matter when comparing rows; there are plenty of other ways to do the same thing.

Note that the naive pairwise loop (`for x in myList: for y in myList: if x == y: myList.remove(x)`) is broken: every element compares equal to itself, and removing from a list while iterating over it skips items. A correct version of approach 1 builds an order-insensitive key for each row and keeps only the first row seen with that key:

data = [['Type1', ['123', '22'], ['456', '80']],
        ['Type2', ['456', '80'], ['123', '22']]]

seen = set()
result = []
for row in data:
    # Convert each sublist to a tuple (hashable) and collect them in a
    # frozenset, so ['123','22'],['456','80'] and the reversed order
    # produce the same key.
    key = frozenset(tuple(sub) for sub in row[1:])
    if key not in seen:
        seen.add(key)
        result.append(row)

print(result)
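Approach 2 (sorting the sublists before comparing) can be sketched like this; `dedupe_reversed` and the sample `data` are illustrative names, not from the question. Sorting keeps duplicate sublists within a row distinct, which a set-based key would collapse:

```python
data = [
    ['Type1', ['123', '22'], ['456', '80']],
    ['Type2', ['456', '80'], ['123', '22']],   # reversed duplicate of row 1
    ['Type1', ['789', '45'], ['456', '80']],   # genuinely different
]

def dedupe_reversed(rows):
    """Keep the first row for each unordered combination of sublists."""
    seen = set()
    out = []
    for row in rows:
        # Sort the sublists (as tuples) so their order no longer matters.
        key = tuple(sorted(tuple(sub) for sub in row[1:]))
        if key not in seen:
            seen.add(key)
            out.append(row)
    return out

print(dedupe_reversed(data))
```

Both versions run in roughly O(n) time over the number of rows (plus the small cost of sorting each row's sublists), which is considerably faster than pairwise comparison of all rows.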
