How to iterate through a string and find the duplicated values

Question

My question is that I can't seem to find the right method for iterating through subjects2 and pick out the duplicated strings. Below is my method:

nosubjects = []
subjects2 = ["hi","hi","bi","ki","si","bi","li"]
for i in subjects2:
  if subjects2.count(i)==2:
    nosubjects.extend(i)
    print(nosubjects)

But when I print it out it appears like this:

['hi','hi']
['h', 'i', 'h', 'i','b', 'i']
['hi', 'i', 'h', 'i', 'b', 'i', 'b', 'i']

Please help thanks!

Answer 1

Use collections.Counter to get count of each element and take only those whose count exceeds 1:

from collections import Counter

subjects2 = ['hi', 'hi', 'bi', 'ki', 'si', 'bi', 'li']
nosubjects = [x for x, i in Counter(subjects2).items() if i > 1]

print(nosubjects)
# ['hi', 'bi']

Answer 2

Problems in your code:

You are trying to check the count of each element in the list, due to which duplicated elements will be checked multiple times.
You are printing nosubjects inside the if condition which will cause it to be printed multiple times

Use sets . First to get unique set of elements in the list, then you can check if the count of each element in the set exceeds 1 in the original list.

nosubjects = []
subjects2 = ['hi','hi','bi','ki','si','bi','li']

for i in set(subjects2):
  if subjects2.count(i)>=2:
    nosubjects.append(i)

print(nosubjects)

Using list comprehension:

subjects2 = ['hi','hi','bi','ki','si','bi','li']

nosubjects = [i for i in set(subjects2) if subjects2.count(i) >=2]    
print(nosubjects)

How to iterate through a string and find the duplicated values

Question

2 answers

solution1
1 2019-01-19 06:04:18

solution2
0 ACCPTED 2019-01-19 05:57:16

How to iterate through a string and find the duplicated values

Question

2 answers

solution1 1 2019-01-19 06:04:18

solution2 0 ACCPTED 2019-01-19 05:57:16

solution1
1 2019-01-19 06:04:18

solution2
0 ACCPTED 2019-01-19 05:57:16