簡體 English 中英

如何在Python中找到相同的序列

[英]How to find identical sequences in Python

原文 2014-10-16 02:37:18 3 2 python/ sequences

我是Python的新手，我想知道如何從Python中的Fasta文件中查找相同的序列。 例如，在這里我有4個記錄序列讀取，如何找到相同的序列並返回其ID？ 非常感謝你！！

from Bio import SeqIO
record=list(SeqIO.parse("data/dna.txt", "fasta"))
for i in range(0,len(record)):
    print record[i].id,record[i].seq


seq1 GAATGCATACTGCATCGATA
seq2 CATAAAACGTCTCCATCGCT
seq3 TGCCCAAGTTGTGAAGTGTC
seq4 TGCCCAAGTTGTGAAGTGTC

2 個解決方案

您可以使用defaultdict編譯每個序列的ID列表，如下所示：

from Bio import SeqIO
from collections import defaultdict
records=list(SeqIO.parse("data/dna.txt", "fasta"))
compilation = defaultdict(list)
for record in records:
    compilation[record.seq].append(record.id)

最簡單的方法是使用dict 。

from Bio import SeqIO
records = list(SeqIO.parse("data/dna.txt", "fasta"))
d = dict()
for record in records:
    if record.seq in d:
        d[record.seq].append(record)
    else:
        d[record.seq] = [record]
for seq, record_set in d.iteritems():
    print seq + ': (' + str(len(record_set)) + ')'
    for record in record_set:
        print '    ' + record.id

打印像：

GAATGCATACTGCATCGATA: (1)
    seq1
CATAAAACGTCTCCATCGCT: (1)
    seq2
TGCCCAAGTTGTGAAGTGTC: (2)
    seq3
    seq4

如何在python中查找單詞序列？

[英]how to find sequences of words in python?

如何在python中連續找到三個相同的值

[英]how to find three identical values in a row in python

如何在DataFrame中找到相同的行— python

[英]how find the identical rows in a DataFrame — python

在 numpy 數組中查找相同值序列的長度（運行長度編碼）

[英]find length of sequences of identical values in a numpy array (run length encoding)

Python在字符串中找到相似的序列

[英]Python find similar sequences in string

在Python的子列表中查找單詞序列

[英]Find sequences of words in sublists in Python

Python：如何在FASTA文件中查找短序列的坐標？

[英]Python: How to find coordinates of short sequences in a FASTA file?

如何在python中實現序列？

[英]how to implement sequences in python?

如何計算Python序列中重復數字的序列？

[英]How to count sequences of repeated numbers in Python sequences?

Python：如何查找列表中特定數量的項目是否相同？

[英]Python: How to find whether a specific number of items in a list are identical?

暫無

暫無

聲明:本站的技術帖子網頁，遵循CC BY-SA 4.0協議，如果您需要轉載，請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

相關問題 如何在python中查找單詞序列？如何在python中連續找到三個相同的值如何在DataFrame中找到相同的行— python 在 numpy 數組中查找相同值序列的長度（運行長度編碼） Python在字符串中找到相似的序列在Python的子列表中查找單詞序列 Python：如何在FASTA文件中查找短序列的坐標？如何在python中實現序列？如何計算Python序列中重復數字的序列？ Python：如何查找列表中特定數量的項目是否相同？

相關標簽

粵ICP備18138465號 © 2020-2024 STACKOOM.COM