Iterating through the sequences in a FASTA file with BioPython

Question

I'm new to BioPython and I'm trying to import a fasta/fastq file and iterate through each sequence, performing some operation on each one. I know this seems basic, but for some reason my code below is not printing correctly.

from Bio import SeqIO

newfile = open("new.txt", "w")
records = list(SeqIO.parse("rosalind_gc.txt", "fasta"))

i = 0
dna = records[i]

while i <= len(records):
    print (dna.name)
    i = i + 1

I'm basically trying to iterate through records and print each name, but my code only ever prints the name of records[0], where I want it to print the names of records[1] through records[10] as well. Can someone explain why it only prints records[0]?

Answer 1

The reason for your problem is here:

i = 0
dna = records[i]

Your object 'dna' is bound to index 0 of records, i.e., records[0]. Since you never assign it again, dna keeps referring to that first record on every pass through the loop. In the print statement inside your while loop, index into records directly, like this:

# Use < rather than <=: the last valid index is len(records) - 1
while i < len(records):
    print(records[i].name)
    i = i + 1
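To see why the loop in the question kept printing the same name, here is a minimal sketch in which plain strings stand in for the parsed records (the names seq_a, seq_b and seq_c are made up for illustration). The assignment to dna runs once, before the loop, so changing i afterwards has no effect on it:

```python
# Plain strings stand in for SeqRecord objects (hypothetical names).
records = ["seq_a", "seq_b", "seq_c"]

i = 0
dna = records[i]  # evaluated once: dna now refers to records[0]

seen = []
while i < len(records):
    seen.append(dna)  # dna is never reassigned, so this is always records[0]
    i = i + 1

print(seen)
```

The list collects the first entry three times, which mirrors the repeated name in the question's output.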

If you would like a variable dna that refers to the current record, you need to reassign dna at every index, inside your while loop, like this:

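# Use < rather than <=: the last valid index is len(records) - 1
while i < len(records):
    dna = records[i]  # rebind dna to the current record on every pass
    print(dna.name)
    i = i + 1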

However, that's not the most idiomatic way. Finally, for you to learn, a much nicer way than a while loop with i = i + 1 is to use a for loop, like this:

for i in range(0,len(records)):
    print (records[i].name)

A for loop does the iteration automatically, one item at a time. range(0, len(records)) yields the integers from 0 up to, but not including, len(records). There are also other ways, but I'm keeping it simple.
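One of those other ways is to loop over the records themselves and skip the index entirely. The sketch below is only an illustration: record_names is a hypothetical helper, and SimpleNamespace objects with made-up names stand in for the parsed records so that the snippet runs without a FASTA file. With Biopython installed, the iterator returned by SeqIO.parse("rosalind_gc.txt", "fasta") could be passed in the same way, without first wrapping it in list().

```python
from types import SimpleNamespace


def record_names(records):
    """Return the .name of each record by iterating directly, with no index."""
    return [record.name for record in records]


# Stand-in records (hypothetical names). In the real script this would be
# SeqIO.parse("rosalind_gc.txt", "fasta") from "from Bio import SeqIO".
demo_records = [
    SimpleNamespace(name="Rosalind_0001"),
    SimpleNamespace(name="Rosalind_0002"),
]

for name in record_names(demo_records):
    print(name)
```

Iterating directly avoids the off-by-one risk of index arithmetic, and it also works on iterators that do not support len().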
