如何在字典中查找字符串？

Question

I've got a file that contains a list of 3-letter strings (codons for those who know biology) in a column. 我有一个文件，该文件在一列中包含3个字母的字符串（对于那些了解生物学的人来说是密码子）的列表。 In my program, I have constructed a dictionary, where each particular string corresponds to a designated letter (amino acids for those who know biology). 在我的程序中，我构建了一个字典，其中每个特定的字符串都对应一个指定的字母（对于那些了解生物学的人来说是氨基酸）。 So, I want my program to go through this entire list of strings/codons and for each codon, I want the program to look it up on the dictionary and output which letter that given codon/string corresponds to. 因此，我希望我的程序遍历整个字符串/密码子列表，对于每个密码子，我希望程序在词典中查找并输出给定密码子/字符串对应的字母。 Unfortunately, I don't have much experience using dictionaries, so I'm not sure how to look it up. 不幸的是，我没有太多使用字典的经验，所以我不确定如何查找它。 I've tried something but I keep getting errors. 我已经尝试过一些东西，但是我总是出错。 The variable 'new_codon' contains the list of strings/amino acids from the file that I'm using. 变量“ new_codon”包含我正在使用的文件中的字符串/氨基酸列表。 Here's what I've got so far: 到目前为止，这是我得到的：

codon_lookup = {'GCT': 'A', 'GCC': 'A','GCA': 'A','GCG': 'A', 'TGT':'C','TGC':'C',    'GAT':'D','GAC': 'D', 'GAA':'E','GAG': 'E', 'TTT':'F','TTC': 'F', 'GGT': 'G','GGC': 'G','GGA':'G','GGG': 'G', 'CAT':'H','CAC': 'H', 'ATT':'I','ATC':'I','ATA':'I','AAA':'K','AAG':'K', 'TTA': 'L','TTG': 'L','CTT': 'L','CTC': 'L','CTA': 'L','CTG': 'L', 'ATG': 'M', 'AAT':'N','AAC':'N', 'CCT': 'P','CCC': 'P','CCA': 'P','CCG': 'P', 'CAA': 'Q','CAG': 'Q', 'CGT': 'R','CGC': 'R','CGA': 'R','CGG': 'R','AGA': 'R','AGG': 'R', 'TCT': 'S','TCC': 'S','TCA': 'S','TCG': 'S','AGT': 'S','AGC': 'S', 'ACT': 'T','ACC': 'T','ACA': 'T','ACG': 'T', 'GTT': 'V','GTC': 'V','GTA': 'V','GTG': 'V', 'TGG':'W', 'TAT':'Y', 'TAC':'Y', 'TAA': 'Z', 'TAG': 'Z', 'TGA':'Z'}

for x in new_codon:
   codon_lookup[x]
   if codon_lookup[x] == ref_aa[x]: # Here I'm comparing it to another list I have from another file to see if they match or don't match 
             print "1"
   else: 
           print "0"

Answer 1

This will solve your KeyError 这将解决您的KeyError

for x in new_codon:
    x = x.rstrip() # remove line seperators
    ...

For your question in the comments. 对于您在评论中的问题。

for x, aa in zip(new_codon, ref_aa):
    x = x.rstrip() # remove line seperators

    if codon_lookup[x] == aa: 
        print "1"
    else: 
        print "0"

Answer 2

To check if value in a dictionary use 'is': 要检查字典中的值是否使用'is'：

for x in new_codon:
   if x in codon_lookup:
             print "1"
   else: 
           print "0"

Answer 3

You get a KeyError if the element you asked is NOT in the dictionary. 如果您要求的元素KeyError词典中，则会得到一个KeyError 。 And you are asking for "ATC\\r\\n" instead of "ATC" . 并且您要使用"ATC\\r\\n"而不是"ATC" 。 The problem is not in this part of the code. 问题不在代码的这一部分。 You are just reading the new_codon with endline characters. 您正在阅读带有结尾字符的new_codon 。
All you have to do is add a simple statement to remove endline characters from the end of your string x . 您所要做的就是添加一个简单的语句，以从字符串x的末尾删除末尾字符。

codon_lookup = {'GCT': 'A', 'GCC': 'A',...}

for x in new_codon:
   #This statement(`codon_lookup[x]`) was pointless
   x = x[:3] # Removes the part after the third character
   if codon_lookup[x] == ref_aa[x]: # Here I'm comparing it to another list I have from another file to see if they match or don't match 
           print "1"
   else: 
           print "0"

If ref_aa is a list, you will get TypeError of course. 如果ref_aa是一个列表，那么您当然会得到TypeError 。 x is a string, ref_aa is a list; x是字符串， ref_aa是列表； you cannot use ref_aa[x] . 您不能使用ref_aa[x] 。 To fix this issue, you can use enumerate ( docs for enumerate ): 要解决此问题，您可以使用enumerate （ enumerate docs ）：

codon_lookup = {'GCT': 'A', 'GCC': 'A',...}

for i,x in enumerate(new_codon):
   x = x[:3] # Removes the part after the third character
   if codon_lookup[x] == ref_aa[i]: # Changed the 'x' with 'i' for list 
           print "1"
   else: 
           print "0"

如何在字典中查找字符串？

问题描述

3 个解决方案

解决方案1
2 2013-11-21 00:28:26

解决方案2
1 2013-11-21 00:18:36

解决方案3
1 已采纳 2013-11-21 00:29:47

如何在字典中查找字符串？

问题描述

3 个解决方案

解决方案1 2 2013-11-21 00:28:26

解决方案2 1 2013-11-21 00:18:36

解决方案3 1 已采纳 2013-11-21 00:29:47

解决方案1
2 2013-11-21 00:28:26

解决方案2
1 2013-11-21 00:18:36

解决方案3
1 已采纳 2013-11-21 00:29:47