Python - how to handle a dictionary with spaces
I have a list containing a dictionary, as shown below:
[{'DeltaG': -14.36, 'BasePairs': 8, 'Dimer': "5' TCAGATGTGTATAAGAGACAGGTGTAATCGTTCCGCTTGAATGTGANGCAAGAA\n : |||||||| : \n3' TAGTCACCTGCGTTCCTGACACTAGCGAGACAGAGAATATGTGTAGAGGCGAGCTAAGGTACTTGAAAGGGTGTATTAGAG"}
Printing the ['Dimer'] value looks like this:
5' TCAGATGTGTATAAGAGACAGGTGTAATCGTTCCGCTTGAATGTGANGCAAGAA
: |||||||| :
3' TAGTCACCTGCGTTCCTGACACTAGCGAGACAGAGAATATGTGTAGAGGCGAGCTAAGGTACTTGAAAGGGTGTATTAGAG
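I want to filter the dictionaries, accepting or ignoring each one depending on whether the | character in Dimer is not under the N (there is only one N, at the end of the upper string, and its position is always consistent).
I tried this solution, which works as long as there are no spaces before the upper sequence: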
for i in results:
    if i['Dimer'][107] != '|':
        print(i)
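My problem is that sometimes there are spaces before the upper string (as shown below), and then position 107 (i['Dimer'][107] != '|') is no longer correct. Can anyone help me with this?
# This is just a dummy example showing the structure: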
5' TCAGATGTGTATAAGAGACAGGTGTAATCGTTCCGCTTGAATGTGANGCAAGAA
: |||||||| :
3' TAGTCACCTGCGTTCCTGACACTAGCGAGACAGAGAATATGTGTAGAGGCGAGCTAAGGTACTTGAAAGGGTGTATTAGAG
Thank you.
You can use itertools.zip_longest:
d = [
{
"DeltaG": -14.36,
"BasePairs": 8,
"Dimer": "5' TCAGATGTGTATAAGAGACAGGTGTAATCGTTCCGCTTGAATGTGANGCAAGAA\n : |||||||| : \n3' TAGTCACCTGCGTTCCTGACACTAGCGAGACAGAGAATATGTGTAGAGGCGAGCTAAGGTACTTGAAAGGGTGTATTAGAG",
}
]
from itertools import zip_longest
# control print:
print(d[0]["Dimer"])
print()
print("-" * 80)
print()
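for l1, l2, l3 in zip_longest(*d[0]["Dimer"].split("\n")):
    # l3 is None when the third line is shorter than the first
    if l1 == "N" and l2 == "|" and l3 is not None and l3 in "TCGA":
        print('Character | under the "N": ', l1, l3)
        break
else:
    # for/else: runs only if the loop finished without hitting break
    print('No character | under the "N"')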
Prints:
5' TCAGATGTGTATAAGAGACAGGTGTAATCGTTCCGCTTGAATGTGANGCAAGAA
: |||||||| :
3' TAGTCACCTGCGTTCCTGACACTAGCGAGACAGAGAATATGTGTAGAGGCGAGCTAAGGTACTTGAAAGGGTGTATTAGAG
--------------------------------------------------------------------------------
Character | under the "N": N G
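To see why this approach does not depend on a fixed index such as 107, here is a minimal sketch on a made-up, shortened dimer (the sequences and spacing below are invented for illustration and are not taken from the question). zip_longest walks the three lines column by column and pads the shorter lines with None, so the check follows the N wherever the leading spaces push it.

```python
from itertools import zip_longest

# Hypothetical, shortened dimer; the leading spaces shift the upper strand.
top = " " * 3 + "5' ACGN"
middle = " " * 8 + ":|"
bottom = "3' TTGTACC"
dimer = "\n".join([top, middle, bottom])

# Transpose the three lines into per-column tuples (top, middle, bottom).
columns = list(zip_longest(*dimer.split("\n")))

# Pick the column whose top character is "N".
n_column = next(col for col in columns if col[0] == "N")
print(n_column)
```

Each tuple holds the characters that sit above one another in the printed dimer, so comparing the second element with "|" answers the question directly.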
Or, if there is only one "N":
l1, l2, l3 = d[0]["Dimer"].split("\n")
i = l1.index("N") if "N" in l1 else None
# guard against a missing "N" before comparing i with the line lengths
ch2 = l2[i] if i is not None and i < len(l2) else None
ch3 = l3[i] if i is not None and i < len(l3) else None
if ch2 == "|" and ch3 is not None and ch3 in "TCGA":
    print('Character | under the "N": ', ch3)
Prints:
Character | under the "N": G
EDIT: To check multiple items:
lst = [
{
"DeltaG": -14.36,
"BasePairs": 8,
"Dimer": "5' TCAGATGTGTATAAGAGACAGGTGTAATCGTTCCGCTTGAATGTGANGCAAGAA\n : |||||||| : \n3' TAGTCACCTGCGTTCCTGACACTAGCGAGACAGAGAATATGTGTAGAGGCGAGCTAAGGTACTTGAAAGGGTGTATTAGAG",
},
{
"DeltaG": -12.99,
"BasePairs": 6,
"Dimer": "5' TCAGATGTGTATAAGAGACAGGTGTAATCGTTCCGCTTGAATGTGANGCAAGAA\n :: : |||||| :: : : \n3' TAGTCACCTGCGTTCCTGACACTAGCGAGACAGAGAATATGTGTAGAGGCGAGCTAAGGTACTTGAAAG",
},
]
def is_pipe_under_N(item):
    l1, l2, l3 = item["Dimer"].split("\n")
    i = l1.index("N") if "N" in l1 else None
    if i is None:
        # no "N" in the upper strand, so nothing can be under it
        return False
    ch2 = l2[i] if i < len(l2) else None
    ch3 = l3[i] if i < len(l3) else None
    return ch2 == "|" and ch3 is not None and ch3 in "TCGA"

for item in lst:
    if not is_pipe_under_N(item):
        print(item["DeltaG"])
        break
Prints:
-12.99
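If the goal is to keep or discard whole entries rather than print the first mismatch, the same check can drive a list comprehension. The following is a minimal sketch, assuming every "Dimer" value has exactly three lines. The helper name pipe_under_n, the "name" key and the sample records are hypothetical, and the check is simplified to look only at the middle line.

```python
def pipe_under_n(dimer):
    """Return True if a '|' sits directly below the first 'N' of the top line."""
    top, middle, _bottom = dimer.split("\n")
    i = top.find("N")  # str.find returns -1 when there is no "N"
    return i != -1 and i < len(middle) and middle[i] == "|"


# Made-up records: leading spaces move the "N" to a different column each time.
records = [
    {"name": "paired", "Dimer": " " * 2 + "5' ACGN\n" + " " * 8 + "|\n3' TTGTACC"},
    {"name": "shifted", "Dimer": " " * 5 + "5' ACGN\n" + " " * 8 + "|\n3' TTGTACC"},
    {"name": "no_n", "Dimer": "5' ACGT\n" + " " * 3 + "|\n3' TGCA"},
]

# Keep only the entries where the "|" is NOT under the "N".
kept = [r["name"] for r in records if not pipe_under_n(r["Dimer"])]
print(kept)
```

Using str.find instead of str.index avoids the None bookkeeping, at the cost of having to test for -1 explicitly.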