Python - how to handle a dictionary with spaces
I have a list that contains a dictionary, as shown below:
[{'DeltaG': -14.36, 'BasePairs': 8, 'Dimer': "5' TCAGATGTGTATAAGAGACAGGTGTAATCGTTCCGCTTGAATGTGANGCAAGAA\n : |||||||| : \n3' TAGTCACCTGCGTTCCTGACACTAGCGAGACAGAGAATATGTGTAGAGGCGAGCTAAGGTACTTGAAAGGGTGTATTAGAG"}
The value stored under ['Dimer'] looks like this:
5' TCAGATGTGTATAAGAGACAGGTGTAATCGTTCCGCTTGAATGTGANGCAAGAA
: |||||||| :
3' TAGTCACCTGCGTTCCTGACACTAGCGAGACAGAGAATATGTGTAGAGGCGAGCTAAGGTACTTGAAAGGGTGTATTAGAG
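I want to filter the list, accepting or ignoring a dictionary depending on whether the | character in Dimer is not under the N (the upper string has only one N, near its end, and its position is always the same).
I tried this solution, which works as long as there are no spaces before the upper sequence: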
for i in results:
    if i['Dimer'][107] != '|':
        print(i)
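My problem is that sometimes there are spaces before the upper string (as shown below), and then position 107
(i['Dimer'][107] != '|') is no longer correct. Can anyone help me with this?
# This is just a dummy example showing the structure: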
5' TCAGATGTGTATAAGAGACAGGTGTAATCGTTCCGCTTGAATGTGANGCAAGAA
: |||||||| :
3' TAGTCACCTGCGTTCCTGACACTAGCGAGACAGAGAATATGTGTAGAGGCGAGCTAAGGTACTTGAAAGGGTGTATTAGAG
Thank you.
You can use itertools.zip_longest:
d = [
{
"DeltaG": -14.36,
"BasePairs": 8,
"Dimer": "5' TCAGATGTGTATAAGAGACAGGTGTAATCGTTCCGCTTGAATGTGANGCAAGAA\n : |||||||| : \n3' TAGTCACCTGCGTTCCTGACACTAGCGAGACAGAGAATATGTGTAGAGGCGAGCTAAGGTACTTGAAAGGGTGTATTAGAG",
}
]
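from itertools import zip_longest

# control print:
print(d[0]["Dimer"])
print()
print("-" * 80)
print()

# Walk the three lines column by column; shorter lines are padded with
# spaces so a missing character never compares as None.
for l1, l2, l3 in zip_longest(*d[0]["Dimer"].split("\n"), fillvalue=" "):
    if l1 == "N" and l2 == "|" and l3 in "TCGA":
        print('Character | under the "N": ', l1, l3)
        break
else:
    # The for/else branch runs only when the loop finishes without break.
    print('No character | under the "N"')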
Prints:
5' TCAGATGTGTATAAGAGACAGGTGTAATCGTTCCGCTTGAATGTGANGCAAGAA
: |||||||| :
3' TAGTCACCTGCGTTCCTGACACTAGCGAGACAGAGAATATGTGTAGAGGCGAGCTAAGGTACTTGAAAGGGTGTATTAGAG
--------------------------------------------------------------------------------
Character | under the "N": N G
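To make the column-wise idea easier to see, here is a minimal sketch on a made-up toy string (the `toy` value and the variable names are hypothetical, not taken from the question). It assumes the three lines are aligned column by column, and it passes `fillvalue=" "` so that shorter lines are padded with spaces.

```python
from itertools import zip_longest

# Hypothetical toy dimer: the top strand is shifted right by two spaces,
# and the three lines have different lengths (6, 5 and 7 characters).
toy = "  ACNT\n    |\nTTGGGAC"

lines = toy.split("\n")

# Each tuple is one column: (top strand, pairing line, bottom strand).
columns = list(zip_longest(*lines, fillvalue=" "))

# Keep only the columns where the top strand holds "N".
n_columns = [col for col in columns if col[0] == "N"]
print(n_columns)  # [('N', '|', 'G')]
```

Because the comparison is done per column, the leading spaces on the top strand do not matter: the `N`, the `|` and the base below it still end up in the same tuple.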
Or, if there is only one "N":
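l1, l2, l3 = d[0]["Dimer"].split("\n")
# Column of the single "N" in the upper string, or None if there is none.
i = l1.index("N") if "N" in l1 else None
# Check for None first: comparing None with an int raises TypeError.
ch2 = l2[i] if i is not None and i < len(l2) else None
ch3 = l3[i] if i is not None and i < len(l3) else None
if ch2 == "|" and ch3 is not None and ch3 in "TCGA":
    print('Character | under the "N": ', ch3)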
Prints:
Character | under the "N": G
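The sketch below illustrates why a fixed index into the whole string stops working once the upper string is shifted, while looking up the column of the `N` does not. The helper `char_under_n` and both toy strings are hypothetical and much shorter than the real data; the only assumption is that the lines are aligned column by column.

```python
def char_under_n(dimer):
    """Return the pairing-line character in the column of the first "N", or None."""
    top, pairing, _bottom = dimer.split("\n")
    col = top.find("N")  # -1 when the top strand has no "N"
    if col == -1 or col >= len(pairing):
        return None
    return pairing[col]


# Hypothetical toy dimers; in the second one the top strand is shifted right.
unshifted = "ACNT\n  | \nTTGA"
shifted = "   ACNT\n     | \nGGGTTGA"

print(char_under_n(unshifted))  # |
print(char_under_n(shifted))    # |

# A fixed index into the whole string only fits one of the two layouts:
# index 7 is the "|" in the first string but the newline in the second.
print(repr(unshifted[7]), repr(shifted[7]))
```

Here `str.find` is used instead of `str.index` so that a missing `N` yields `-1` rather than an exception; either approach should work as long as the missing case is handled before indexing.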
Edit: to check multiple items:
lst = [
{
"DeltaG": -14.36,
"BasePairs": 8,
"Dimer": "5' TCAGATGTGTATAAGAGACAGGTGTAATCGTTCCGCTTGAATGTGANGCAAGAA\n : |||||||| : \n3' TAGTCACCTGCGTTCCTGACACTAGCGAGACAGAGAATATGTGTAGAGGCGAGCTAAGGTACTTGAAAGGGTGTATTAGAG",
},
{
"DeltaG": -12.99,
"BasePairs": 6,
"Dimer": "5' TCAGATGTGTATAAGAGACAGGTGTAATCGTTCCGCTTGAATGTGANGCAAGAA\n :: : |||||| :: : : \n3' TAGTCACCTGCGTTCCTGACACTAGCGAGACAGAGAATATGTGTAGAGGCGAGCTAAGGTACTTGAAAG",
},
]
def is_pipe_under_N(item):
    l1, l2, l3 = item["Dimer"].split("\n")
    # Without an "N" in the upper string there is nothing to check.
    if "N" not in l1:
        return False
    i = l1.index("N")
    ch2 = l2[i] if i < len(l2) else None
    ch3 = l3[i] if i < len(l3) else None
    return ch2 == "|" and ch3 is not None and ch3 in "TCGA"


# Print DeltaG of the first item that has no "|" under the "N".
for item in lst:
    if not is_pipe_under_N(item):
        print(item["DeltaG"])
        break
Prints:
-12.99
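If the goal is to split the whole list rather than stop at the first match, a list comprehension is one possible way to do it. This is only a sketch: `pipe_under_n`, `results`, and the toy `Dimer` strings are hypothetical, and which side counts as "accepted" is an assumption based on the loop in the question, which prints items where the character under the `N` is not `|`.

```python
def pipe_under_n(item):
    """True when a "|" sits under the first "N" and a base sits below it."""
    top, pairing, bottom = item["Dimer"].split("\n")
    col = top.find("N")  # -1 when the top strand has no "N"
    if col == -1 or col >= len(pairing) or col >= len(bottom):
        return False
    return pairing[col] == "|" and bottom[col] in "TCGA"


# Hypothetical toy data; the real Dimer strings are much longer.
results = [
    {"DeltaG": -1.0, "Dimer": "  ACNT\n    | \nTTGGGA"},  # "|" under the N
    {"DeltaG": -2.0, "Dimer": "ACNT\n |  \nTTGA"},        # no "|" under the N
]

# Assumption: items WITHOUT a "|" under the "N" are the ones to keep.
accepted = [item for item in results if not pipe_under_n(item)]
rejected = [item for item in results if pipe_under_n(item)]

print([item["DeltaG"] for item in accepted])  # [-2.0]
print([item["DeltaG"] for item in rejected])  # [-1.0]
```

Swapping the two conditions inverts the filter if the opposite behaviour is wanted.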