
Python - how to handle a dictionary with spaces

I have a list that contains a dictionary, as shown below:

[{'DeltaG': -14.36, 'BasePairs': 8, 'Dimer': "5' TCAGATGTGTATAAGAGACAGGTGTAATCGTTCCGCTTGAATGTGANGCAAGAA\n                                           :   |||||||| :                                     \n3'                                      TAGTCACCTGCGTTCCTGACACTAGCGAGACAGAGAATATGTGTAGAGGCGAGCTAAGGTACTTGAAAGGGTGTATTAGAG"}]

['Dimer'] looks like this:

5' TCAGATGTGTATAAGAGACAGGTGTAATCGTTCCGCTTGAATGTGANGCAAGAA
                                           :   |||||||| :                                     
3'                                      TAGTCACCTGCGTTCCTGACACTAGCGAGACAGAGAATATGTGTAGAGGCGAGCTAAGGTACTTGAAAGGGTGTATTAGAG

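I want to filter the dictionaries, accepting or ignoring each one depending on whether a | character in Dimer sits under the N (there is only one N, near the end of the upper string, and its position within that sequence is always the same).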

I tried this solution, which works as long as there are no extra spaces before the upper sequence:

for i in results:
    if i['Dimer'][107] != '|':
        print(i)

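My problem is that sometimes there are extra spaces before the upper string (as shown below), and then position 107 (i['Dimer'][107] != '|') is no longer correct. Can anyone help me with this?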

# This is just a dummy example to show the structure:

5'      TCAGATGTGTATAAGAGACAGGTGTAATCGTTCCGCTTGAATGTGANGCAAGAA
                                           :   |||||||| :                                     
3'                                      TAGTCACCTGCGTTCCTGACACTAGCGAGACAGAGAATATGTGTAGAGGCGAGCTAAGGTACTTGAAAGGGTGTATTAGAG

Thank you.

You can use itertools.zip_longest to walk the three lines column by column:

d = [
    {
        "DeltaG": -14.36,
        "BasePairs": 8,
        "Dimer": "5' TCAGATGTGTATAAGAGACAGGTGTAATCGTTCCGCTTGAATGTGANGCAAGAA\n                                           :   |||||||| :                                     \n3'                                      TAGTCACCTGCGTTCCTGACACTAGCGAGACAGAGAATATGTGTAGAGGCGAGCTAAGGTACTTGAAAGGGTGTATTAGAG",
    }
]

from itertools import zip_longest

# control print:
print(d[0]["Dimer"])
print()
print("-" * 80)
print()

# fillvalue=" " pads the shorter lines so that l3 is never None in the test below
for l1, l2, l3 in zip_longest(*d[0]["Dimer"].split("\n"), fillvalue=" "):
    if l1 == "N" and l2 == "|" and l3 in "TCGA":
        print('Character | under the "N": ', l1, l3)
        break
else:
    print('No character | under the "N"')

Prints:

5' TCAGATGTGTATAAGAGACAGGTGTAATCGTTCCGCTTGAATGTGANGCAAGAA
                                           :   |||||||| :                                     
3'                                      TAGTCACCTGCGTTCCTGACACTAGCGAGACAGAGAATATGTGTAGAGGCGAGCTAAGGTACTTGAAAGGGTGTATTAGAG

--------------------------------------------------------------------------------

Character | under the "N":  N G
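
To make the column-wise idea concrete, here is a minimal sketch on a made-up, much shorter three-line string (the data and the names rows, columns and n_columns are illustrative assumptions, not part of the original data). Splitting on "\n" gives three rows, and zip_longest yields one tuple per column, so the characters above and below each other end up together regardless of how many leading spaces a row has:

```python
from itertools import zip_longest

# Toy three-line "dimer" (made-up data, far shorter than the real strings).
rows = ["5' ACNT", "     | ", "3' TTGAAC"]

# Each tuple is one column: (top char, middle char, bottom char).
# fillvalue=" " pads the shorter rows so no tuple contains None.
columns = list(zip_longest(*rows, fillvalue=" "))

# Keep only the columns whose top character is "N".
n_columns = [col for col in columns if col[0] == "N"]
print(n_columns)  # [('N', '|', 'G')]
```

Because the comparison happens per column, no hard-coded offset such as 107 is needed.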

Or, if there is only one "N":

l1, l2, l3 = d[0]["Dimer"].split("\n")

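# index of the "N" in the upper line, or None if there is no "N"
i = l1.index("N") if "N" in l1 else None
# guard against i being None before comparing it with the line lengths
ch2 = l2[i] if i is not None and i < len(l2) else None
ch3 = l3[i] if i is not None and i < len(l3) else None

if ch2 == "|" and ch3 is not None and ch3 in "TCGA":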
    print('Character | under the "N": ', ch3)

Prints:

Character | under the "N":  G

Edit: To check multiple items:

lst = [
    {
        "DeltaG": -14.36,
        "BasePairs": 8,
        "Dimer": "5' TCAGATGTGTATAAGAGACAGGTGTAATCGTTCCGCTTGAATGTGANGCAAGAA\n                                           :   |||||||| :                                     \n3'                                      TAGTCACCTGCGTTCCTGACACTAGCGAGACAGAGAATATGTGTAGAGGCGAGCTAAGGTACTTGAAAGGGTGTATTAGAG",
    },
    {
        "DeltaG": -12.99,
        "BasePairs": 6,
        "Dimer": "5'                TCAGATGTGTATAAGAGACAGGTGTAATCGTTCCGCTTGAATGTGANGCAAGAA\n                       ::                   :    |||||| :: :    :       \n3' TAGTCACCTGCGTTCCTGACACTAGCGAGACAGAGAATATGTGTAGAGGCGAGCTAAGGTACTTGAAAG",
    },
]


def is_pipe_under_N(item):
    l1, l2, l3 = item["Dimer"].split("\n")

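    # index of the "N" in the upper line, or None if there is no "N"
    i = l1.index("N") if "N" in l1 else None
    # guard against i being None before comparing it with the line lengths
    ch2 = l2[i] if i is not None and i < len(l2) else None
    ch3 = l3[i] if i is not None and i < len(l3) else None

    return ch2 == "|" and ch3 is not None and ch3 in "TCGA"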


for item in lst:
    if not is_pipe_under_N(item):
        print(item["DeltaG"])
        break

Prints:

-12.99
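
Since the question asks to accept or ignore whole dictionaries, the same check can also drive a filter instead of stopping at the first mismatch. The following is only a sketch under assumptions: the helper name pipe_under_n and the shortened records are hypothetical, and it checks only for the | under the first "N" (not the base on the third line):

```python
def pipe_under_n(dimer):
    """Return True if a '|' sits directly below the first 'N' of the top line."""
    lines = dimer.split("\n")
    if len(lines) != 3:
        return False  # unexpected layout: treat as "no pipe under N"
    top, middle, _bottom = lines
    i = top.find("N")  # -1 when there is no "N"
    return 0 <= i < len(middle) and middle[i] == "|"


# Made-up, shortened records for illustration only.
records = [
    {"DeltaG": -1.0, "Dimer": "5' ACNT\n     | \n3' TTGAAC"},
    {"DeltaG": -2.0, "Dimer": "5'    ACNT\n     |    \n3' TTGAACGG"},
]

# Split the list into items with and without a pipe under the "N".
accepted = [r for r in records if pipe_under_n(r["Dimer"])]
rejected = [r for r in records if not pipe_under_n(r["Dimer"])]

print([r["DeltaG"] for r in accepted])  # [-1.0]
print([r["DeltaG"] for r in rejected])  # [-2.0]
```

In the second record the extra leading spaces shift the "N" to the right, so the | is no longer under it and the item is rejected.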

