如何从同一函数中的两个循环获得结果？

Question

I have a function that calculates an "either or" type of list for my data. 我有一个函数可以为我的数据计算列表的“或”类型。

keyword = sys.argv[1]  # a name from the Name column

def exon_coords():
    exon_start_plus = [] # Plus strand coordinates
    exon_start_minus = [] # Minus strand coordinates
    for line in csv.reader(sys.stdin, csv.excel_tab):
        if len(line) >= 1:
            if re.search(keyword, str(line)): # If arg keyword exists in file
                if line[3] == "-": # If the DNA strand is a minus strand
                    chrompos = line[0] + ";" # Get the chromosome position
                    exon_start_minus.append(chrompos+line[1]) # Full exon position
                else: # all other lines are plus strands
                    chrompos = line[0] + ";" 
                    exon_start_plus.append(chrompos+line[1])

return exon_start_minus, exon_start_plus #Return lists

Goal is to then write an output text file with the coordinates. 目的是编写一个带有坐标的输出文本文件。

with open(keyword+"_plus.txt", "w") as thefile:
    for item in exon_start_plus:
        thefile.write("{}, ".format(item))

OR if the keyword resulted in MINUS strands: 或者，如果关键字产生了MINUS股：

with open(keyword+"_minus.txt", "w") as thefile:
    for item in exon_start_minus:
        thefile.write("{}, ".format(item))

I tried putting these write files within the code but then the return functions just would not give me the full list, and I end up only writing one coordinate every time. 我尝试将这些写文件放入代码中，但是返回函数将无法提供完整列表，而我每次只能写一个坐标。 I put them at the end, but this results in empty files and empty strings - I would like to keep this as one function and have it determine if the keyword (ie a gene ID) has coordinates given for plus/minus strand (I have a gigantic data file that contains this data and the point is to not manually scan the IDs and see if they are plus/minus DNA strands). 我将它们放在最后，但是这导致空文件和空字符串-我想将其保留为一个函数，并让它确定关键字（即基因ID）是否具有给定正负链的坐标（我有包含此数据的巨大数据文件，目的是不要手动扫描ID，看看它们是否是正负DNA链）。

Thank you! 谢谢！

EDIT (sample data, had to remove some columns so I edited the code as well): 编辑（示例数据，必须删除一些列，所以我也编辑了代码）：

Position    Start   End Strand  Overhang    Name
1   3798630 3798861 +   .   ENSPFOG0000001
1   3799259 3799404 +   .   ENSPFOG0000001
1   3809992 3810195 +   .   ENSPFOG0000001
1   3810582 3810729 +   .   ENSPFOG0000001
2   4084800 4084866 -   .   ENSPFOG0000002
2   4084466 4084566 -   .   ENSPFOG0000002
2   4084089 4084179 -   .   ENSPFOG0000002

So if I use ENSPFOG0000001 as my keyword, then the script should run and determine that the strands are plus, collect the start coordinates in a list and then output a file that just has the coordinates. 因此，如果我将ENSPFOG0000001用作关键字，则脚本应运行并确定线束为正，将起始坐标收集在列表中，然后输出仅包含坐标的文件。 The file would have keyword+"_plus.txt" appended. 该文件将附加关键字+“ _ plus.txt”。 If it was ENSPFOG0000002, then it would collect the minus strand coordinates, and create a file where keyword+"_minus.txt" is created. 如果是ENSPFOG0000002，则它将收集负链坐标，并创建一个文件，其中创建了关键字+“ _ minus.txt”。

Answer 1

An empty list evaluates to False: 一个空列表的结果为False：

>>> exon_start_minus, exon_start_plus = [], []
>>> bool(exon_start_minus), bool(exon_start_plus)
(False, False)
>>> exon_start_minus, exon_start_plus = [1], []
>>> bool(exon_start_minus), bool(exon_start_plus)
(True, False)
>>> exon_start_minus, exon_start_plus = [1], [1]
>>> bool(exon_start_minus), bool(exon_start_plus)
(True, True)
>>> exon_start_minus, exon_start_plus = [], [1]
>>> bool(exon_start_minus), bool(exon_start_plus)
(False, True)

So you can test for an empty list and take action as appropriate 因此，您可以测试一个空列表并采取适当的措施

>>> if exon_start_plus:
    print('!!!')

!!!
>>> if exon_start_minus:
    print('!!!')

>>>

To retrieve both lists from the function: 要从函数中检索两个列表：

exon_start_minus, exon_start_plus =  exon_coords()

如何从同一函数中的两个循环获得结果？

问题描述

1 个解决方案

解决方案1
0 已采纳 2017-07-24 19:56:41

如何从同一函数中的两个循环获得结果？

问题描述

1 个解决方案

解决方案1 0 已采纳 2017-07-24 19:56:41

解决方案1
0 已采纳 2017-07-24 19:56:41