繁体   English   中英

如何将排序列表分组为该列表连续元素的起点和终点的元组?

[英]How can I group a sorted list into tuples of start and endpoints of consecutive elements of this list?

假设我的排序列表是这样的:

L = ["01-string","02-string","03-string","05-string","07-string","08-string"]

如您所见,此列表已排序。 我现在想要这个列表中每个连续字符串块的起点和终点,例如,output 应该是:

L_continuous = [("01-string", "03-string"),("05-string","05-string"),("07-string","08-string")]

所以,为了澄清,我需要一个元组列表,在每个元组中,我需要列表中每个连续块的开始和结束。 因此,例如,我的列表中的元素 0、1 和 2 是连续的,因为 01、02、03 是连续的数字 - 所以起点和终点将是“01-string”和“03-string”。

数字 1-3 是连续的,因此它们形成了一个块,而 5 在列表中没有任何连续的数字,因此 forms 本身就是一个块。

不是单线,但这样的事情可能会起作用:

L = ["01-string","02-string","03-string","05-string","07-string","08-string"]
counter = None
# lastNum = None
firstString = ""
lastString = ""
L_continuous = list()
for item in L:
    currentNum = int(item[0:2])
    if counter is None:
        # startTuple
        firstString = item
        counter = currentNum
        lastString = item
        continue
    if counter + 1 == currentNum:
        # continuation of block
        lastString = item
        counter += 1
        continue
    if currentNum > counter + 1:
        # end of block
        L_continuous.append((firstString,lastString))
        firstString = item
        counter = currentNum
        lastString = item
        continue
    else:
        print ('error - not sorted or unique numbers')
# add last block
L_continuous.append((firstString,lastString))

print(L_continuous)

首先要做的是从字符串数据中提取一个 int,这样我们就可以比较连续的数字

def extract_int(s):
    return int(s.split('-')[0])

然后一个直接的解决方案是跟踪最后看到的数字,并在与前一个不连续时发出一个新块。 在循环结束时,我们需要发出一个“剩余”块。

def group_by_blocks(strs):
    blocks = []
    last_s = first_s = strs[0]
    last_i = extract_int(last_s)

    for s in strs[1:]:
        i = extract_int(s)
        if i != last_i + 1:
            blocks.append( (first_s, last_s) )
            first_i, first_s = i, s
        last_i, last_s = i, s

    blocks.append( (first_s, last_s) )
    return blocks

例子:

>>> group_by_blocks(L)
[('01-string', '03-string'), ('05-string', '05-string'), ('07-string', '08-string')]

暂无
暂无

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM