Python：在遍历i时将范围添加到范围列表中

# Define a function that test if 2 intervals overlap def overlap(start1, end1, start2, end2): return not (end1 < start2 or end2 < start1) best_list = [] # Create a list that will store the best intervals best_list.append([sort[0][0],sort[0][1]]) # Append the first interval of the sorted list # Loop through the sorted list for line in sort: local_start, local_end = line.rsplit("\s",1)[0].split() for i in range(len(best_list)): best_start = best_list[i][0] best_end = best_list[i][1] test = overlap(int(best_start), int(best_end), int(local_start), int(local_end)) if test is False: best_list.append([local_start, local_end])

2条回答

网友

1楼 · 编辑于 2024-04-26 00:31:50

假设您解析csv，并且已经有一个列表，其中[(start, stop, index), ....]为[(int, int, float), ...]，那么您可以按以下方式对其排序：

from operator import itemgetter
data = sorted(data, key=itemgetter(2), reverse=True)

这意味着您按第三个位置排序，并按从最大到最小的相反顺序返回结果

def nonoverlap(data):
    result = [data[0]]
    for cand in data[1:]:
        start, stop, _ = cand
        current_span = range(start, stop+1)
        for item in result:
            i, j, _ = item
            span = range(i, j+1)
            if (start in span) or (stop in span):
                break
            elif (i in current_span) or (j in current_span):
                break
        else:
            result.append(cand)
    return result

然后使用上述函数，您将获得所需的结果。对于提供的代码段，您将获得[(504, 789, 9.68), (1024, 1257, 7.52)]。我在这里使用一个事实，即可以使用1 in range(0, 10)，它将返回True。虽然这是一个幼稚的实现，但您可以将其作为一个起点。如果只想返回开始和停止请将返回行替换为return [i[:2] for i in result]。你知道吗

注意：我还想补充一点，您的代码有一个逻辑错误。您在每次比较之后都会做出决定，但必须在与您的best_list中已经存在的所有元素进行比较之后做出决定。这就是为什么(504, 789)和(1027, 1305)通过了测试，但不应该通过。我希望这张纸条能对你有所帮助。你知道吗

网友

2楼 · 编辑于 2024-04-26 00:31:50

嗯，我对你的密码有点疑问。既然sort包含字符串，那么这一行append([sort[0][0],sort[0][1]])会做什么呢？你知道吗

总之，主要的问题是，当列表中存在多个元素时，只要其中一个元素通过重叠测试就足以添加到列表中（而不是您想要的）。E、当两个(504, 789),(1024, 1257)都存在时，(1027, 1305)将被插入到列表中，因为与(504, 789)比较时它通过了测试。你知道吗

所以，我做了一些改变，现在它似乎像预期的那样工作：

best_list = [] # Create a list that will store the best intervals
best_list.append(sort[0].rsplit(" ", 1)[0].split()) # Append the first interval of the sorted list
# Loop through the sorted list
for line in sort:
    local_start, local_end = line.rsplit("\s", 1)[0].split()
    flag = False # <- flag to check the overall overlapping
    for i in range(len(best_list)):
        best_start = best_list[i][0]
        best_end = best_list[i][1]
        test = overlap(int(best_start), int(best_end), int(local_start), int(local_end))
        print(test)
        if test:
            flag = False
            break
        flag = True
    if flag:
        best_list.append([local_start, local_end])

主要思想是检查每个元素，如果它通过了所有重叠的测试，那么添加它（我代码的最后一行）。以前没有。你知道吗

相关问题更多 >

编程相关推荐

热门问题

热门文章