有没有办法为字典列表中的下一个字典中的同一个键获取下一个值？

list_a = [{'Ch': 'I', 'name': 'test_1', 'site': 2}, {'Ch': 'II', 'name': 'test_2', 'site': 8}, {'Ch': 'II', 'name': 'test_3', 'site': 10}] list_b = [{'Ch': 'I', 'name': 'gene_a', 'start': 1, 'end': 3}, {'Ch': 'II', 'name': 'gene_b', 'start': 3, 'end': 6}]

for item_a in list_a: for item_b in list_b: if item_a['Ch'] == item_b['Ch'] and item_a['site'] >= item_b['start'] and item_a['site'] <= item_b['end']: print item_b['name'], item_a['site']

if item_a['site'] >= item_b['start'] and item_a['site'] >= item_b['end'] and item_a['site'] <= the next site in the next dictionary in list_a... or the beginning of the next gene in the next dictionary... ???

2条回答

网友

1楼 · 编辑于 2024-06-07 07:28:15

更有效的方法是按排序顺序将节解析为结构perCh值：

from collections import defaultdict
import bisect

ranges = defaultdict(list)
for info in list_b:
    bisect.insort(ranges[info['Ch']], (info['start'], info['end'], info['name']))

bisect.insort()调用按排序顺序将新条目插入列表，从而为您节省另一个排序循环。你知道吗

现在用这个来定位给定list_aCh值的范围：

for gene in list_a:
    for start, stop, name in ranges[gene['Ch']]:
        if start <= gene['site'] <= stop:
            print name, gene['site']
            break

当然，这仍然不会根据“stop”参数搜索下一个匹配项，但是后一个循环可以被折叠成一个生成器表达式，适合在next()函数中使用，并且由于范围已排序，因此可以继续搜索下一个站点名称：

for gene in list_a:
    site = gene['site']
    range = iter(ranges[gene['Ch']])
    # skip anything with start > site
    name = previous = next((name for start, stop, name in range if start <= site), None)

    # search on for a matching stop, looking ahead. If we find a stop < site
    # the previous entry matched. If we ran of the end of our options, the last
    # entry matched.
    for start, stop, name  in range:
        if site > stop:
            previous = name
            continue
        if start > site:
            name = previous
        break

    print name, site

rangeiterable“记住”第一次next()搜索停止的位置，我们可以循环遍历它，从该点开始继续搜索合适的stop值。你知道吗

注意，假设stop值是总是将等于或大于start值；测试下一个项目start值也没有意义；如果site <= stop是True，那么site <= start也是也是^{True。你知道吗

网友
2楼 · 编辑于 2024-06-07 07:28:15

我想你可以做些更直截了当的事。你知道吗
在列表b中，您可以添加一个名为site:的新键，您可以将其设置为（start+end）/2。你知道吗
然后合并列表a和列表b，并按排序后的列表中的键（Ch:，site:）对它们进行排序。你知道吗
然后一次列出一个。如果它是一个基因（来自列表a），请跳过它并跟踪它的名称：如果它是一个站点（来自列表b），请将它的名称设置为上一个项目的名称：或使用您保存的名称。你知道吗
可能有一些“什么是最接近的”做调整，但我相信你可以做的前瞻性和背后，你目前的立场，做一些适当的业务逻辑。你知道吗

相关问题更多 >

编程相关推荐

热门问题

热门文章