带近似浮点的列表列表的Python比较

first_list = [['ATOM', 'N', 'SER', -1.081, -16.465, 17.224], ['ATOM', 'C', 'SER', 2.805, -3.504, 6.222], ['ATOM', 'O', 'SER', -17.749, 16.241, -1.333]] secnd_list = [['ATOM', 'N', 'SER', -1.082, -16.465, 17.227], ['ATOM', 'C', 'SER', 2.142, -3.914, 6.222], ['ATOM', 'O', 'SER', -17.541, -16.241, -1.334]]

def aprox (x, y): if x == float and y == float: delta = 0.2 >= abs(x - y) return delta else: rest = x, y return rest def compare (data1, data2): diff = [x for x,y in first_list if x not in secnd_list and aprox(x,y)] + [x for x,y in secnd_list if x not in first_list and aprox(x,y)] return diff

3条回答

网友

1楼 · 编辑于 2024-06-10 20:21:27

可能是您可以迭代两个元素中的每个元素，然后比较子元素：然后，当任何子元素不相等时，可以根据其类型（即如果两个）将其添加到结果中字符串不相等，可以将其添加到结果中，或者如果它是float，math.isclose()可以用于近似：

注：为匹配预期输出进行了更正，first_list的第三个元素中缺少负号

import math

first_list = [['ATOM', 'N', 'SER', -1.081, -16.465,  17.224], 
              ['ATOM', 'C', 'SER', 2.805, -3.504,  6.222], 
              ['ATOM', 'O', 'SER', -17.749, -16.241,  -1.333]] # changes made

secnd_list = [['ATOM', 'N', 'SER', -1.082, -16.465,  17.227],
              ['ATOM', 'C', 'SER', 2.142, -3.914,  6.222], 
              ['ATOM', 'O', 'SER', -17.541, -16.241,  -1.334]]

diff = []
for e1, e2 in zip(first_list, secnd_list):
    for e_sub1, e_sub2 in zip(e1, e2):
        # if sub-elements are not equal
        if e_sub1 != e_sub2:
            # if it is string and not equal
            if isinstance(e_sub1, str):
                diff.append(e1)
                break # one element not equal so no need to iterate other sub-elements
            else:  # is float and not equal
                # Comparison made to 0.2
                if not math.isclose(e_sub1, e_sub2, rel_tol=2e-1):
                    diff.append(e1)
                    break # one element not equal so no need to iterate other sub-elements
diff

输出：

[['ATOM', 'C', 'SER', 2.805, -3.504, 6.222]]

网友

2楼 · 编辑于 2024-06-10 20:21:27

这是有点笨重，但我做了它的飞行，它应该得到你想要的结果。正如我在代码中提到的，您将阈值设置为0.2，这意味着应该返回两行，而不是像您提到的那样返回一行。你知道吗

def discrepancies(x, y):
    for _, (row1, row2) in enumerate(zip(x, y)):
        for _, (item1, item2) in enumerate(zip(row1[3:],row2[3:])):
            if abs(item1 - item2) >= 0.2:
                print row1
                break

discrepancies(first_list, secnd_list)
['ATOM', 'C', 'SER', 2.805, -3.504, 6.222]
['ATOM', 'O', 'SER', -17.749, 16.241, -1.333]

两个注意事项，当每个for循环添加O（n）时，这将变得相当慢，对于列表中较大的列表，我将使用itertools.izip函数，我相信它是被调用的。希望这有帮助！你知道吗

网友

3楼 · 编辑于 2024-06-10 20:21:27

线路

if x == float and y == float

不准确。。。检查变量类型的正确方法是使用type()函数。。。尝试将上面的行替换为

if type(x) is float and type(y) is float:

相关问题更多 >

编程相关推荐

热门问题

热门文章