Find all common substrings between two strings, regardless of case and ord

t = int(input())  # t cases
while t > 0:
    A = str(input())  # 1st string
    B = str(input())  # 2nd string
    low_A = A.lower()
    low_B = B.lower()
    answer = ""
    anslist = []
    for i in range(len(A)):
        common = ""
        for j in range(len(B)):
            if (i + j < len(A) and low_A[i + j] == low_B[j]):
                common += B[j]
            else:
                #if (len(common) > len(answer)):
                answer = common
                if answer != '' and len(answer) > 1:
                    anslist.append(answer)
                common = ""
        if common != '':
            anslist.append(common)
    if len(anslist) == 0:
        print('[]')  # print if no common substring
    else:
        print(anslist)
    t -= 1

2 Answers

User

#1 · edited 2024-05-15 23:20:17

This is a duplicate of Finding all the common substrings of given two strings, which offers a Java solution. I did my best to convert that solution to Python and "enhanced" it to be case-insensitive:

def find_common(s, t):
    # table[i][j] holds the length of the common run ending at s[i] and t[j]
    table = [len(t)*[0] for _ in range(len(s))]
    longest = 0
    result = set()
    for i, ch1 in enumerate(s.lower()):
        for j, ch2 in enumerate(t.lower()):
            if ch1 != ch2:
                continue
            table[i][j] = 1 if i == 0 or j == 0 else 1 + table[i - 1][j - 1]
            if table[i][j] > longest:
                # A longer run was found: discard the shorter results
                longest = table[i][j]
                result.clear()
            if table[i][j] == longest:
                # Slice from s, so the result keeps the casing of s
                result.add(s[i - longest + 1:i + 1])
    return result


print(find_common('Bonywasawarrior', 'Bonywasxwarrior'))
print(find_common('01101001', '101010'))
print(find_common('ABCDXGHIJ', 'ghijYAbCd'))

Prints:

{'Bonywas', 'warrior'}
{'1010'}
{'GHIJ', 'ABCD'}
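
Note that find_common keeps only the longest matches. As a rough sketch, not part of the original answer, the same table could be used to collect every maximal common run instead. The name `find_all_common` and the `min_len` parameter are assumptions made for illustration, and the code assumes `lower()` does not change the length of either string (true for ASCII input):

```python
def find_all_common(s, t, min_len=2):
    """Collect every maximal case-insensitive common substring of s and t.

    Hypothetical helper: "maximal" here means the match cannot be extended
    to the right at that pair of positions. Results keep the casing of s.
    """
    ls, lt = s.lower(), t.lower()
    # table[i][j] = length of the common run ending at s[i] and t[j]
    table = [[0] * len(t) for _ in range(len(s))]
    result = set()
    for i, ch1 in enumerate(ls):
        for j, ch2 in enumerate(lt):
            if ch1 != ch2:
                continue
            table[i][j] = 1 if i == 0 or j == 0 else 1 + table[i - 1][j - 1]
            # The run stops here if either string ends or the next characters differ
            at_end = i + 1 == len(s) or j + 1 == len(t)
            if at_end or ls[i + 1] != lt[j + 1]:
                if table[i][j] >= min_len:
                    result.add(s[i - table[i][j] + 1:i + 1])
    return result


print(find_all_common('ABCDXGHIJ', 'ghijYAbCd'))
```

Unlike find_common, this does not discard shorter runs when a longer one turns up, so `min_len` is the only filter on what gets reported.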

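User

#2 · edited 2024-05-15 23:20:17

You can increment an offset in a while loop, extending the match for as long as the characters at that offset from the two starting indices are equal, and stopping once they differ. To find the longest non-overlapping common substrings, you can use a function that recursively explores the different ways of partitioning the strings into substrings and returns the path with the longest substring lengths: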

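def common_strings(a, b, i=0, j=0):
    # Past the end of a there is nothing left to match
    # (this also stops the recursion when b is empty)
    if i >= len(a):
        return []
    candidates = []
    len_a = len(a)
    len_b = len(b)
    if j == len_b:
        # b is exhausted for this position in a: move on in a and restart b
        candidates.append(common_strings(a, b, i + 1, 0))
    elif i < len_a:
        # Extend the match while the characters agree, ignoring case
        offset = 0
        while i + offset < len_a and j + offset < len_b and a[i + offset].lower() == b[j + offset].lower():
            offset += 1
        if offset > 1:
            # Option 1: take this match and continue after it in both strings
            candidates.append([a[i: i + offset]] + common_strings(a, b, i + offset, j + offset))
        # Option 2: skip this position in b
        candidates.append(common_strings(a, b, i, j + 1))
    # Pick the path whose substring lengths, sorted in descending order, compare largest
    return candidates and max(candidates, key=lambda t: sorted(map(len, t), reverse=True))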

So that:

print(common_strings('ABCDXGHIJ', 'ghijYAbCd'))
print(common_strings('Bonywasawarrior', 'Bonywasxwarrior'))
print(common_strings('01101001', '101010'))

Outputs:

['ABCD', 'GHIJ']
['Bonywas', 'warrior']
['1010']
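
To look at the offset step on its own, the while loop inside common_strings can be pulled out into a small helper. This is only an illustrative sketch; the name `match_length` is an assumption and does not appear in the original answer:

```python
def match_length(a, b, i, j):
    """Return how many characters of a[i:] and b[j:] agree, ignoring case."""
    offset = 0
    # Keep extending while both indices are in range and the characters match
    while (i + offset < len(a) and j + offset < len(b)
           and a[i + offset].lower() == b[j + offset].lower()):
        offset += 1
    return offset


print(match_length('ABCDXGHIJ', 'ghijYAbCd', 0, 5))  # 4 ('ABCD' vs 'AbCd')
```

The recursive function then only has to decide, at each pair of indices, whether to take a match of that length or to skip ahead.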
