如何使用正则表达式在Python中查找模式时转义\triangle、\bold等

3条回答

网友

1楼 · 编辑于 2024-05-14 19:08:12

>>> txt = r"\triangle \bold \new \regex"   #Notice the leading r
>>> txt
'\\triangle \\bold \\new \\regex'
>>> txt.split('\\')
['', 'triangle ', 'bold ', 'new ', 'regex']

网友

2楼 · 编辑于 2024-05-14 19:08:12

您已经提到，您正在使用变量s来存储字符串，而不是在其中使用r前缀。所以有一个问题。如果字符串中有\u或\x或\U或\N，则将引发SyntaxError。例如：

>>> s = 'There is no way o\ut'
  File "<stdin>", line 1
SyntaxError: (unicode error) 'unicodeescape' codec can't decode bytes in position 17-18: truncated \uXXXX escape
>>> s = 'Cross symbol(\x) says it is wrong'
  File "<stdin>", line 1
SyntaxError: (unicode error) 'unicodeescape' codec can't decode bytes in position 13-14: truncated \xXX escape
>>> s = 'What an escape Seque\Nce'
  File "<stdin>", line 1
SyntaxError: (unicode error) 'unicodeescape' codec can't decode bytes in position 20-21: malformed \N character escape
>>> s = 'What an escape Seq\Uence'
  File "<stdin>", line 1
SyntaxError: (unicode error) 'unicodeescape' codec can't decode bytes in position 18-20: truncated \UXXXXXXXX escape

因此，如果我假设您的字符串没有\u或\x或\U或\N，那么您可以尝试以下方法：

>>> import re
>>> def repEsc(s):
        s = re.sub('\a', r'\\a', s)
        s = re.sub('\b', r'\\b', s)
        s = re.sub('\f', r'\\f', s)
        s = re.sub('\v', r'\\v', s)
        s = re.sub('\n', r'\\n', s)
        s = re.sub('\r', r'\\r', s)
        s = re.sub('\t', r'\\t', s)
        return s
>>> s = '\triangle \bold \new \regex'
>>> s = repEsc(s)
>>> s
'\\triangle \\bold \\new \\regex'
>>> print(s)
\triangle \bold \new \regex

网友

3楼 · 编辑于 2024-05-14 19:08:12

不确定这是否只是一个解决方法，但您可以从头开始重建字符串。试试这个：

import re

string = "\triangle \bold \new \regex"


escape_dict = {
    '\a' : r'\a',
    '\b' : r'\b',
    '\c' : r'\c',
    '\f' : r'\f',
    '\n' : r'\n',
    '\r' : r'\r',
    '\t' : r'\t',
    '\v' : r'\v',
    '\'' : r'\'',
    '\"' : r'\"'
}

def raw(string):
    new_string = ""
    for char in string:
        try: 
            new_string += escape_dict[char]
        except KeyError: 
            new_string += char
    return new_string

matches = re.findall(r"\\\w+", raw(string))
print(matches)

但是，我想看看你是否可以在代码的前面修改一些东西

相关问题更多 >

编程相关推荐

热门问题

热门文章

如何使用正则表达式在Python中查找模式时转义\triangle、\bold等

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >