Regex to remove hyphens at the start of a word, in the middle of a word, and at both the start and end

2 answers

User

#1 · Edited 2024-05-15 00:15:31

Here is one approach:

import re

inp = "ease singapore-based -fis -sgfis- fatca"
output = re.sub(r'(?<=\w)-|-(?=\w)', '', inp)
print(output)  # ease singaporebased fis sgfis fatca

The regex used above says to match:

(?<=\w)-  match a hyphen preceded by a word character
|         OR
-(?=\w)   match a hyphen followed by a word character

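We then replace each matched hyphen with the empty string, which removes it. A hyphen with no word character on either side is left alone.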
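To see exactly which hyphens the two lookarounds pick up, here is a minimal sketch. The sample string is made up for illustration (it is not from the question) and adds a free-standing hyphen that touches no word character.

```python
import re

# Same pattern as in the answer above.
pattern = re.compile(r'(?<=\w)-|-(?=\w)')

# Hypothetical sample: a free-standing hyphen, a mid-word hyphen,
# a leading hyphen and a trailing hyphen.
sample = "a - b pre-fix -lead trail-"

# Offsets of every hyphen the pattern matches.
positions = [m.start() for m in pattern.finditer(sample)]
cleaned = pattern.sub('', sample)

print(positions)  # [9, 14, 25]
print(cleaned)    # a - b prefix lead trail
```

The hyphen at offset 2 has a space on both sides, so neither lookaround succeeds and it survives.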

User

#2 · Edited 2024-05-15 00:15:31

Solution 1:

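import re
out = re.sub(r'-\b|\b-', ' ', "ease singapore-based fis -sgfis- fatca")
out = re.sub(r' {2,}', ' ', out)  # collapse the doubled spaces left behind
print(out)  # ease singapore based fis sgfis fatca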

Expression 1:

"-\b|\b-"

\b matches a word boundary, i.e. the position between a word character and a non-word character
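Since \b matches a position rather than a character, it can help to list where those positions fall. A small sketch, using the "-sgfis-" token from the example input:

```python
import re

# \b consumes no characters; it matches where a \w character meets
# a non-\w character or the edge of the string.
text = "-sgfis-"

boundaries = [m.start() for m in re.finditer(r'\b', text)]
hits = re.findall(r'-\b|\b-', text)

print(boundaries)  # [1, 6]
print(hits)        # ['-', '-']
```

Positions 1 and 6 are the two edges of "sgfis", which is why -\b catches the leading hyphen and \b- catches the trailing one.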

Or solution 2:

out = re.sub(r'\s-\b|\b-\s', ' ', "ease singapore-based fis -sgfis- fatca")
print(out)  # ease singapore-based fis sgfis fatca

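Expression 2:

"\s-\b|\b-\s"

\s matches a whitespace character, so the hyphen only matches when whitespace sits on its outer side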
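One thing to watch with expression 2: because it requires whitespace next to the hyphen, a hyphenated token at the very start or end of the string is not touched. The sketch below shows this on a made-up input; the edge_aware pattern is my own assumption about one way to cover the string edges, not something from this answer.

```python
import re

pattern = r'\s-\b|\b-\s'

# Hypothetical input: hyphenated tokens at the very start and end.
edge_case = "-fis sgfis-"
unchanged = re.sub(pattern, ' ', edge_case)
print(unchanged)  # -fis sgfis-

# Possible workaround (an assumption, not part of the answer):
# accept the string edges via ^ and $ as well as whitespace.
edge_aware = r'(?:^|\s)-\b|\b-(?:\s|$)'
result = re.sub(edge_aware, ' ', edge_case).strip()
print(result)  # fis sgfis
```

A mid-word hyphen such as the one in "singapore-based" is still left alone by both patterns.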

If you need "singapore-based" to become "singapore based", use solution 2 and combine it with \b-\b:

So you end up with (\b-\b)|(\s-\b|\b-\s)

Solution 3:

out = re.sub(r'(\b-\b)|(\s-\b|\b-\s)', ' ', "ease singapore-based fis -sgfis- fatca")
print(out)  # ease singapore based fis sgfis fatca (no space trimming required)
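For a side-by-side view, here is a rough sketch that runs each pattern from this thread on the input string used in the first answer. The dictionary labels are just names I picked for the comparison.

```python
import re

inp = "ease singapore-based -fis -sgfis- fatca"

# (pattern, replacement) pairs taken from the answers in this thread.
patterns = {
    "answer 1": (r'(?<=\w)-|-(?=\w)', ''),
    "solution 1": (r'-\b|\b-', ' '),
    "solution 2": (r'\s-\b|\b-\s', ' '),
    "solution 3": (r'(\b-\b)|(\s-\b|\b-\s)', ' '),
}

results = {}
for name, (pattern, repl) in patterns.items():
    out = re.sub(pattern, repl, inp)
    # split/join collapses doubled spaces (only solution 1 produces them here)
    results[name] = ' '.join(out.split())
    print(f"{name}: {results[name]}")
```

The main difference is the mid-word hyphen: answer 1 glues the halves together ("singaporebased"), solutions 1 and 3 split them with a space ("singapore based"), and solution 2 keeps "singapore-based" as it is.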