擅长:python、mysql、java
<p>根据<a href="https://stackoverflow.com/a/54815485/16354567">this</a>,在<code>re.sub()</code>中包含“\”的模式中不允许有未知转义,这是您遇到的错误</p>
<p>您可以做的是将原始字符串返回到像<a href="https://stackoverflow.com/a/24242596/16354567">this</a>这样的“正常”字符串,但我不确定这是否是最佳实践:</p>
<pre class="lang-py prettyprint-override"><code>import codecs
import re
def preprocessing(content):
for d in range(10):
arabic_digit = codecs.decode(rf"\u066{d}", 'unicode_escape')
persian_digit = codecs.decode(rf"\u06F{d}", 'unicode_escape')
content = re.sub(arabic_digit, persian_digit, content)
return content
</code></pre>