Setting an implicit default encoding/decoding error handler in Python

2 answers

User

Reply #1 · edited 2024-05-19 01:41:44

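There is a reason scripts cannot call sys.setdefaultencoding. Don't do it: some libraries, including the standard library that ships with Python, expect the default to be "ascii".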

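Instead, explicitly decode byte strings to Unicode as they enter your program (from a file, stdin, a socket, etc.), and explicitly encode strings as you write them out.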
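A minimal sketch of that decode-on-input, encode-on-output pattern, assuming Python 3. The helper name `shout` and the sample bytes are made up for illustration; real input would come from a file opened in binary mode, a socket, or `sys.stdin.buffer`.

```python
def shout(raw: bytes, encoding: str = "utf-8") -> bytes:
    """Decode at the boundary, work on text, encode on the way out."""
    text = raw.decode(encoding)       # bytes -> str, once, on input
    result = text.upper()             # all processing happens on str
    return result.encode(encoding)    # str -> bytes, once, on output


# Sample bytes standing in for data read from a file, stdin or a socket.
incoming = "caf\u00e9".encode("utf-8")
outgoing = shout(incoming)
print(outgoing)
```

Keeping the conversion at the edges means the rest of the program only ever sees `str`, so no implicit default encoding is involved.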

Explicit decoding takes an argument that specifies what to do with bytes that cannot be decoded.
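As a rough illustration of that argument, the sketch below decodes the same invalid input with several of the built-in `errors` values. It assumes Python 3.5 or later (where `backslashreplace` also works for decoding); the helper `decode_with` and the sample bytes are hypothetical.

```python
def decode_with(raw: bytes, errors: str) -> str:
    """Decode raw as UTF-8 using the given error-handling policy."""
    return raw.decode("utf-8", errors=errors)


# Latin-1 encoded bytes: \xe9 followed by a space is not valid UTF-8.
raw = b"caf\xe9 au lait"

for errors in ("ignore", "replace", "backslashreplace"):
    # ascii() keeps the output printable on any terminal.
    print(errors, "->", ascii(decode_with(raw, errors)))

try:
    decode_with(raw, "strict")        # "strict" is the default policy
except UnicodeDecodeError as exc:
    print("strict -> raised UnicodeDecodeError:", exc.reason)
```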

User

Reply #2 · edited 2024-05-19 01:41:44

You can define your own custom error handler and use it to do whatever you need. See the following example:

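import codecs
from logging import getLogger

log = getLogger()

def custom_character_handler(exception):
    # Called with the UnicodeDecodeError or UnicodeEncodeError; log what failed and where.
    log.error("%s for %s on %s from position %s to %s. Using '?' in-place of it!",
            exception.reason,
            exception.object[exception.start:exception.end],
            exception.encoding,
            exception.start,
            exception.end )
    # Return the replacement text and the position at which to resume.
    return ("?", exception.end)

# Make the handler available by name as an `errors` argument.
codecs.register_error("custom_character_handler", custom_character_handler)

# \xbb is not a valid UTF-8 start byte, so the handler runs while decoding.
print( b'F\xc3\xb8\xc3\xb6\xbbB\xc3\xa5r'.decode('utf8', 'custom_character_handler') )
# U+03C0 cannot be encoded as ASCII, so the handler runs while encoding.
print( codecs.encode(u"abc\u03c0de", "ascii", "custom_character_handler") )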

Running it under Python 3 logs one error message per offending character and prints:

Føö?Bår
b'abc?de'
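One way to get closer to an "implicit default" is to attach a registered handler to the stream itself, so that individual reads and writes need no `errors` argument. The sketch below assumes Python 3; the handler name `question_mark` and the in-memory `BytesIO` objects standing in for real files are made up for illustration. The built-in `open()` accepts the same `errors` argument.

```python
import codecs
import io


def question_mark_handler(exc):
    # Hypothetical handler: replace the offending span with "?" and
    # resume after it. Works for both decode and encode errors.
    return ("?", exc.end)


codecs.register_error("question_mark", question_mark_handler)

# Reading: the handler is set once, when the text stream is created.
source = io.BytesIO(b"F\xc3\xb8\xbbB")    # \xbb is an invalid start byte
reader = io.TextIOWrapper(source, encoding="utf-8", errors="question_mark")
text = reader.read()
print(ascii(text))

# Writing: the same handler covers characters the target encoding lacks.
sink = io.BytesIO()
writer = io.TextIOWrapper(sink, encoding="ascii", errors="question_mark")
writer.write("abc\u03c0de")
writer.flush()
written = sink.getvalue()
print(written)
```

This keeps the policy local to each stream rather than changing a process-wide default, which is the approach the first reply warns against.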
