处理有效和无效utf-8字符串的扩展
pyutf8的Python项目详细描述
pyutf8提供了有效处理验证的工具 以及清除utf-8字符串。主要用例是“垃圾输入utf-8输出”。
用法:
>>> from pyutf8 import valid_utf8_bytes >>> SNOWMAN = u'\N{SNOWMAN}' >>> SNOWMAN_BYTES = SNOWMAN.encode('utf-8') >>> valid_utf8_bytes(SNOWMAN) == SNOWMAN_BYTES True >>> valid_utf8_bytes(SNOWMAN_BYTES) == SNOWMAN_BYTES True >>> valid_utf8_bytes('\xff' + SNOWMAN_BYTES + '\xff') == SNOWMAN_BYTES True