UnicodeEncodeError:“ascii”编解码器无法对位置20中的字符u“\xa0”进行编码：序号不在范围（128）内

Traceback (most recent call last): File "foobar.py", line 792, in <module> p.agent_info = str(agent_contact + ' ' + agent_telno).strip() UnicodeEncodeError: 'ascii' codec can't encode character u'\xa0' in position 20: ordinal not in range(128)

3条回答

网友

1楼 · 编辑于 2024-04-19 21:11:37

您需要阅读PythonUnicode HOWTO。这个错误是very first example。

基本上，停止使用str将unicode转换为编码文本/字节。

相反，请正确使用^{}对字符串进行编码：

p.agent_info = u' '.join((agent_contact, agent_telno)).encode('utf-8').strip()

或者完全使用unicode。

网友

2楼 · 编辑于 2024-04-19 21:11:37

这是典型的python unicode痛点！请考虑以下几点：

a = u'bats\u00E0'
print a
 => batsà

到目前为止一切都很好，但如果我们称之为str（a），让我们看看会发生什么：

str(a)
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
UnicodeEncodeError: 'ascii' codec can't encode character u'\xe0' in position 4: ordinal not in range(128)

哦，迪普，那对任何人都没有好处！若要修复错误，请使用.encode显式编码字节，并告诉python要使用的编解码器：

a.encode('utf-8')
 => 'bats\xc3\xa0'
print a.encode('utf-8')
 => batsà

喂\u00E0！

问题是，当您调用str（）时，python使用默认的字符编码来尝试对您给它的字节进行编码，在您的例子中，这些字节有时是unicode字符的表示。要解决这个问题，您必须告诉python如何使用.encode（'whatever_unicode'）处理您给它的字符串。大多数时候，使用utf-8应该没问题。

关于这个主题的一个很好的解释，请参见Ned Batchelder的PyCon谈话：http://nedbatchelder.com/text/unipain.html

网友

3楼 · 编辑于 2024-04-19 21:11:37

我找到了一个很好的方法来移除符号并继续将字符串保持为字符串，如下所示：

yourstring = yourstring.encode('ascii', 'ignore').decode('ascii')

需要注意的是，使用ignore选项是很危险的，因为它会从使用它的代码中无声地删除任何unicode（和国际化）支持，如下所示（convert unicode）：

>>> u'City: Malmö'.encode('ascii', 'ignore').decode('ascii')
'City: Malm'

相关问题更多 >

编程相关推荐

热门问题

热门文章