如果列表中的字符串失败并带有变音符号

#!/usr/bin/env python # -*- coding: utf-8 -*- def article(nomPays): voyelles = ['A','E','É','I','O','U','Y'] if nomPays == 'Mexique': return 'du' elif nomPays[0] in voyelles: return 'de l\'' elif nomPays[-1] == 'e':#signe négatif pour compter à partir de la dernière lettre return 'de la' else: return 'du' print article('Érythrée')

3条回答

网友

1楼 · 编辑于 2024-05-16 05:59:01

在^{之前添加u：

voyelles = [u'A',u'E',u'É',u'I',u'O',u'U',u'Y']
...
print article(u'Érythrée')

示例：

^{pr2}$

网友

2楼 · 编辑于 2024-05-16 05:59:01

问题是在python2中使用str，其中str是一个字节序列，因此{}将给出字符串的第一个字节，而不是第一个字符。在单字节编码中，这不是一个问题，但是对于像UTF-8这样的多字节编码，“赤道”的第一个字节是一个前导字节，而不是整个字符“а”。在

您需要更改为使用unicode来获取第一个字符：

firstChar = unicode(nomPays, 'UTF-8')[0].encode('UTF-8')

实际上，使用startswith可能更容易：

^{pr2}$

或者，您可以在整个应用程序中使用unicode，或者切换到python3，在那里所有这些都处理得更好。在

网友

3楼 · 编辑于 2024-05-16 05:59:01

它是字节字符串，而不是unicode字符串，因此字符串的第一个元素是：

>>> 'Érythrée'[0]
'\xc3'

这是因为UT8编码。在

相关问题更多 >

编程相关推荐

热门问题

热门文章