Why is "ǃ".isalpha() True while "!".isalpha() is False?

2 answers

User

#1 · edited 2024-05-29 05:15:50

From the documentation:

Return True if all characters in the string are alphabetic and there is at least one character, False otherwise. Alphabetic characters are those characters defined in the Unicode character database as “Letter”, i.e., those with general category property being one of “Lm”, “Lt”, “Lu”, “Ll”, or “Lo”. Note that this is different from the “Alphabetic” property defined in the Unicode Standard.

This means the character you are using is defined as a letter in the Unicode character database:

>>> ord("ǃ")
451

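According to Wikipedia's list of Unicode characters, ǃ (code point 451, i.e. U+01C3) sits in the Latin Extended-B block and is classified as a letter, which is why isalpha returns True.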
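The rule quoted from the documentation can be sketched directly with the standard unicodedata module. This is only an illustration of the documented definition, not how CPython implements str.isalpha internally; the names LETTER_CATEGORIES and isalpha_sketch are made up for this example.

```python
import unicodedata

# General categories the documentation lists as "Letter".
LETTER_CATEGORIES = {"Lm", "Lt", "Lu", "Ll", "Lo"}


def isalpha_sketch(s: str) -> bool:
    """Hypothetical re-statement of the documented str.isalpha rule."""
    # At least one character, and every character must be in a Letter category.
    return len(s) > 0 and all(
        unicodedata.category(c) in LETTER_CATEGORIES for c in s
    )


click = "\u01c3"  # LATIN LETTER RETROFLEX CLICK, looks like "!"
for ch in (click, "!"):
    # Compare the sketch with the built-in method for each character.
    print(repr(ch), isalpha_sketch(ch), ch.isalpha())
```

For both characters the sketch and the built-in agree: the click letter counts as alphabetic, the ASCII exclamation mark does not.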

User

#2 · edited 2024-05-29 05:15:50

Look the characters up in the Unicode database. The exclamation-mark lookalike ǃ (\u01c3) is a letter:

import unicodedata

# Print each character with its code point, general category, and Unicode name.
for c in "!ǃ":
    print(c, '{:04x}'.format(ord(c)), unicodedata.category(c), unicodedata.name(c))

! 0021 Po EXCLAMATION MARK
ǃ 01c3 Lo LATIN LETTER RETROFLEX CLICK
