uctools
Tools for displaying information about Unicode characters (UTF-8).
Copyright © 2018, Luís Gomes <luismsgomes@gmail.com>.
The following command-line tools are provided:
- ucinfo: writes to stdout the name of each Unicode character read from stdin
- ucenum: enumerates on stdout all Unicode characters of a chosen category
ucinfo
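The ucinfo tool reads UTF-8 text from stdin and writes information about each character to stdout, one character per line. The output has 5 tab-separated columns: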
- the character itself, if printable, or an escaped representation of it
- the decimal codepoint of the character
- the number of bytes that the character occupies
- the Unicode category of the character
- the Unicode name of the character
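The five columns above can be approximated with Python's standard `unicodedata` module. The following is only a minimal sketch, not the actual uctools implementation: the helper names `describe` and `ucinfo_lines` are hypothetical, and the choice of `unicode_escape` for non-printable characters is an assumption about what "escaped representation" means.

```python
import unicodedata


def describe(ch):
    """Return the five columns for one character (hypothetical helper)."""
    # Assumption: non-printable characters are shown via Python's
    # unicode_escape codec; the real tool may escape them differently.
    shown = ch if ch.isprintable() else ch.encode("unicode_escape").decode("ascii")
    return (
        shown,                         # the character or its escaped form
        str(ord(ch)),                  # decimal code point
        str(len(ch.encode("utf-8"))),  # number of bytes in UTF-8
        unicodedata.category(ch),      # Unicode category, e.g. "Lu"
        unicodedata.name(ch, ""),      # Unicode name, empty if it has none
    )


def ucinfo_lines(text):
    """Yield one tab-separated line per character of `text`."""
    for ch in text:
        yield "\t".join(describe(ch))


if __name__ == "__main__":
    for line in ucinfo_lines("A€\n"):
        print(line)
```

Control characters such as the newline have no Unicode name, so the sketch leaves the last column empty for them.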
ucenum
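The ucenum tool takes a category abbreviation as its argument and outputs a list of all characters belonging to that category. The categories are: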
- Lu: Letter, Uppercase
- Ll: Letter, Lowercase
- Lt: Letter, Titlecase
- Lm: Letter, Modifier
- Lo: Letter, Other
- Mn: Mark, Nonspacing
- Mc: Mark, Spacing Combining
- Me: Mark, Enclosing
- Nd: Number, Decimal Digit
- Nl: Number, Letter
- No: Number, Other
- Pc: Punctuation, Connector
- Pd: Punctuation, Dash
- Ps: Punctuation, Open
- Pe: Punctuation, Close
- Pi: Punctuation, Initial quote (may behave like Ps or Pe depending on usage)
- Pf: Punctuation, Final quote (may behave like Ps or Pe depending on usage)
- Po: Punctuation, Other
- Sm: Symbol, Math
- Sc: Symbol, Currency
- Sk: Symbol, Modifier
- So: Symbol, Other
- Zs: Separator, Space
- Zl: Separator, Line
- Zp: Separator, Paragraph
- Cc: Other, Control
- Cf: Other, Format
- Cs: Other, Surrogate
- Co: Other, Private Use
- Cn: Other, Not Assigned
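Enumerating a category can be sketched in Python by scanning every code point and keeping those whose category matches. This is an illustrative sketch under stated assumptions, not the ucenum source: the function name `chars_in_category` is hypothetical, and the results depend on the Unicode database version bundled with the Python interpreter.

```python
import sys
import unicodedata


def chars_in_category(category):
    """Yield every character whose Unicode category equals `category`."""
    # sys.maxunicode is 0x10FFFF, the last valid code point.
    for codepoint in range(sys.maxunicode + 1):
        ch = chr(codepoint)
        if unicodedata.category(ch) == category:
            yield ch


if __name__ == "__main__":
    # Example: list the space separators with code point and name.
    for ch in chars_in_category("Zs"):
        print(f"U+{ord(ch):04X}\t{unicodedata.name(ch, '')}")
```

Characters in category Cs (surrogates) cannot be encoded as UTF-8, so a real tool would need to escape them or skip them when writing to stdout; the sketch above does not handle that case.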