我在哪里可以得到统一的列表,通过类code字符? [英] Where can I get a list of Unicode chars by class?
问题描述
我是新来学习的Uni code和不知道我有多少根据我ASCII背景学习,但我读的对规则的C#规范标识符,以确定字符的的天青表(这是直接基于C#规格)。
I'm new to learning Unicode, and not sure how much I have to learn based on my ASCII background, but I'm reading the C# spec on rules for identifiers to determine what chars are permitted within Azure Table (which is directly based on the C# spec).
我在哪里可以找到属于这些类别的统一code字符的列表:
Where can I find a list of Unicode characters that fall into these categories:
-
字母字符
:类鲁,LL,中尉,LM,螺,或标准升一个统一code字 -
组合字符
:类锰或MC的UNI code字 -
十进制数字符
:类钕的UNI code字 -
连接字符
:类的UNI code角色PC -
格式化字符
:类Cf的UNI code字
letter-character
: A Unicode character of classes Lu, Ll, Lt, Lm, Lo, or Nlcombining-character
: A Unicode character of classes Mn or Mcdecimal-digit-character
: A Unicode character of the class Ndconnecting-character
: A Unicode character of the class Pcformatting-character
: A Unicode character of the class Cf
推荐答案
您可以从官方的Uni code数据文件中的一个自动的方式获取这些信息,的Uni codeData.txt
,这是发表在这里:
You can retrieve this information in an automated fashion from the official Unicode data file, UnicodeData.txt
, which is published here:
- UnicodeData.txt (at unicode.org)
这是在每行分号分隔值的文件。第三列告诉你字符类每个字符。
This is a file with semicolon-separated values in each line. The third column tells you the character class of each character.
这样做的好处是,你可以为每个字符获得字符的名称,所以你知道是什么比只看字符本身(例如,你想知道什么ბ是什么?这是正确的,这是一个更好的主意潘基文在格鲁吉亚: - 。)
)
The benefit of this is that you can get the character name for each character, so you have a better idea of what it is than by just looking at the character itself (e.g. would you know what ბ is? That’s right, it’s Ban. In Georgian. :-)
)
这篇关于我在哪里可以得到统一的列表,通过类code字符?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!