我在哪里可以得到统一的列表,通过类code字符? [英] Where can I get a list of Unicode chars by class?

查看:144
本文介绍了我在哪里可以得到统一的列表,通过类code字符?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我是新来学习的Uni code和不知道我有多少根据我ASCII背景学习,但我读的对规则的C#规范标识符,以确定字符的的天青表(这是直接基于C#规格)。

I'm new to learning Unicode, and not sure how much I have to learn based on my ASCII background, but I'm reading the C# spec on rules for identifiers to determine what chars are permitted within Azure Table (which is directly based on the C# spec).

我在哪里可以找到属于这些类别的统一code字符的列表:

Where can I find a list of Unicode characters that fall into these categories:


  • 字母字符:类鲁,LL,中尉,LM,螺,或标准升一个统一code字

  • 组合字符:类锰或MC的UNI code字

  • 十进制数字符:类钕的UNI code字

  • 连接字符:类的UNI code角色PC

  • 格式化字符:类Cf的UNI code字

  • letter-character: A Unicode character of classes Lu, Ll, Lt, Lm, Lo, or Nl
  • combining-character: A Unicode character of classes Mn or Mc
  • decimal-digit-character: A Unicode character of the class Nd
  • connecting-character: A Unicode character of the class Pc
  • formatting-character: A Unicode character of the class Cf

推荐答案

您可以从官方的Uni code数据文件中的一个自动的方式获取这些信息,的Uni codeData.txt ,这是发表在这里:

You can retrieve this information in an automated fashion from the official Unicode data file, UnicodeData.txt, which is published here:

  • UnicodeData.txt (at unicode.org)

这是在每行分号分隔值的文件。第三列告诉你字符类每个字符。

This is a file with semicolon-separated values in each line. The third column tells you the character class of each character.

这样做的好处是,你可以为每个字符获得字符的名称,所以你知道是什么比只看字符本身(例如,你想知道什么ბ是什么?这是正确的,这是一个更好的主意潘基文在格鲁吉亚: - 。)

The benefit of this is that you can get the character name for each character, so you have a better idea of what it is than by just looking at the character itself (e.g. would you know what ბ is? That’s right, it’s Ban. In Georgian. :-))

这篇关于我在哪里可以得到统一的列表,通过类code字符?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆