用Unicode字母化阿拉伯语和日语文本? [英] Alphabetize Arabic and Japanese text that is in Unicode?
问题描述
有人可以用任何代码将Unicode的阿拉伯语和日语文本按字母顺序排列吗?如果代码是红宝石,那就太好了.
Does anyone have any code for alphabetizing Arabic and Japanese text that is in Unicode? If the code was in ruby that would be great.
推荐答案
Unicode代码点未按字母顺序列出(例如Z< a),但是无论如何它们都试图以大约该顺序排列.由 Unicode归类算法定义的规范unicode顺序,并且它们也是语言-可以在语言环境信息中指定的特定顺序(法语顺序与德语或捷克语顺序并不完全相同,即使使用相同的字母).我认为 ICU库包含您要查找的特定于语言的算法.
Unicode code points are not listed in alphabetic order (Z < a, for example), but they try to be approximately in that order anyway. There is a canonical unicode order, defined by the Unicode Collation Algorithm and they are also language-specific ordering (french order is not exacly the same as german or czech order, even with the same alphabet), which can be specified in locale information. I think the ICU library contains the language specific algorithms you are looking for.
这篇关于用Unicode字母化阿拉伯语和日语文本?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!