UTF-8是否适合所有常用语言? [英] Is UTF-8 enough for all common languages?

查看:154
本文介绍了UTF-8是否适合所有常用语言?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我只想在Django项目中开发一个翻译应用程序,使注册用户具有某些权限,可以翻译其最新版本中显示的每一条消息。

I just wanted to develop a translation app in a Django projects which enables registered users with certain permissions to translate every single message it appears in latest version.

我的问题是,这个翻译应用程序中的数据库表应该使用什么字符集?看起来有些欧洲语言字符不能存储在UTF-8中?

My question is, what character set should I use for database tables in this translation app? Looks like some european language characters cannot be stored in UTF-8?

推荐答案


语言字符不能存储在UTF-8中?

Looks like some european language characters cannot be stored in UTF-8?

不正确。 UTF-8可以存储任何字符集,但不限于Klingon。 UTF-8是您的国际化一站式服务。如果您对字符有问题,那么它们最有可能是编码问题,或者缺少对用于显示数据的字体的字符范围的支持(尽管非常不可能使用欧洲语言字符,但通常情况下,例如在查看欧洲计算机上的印度网站。另见此问题

Not true. UTF-8 can store any character set without limitations except maybe for Klingon. UTF-8 is your one stop shop for internationalization. If you have problems with characters, they are most likely to be encoding problems, or missing support for that character range in the font you're using to display the data with (Extremely unlikely for a european language character though, but common e.g. when viewing indian sites on an european computer. See also this question)

如果不能呈现非西方字符集,则可能是用户的内置字体没有涵盖UTF-8的范围。

If a non-western character set can't be rendered, it could be that the user's built in font does not have that range of UTF-8 covered.

更新:克林贡确实不是 官方UTF-8

Update: Klingon it is indeed not part of official UTF-8:


一些现代发明的脚本尚未包含在Unicode(例如,Tengwar)中,或者由于缺乏真实世界的使用(例如,Klingon)而无法包含在Unicode中。在ConScript Unicode注册表中列出了非官方但广泛使用的Private使用区号代码。

Some modern invented scripts which have not yet been included in Unicode (e.g., Tengwar) or which do not qualify for inclusion in Unicode due to lack of real-world use (e.g., Klingon) are listed in the ConScript Unicode Registry, along with unofficial but widely-used Private Use Area code assignments.

但是,有一个志愿者项目,在私人区域向Klingon正式分配了代码点F8D0-F8FF。 克林贡人物画廊

However, there is a volunteer project that has inofficially assigned code points F8D0-F8FF in the private area to Klingon. Gallery of Klingon characters

这篇关于UTF-8是否适合所有常用语言?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆