什么是非Unicode字符 [英] What are Non Unicode characters

查看:420
本文介绍了什么是非Unicode字符的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

亲爱的朋友。



我知道Unicode字符,但我不知道什么是非Unicode字符。如果有人用例子解释什么是非Unicode字符,那么可以这样做。

Dear Friends.

I know about Unicode characters but I do not know that what are the Non Unicode characters. So can if some one explain with examples that what are the non Unicode characters.

推荐答案

非Unicode字符,就像每个非概念一样,是模糊的。 />
简单英语中的意思是每个字符的身份未通过Unicode表分配。



这个仅仅意味着两件事:

每个数字被视为机器为字符但超过Unicode规范(例如,大于2 21 的32位数或者属于Unicode规范的未分配空格或映射空间。

在其他对于机器而言存在但对于人类读者而言毫无意义的东西。



或者,它可能指的是其身份未通过Unicode规范定义的字符但是来自其他一些尚未被Unicode取代的规范。

例如基于Windows代码页的SBCS / MBCS ,EBCDIC字符集等等。

假设为Unicode的字符将产生除预期之外的字形。



或者更多,你可以称之为字符的所有内容,由你自己定义,独立于Unicode规范。
"Non Unicode character", like every non-concept, is vague.
In plain English means "every character whose identity is not assigned by means of the Unicode tables".

This merely can mean two things:
every "number" that is treated be a machine as "character" but exceed the Unicode specification (for example a 32 bit number greater than 221 or that falls into the "unassigned spaces" or "mapping spaces" of the Unicode specs.
In other word, something that exist for the machine but is meaningless for human reader.

Or, it may refer to character whose identity is not defined by means of the Unicode specification but from some other specification that has not been superseded by Unicode.
For example the SBCS/MBCS based on Windows code-pages, the EBCDIC character set, etc.etc.
Characters that, if assumed as Unicode, will result in glyphs other than the ones that was intended.

Or, even more, everything you can call "character" that is defined by yourself, independently by the Unicode specifications.


Emilio上面提到的EBCDIC(扩展二进制编码的十进制交换代码) )。这是由IBM开发的(和ASCII一样)有256个字符,由0到255表示。



这是维基百科上你需要的所有信息:字符编码 [ ^ ]



甚至提到摩尔斯电码,因为它是一个早期的编码系统!你在做这个项目吗?
Emilio above mentioned EBCDIC (Extended Binary Coded Decimal Interchange Code). This was developed by IBM and (like ASCII) has 256 characters represented by 0 to 255.

Here's all the information you should need on Wikipedia: Character Encoding[^]

Even Morse Code is mentioned as it was an early coding system! Are you doing a project on this?


对Google太懒了?



这里: Unicode和Nonunicode之间的差异 [ ^ ]
Too lazy to Google?

Here: Difference between Unicode and Nonunicode[^]


这篇关于什么是非Unicode字符的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆