什么是用于表示组合字符的unicode字形? [英] What's the unicode glyph used to indicate combining characters?

查看:815
本文介绍了什么是用于表示组合字符的unicode字形?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我的应用程序需要显示孤立组合字符。我想使用与官方Unicode图表相同的格式,使用虚线圆圈占位符。例如,请参阅:



快速浏览图表, U + 25CC圆圈。这看起来不错,但是这个字符的注释是:

lockquote

注意这个
字符的引用字形有意地大于
用于
的虚线圆形字形表示在此
标准中组合字符;例如0300


说(我认为)U + 25CC不是正确的字符。 (或者,如果是的话,也许只是措辞不佳而已)。
$ b 因此:如果组合变音标记上使用的虚线圆圈不是U + 25CC,这个小鼻屎的正确代码是什么?



我试过了:


  • 从PDF中复制文本并检查文本,但在PDF中禁用了副本。
  • 在Gmail中通过电子邮件发送给我自己,然后查看附件为HTML,但有转换为U + 0024(美元符号)。这意味着要么转换失败,要么只是在PDF中玩一些字体渲染游戏。


[澄清] 我意识到,U + 25CC看起来不错(假设一个人的字体支持它),但它听起来像规范说这是错误的字符。许多Unicode字符具有相似的字形,但在语义上是不同的字符。 拉丁大写字母A(U + 0041)和希腊大写字母阿尔法(U + 0391)对于大多数字体来说看起来是一样的,但是它们有不同的语义含义,不可互换。

解决方案

我不认为是一个官方占位符。我读到这个笔记的方式,他们任意选择 U + 25CC ,纯粹是为了显示目的。然后,在图表中列出真实的虚线圆圈,它有点大,强调它是不是在那里用作占位符。 (或者,也许他们缩小了在其他图表;正如你所说,这个说明措辞不佳)。

无论如何,我没有看到任何理由不使用 U + 25CC 作为您的占位符。


My application needs to display "orphaned" combining characters. I would like to use the same format as the "official" unicode charts, using the dotted circle placeholder. See, for example:

A quick scan through the charts and I came up with U+25CC "DOTTED CIRCLE". That looks good, but the note on this character reads:

note that the reference glyph for this character is intentionally larger than the dotted circle glyph used to indicate combining characters in this standard; see, for example, 0300

Which says (I think) that U+25CC is not the correct character. (Or, if it is, perhaps just a poorly worded note.)

So: if the dotted circle used on the "Combining Diacritical Marks" is not U+25CC, what is the correct code for that little booger?

I have tried:

  • Copying the text from the PDF and inspecting it, but the copy is disabled in the PDF.
  • Emailing it to myself in Gmail and then viewing the attachment as HTML, but there is gets converted to U+0024 ("DOLLAR SIGN"). Which means that either the conversion failed or they are just playing some font rendering games in the PDF.

[Clarification] I realize that the U+25CC looks OK (assuming one's font supports it), but it sounds like the spec says that this is the wrong character. Many unicode characters have similar glyphs but are different characters, semantically speaking. "Latin Capital Letter A" (U+0041) and "Greek Capital Letter Alpha" (U+0391) will look identical for most fonts, but they have different semantic meanings and are not interchangable.

解决方案

I don't think there is an official placeholder character. The way I read that note, they chose U+25CC arbitrarily, purely for display purposes. Then, in the chart where the "real" dotted circle is listed, they made it a little larger to emphasize that it's not being used as a placeholder there. (Or maybe they shrunk it in the other charts; as you said, the note's poorly worded.)

Whatever the case, I don't see any reason not to use U+25CC as your placeholder.

这篇关于什么是用于表示组合字符的unicode字形?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆