SGML Charset explorer..


Problem description

I was recently loading an HTML editor
so I could find the charcode of that
particularly obscure character using the
editor's 'insert special character' dialog.

It occurred to me there had to be a
better way. There are probably dozens,
but here is my solution..
http://www.physci.org/codes/charset.jsp

This page is my 'charset explorer'; it displays
character codes in a table, 456 at a time.

It also has links to a page giving larger
representations of each character. Viz.
http://www.physci.org/codes/char.jsp?char=65
http://www.physci.org/codes/char.jsp?char=84
http://www.physci.org/codes/char.jsp?char=1944

I hope it brings a..
http://www.physci.org/codes/char.jsp?char=9786
...to your mug.

--
Andrew Thompson
* http://www.PhySci.org/ Open-source software suite
* http://www.PhySci.org/codes/ Web & IT Help
* http://www.1point1C.org/ Science & Technology

Solution

"Andrew Thompson" <Se********@www.invalid> wrote:

> I was recently loading an HTML editor
> so I could find the charcode of that
> particularly obscure character using the
> editor's 'insert special character' dialog.
>
> It occurred to me there had to be a
> better way. There are probably dozens,

http://www.eki.ee/letter/ is my reference of choice.

> but here is my solution..
> http://www.physci.org/codes/charset.jsp

http://www.physci.org/codes/charset....8859-1&frame=1
Characters 127 to 159 in ISO-8859-1 (and all other ISO-8859 encodings)
are control characters. You seem to have some Windows-1252 characters
in there instead.

http://www.physci.org/codes/charset....8859-1&frame=2
There are only 256 characters in ISO-8859-1, so where did these come
from?

http://www.physci.org/codes/charset....8859-5&frame=1
Doesn't actually display any Cyrillic characters. Mainly because
you've coded them as &#XXX; and numeric character references in HTML
always refer to Unicode.

Steve

--
"My theories appal you, my heresies outrage you,
I never answer letters and you don't like my tie." - The Doctor

Steve Pugh <st***@pugh.net> <http://steve.pugh.net/>
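
Steve's first point can be checked with a short Python sketch. It is not taken from the thread and says nothing about how charset.jsp is implemented; `decode_both` is a hypothetical helper name. It decodes one byte value under both encodings, using Python's standard codecs, to show that codes 127 to 159 are control characters in ISO-8859-1 while Windows-1252 assigns printable characters to most of them.

```python
import unicodedata
from typing import Tuple


def decode_both(byte_value: int) -> Tuple[str, str]:
    """Decode a single byte as ISO-8859-1 and as Windows-1252."""
    raw = bytes([byte_value])
    latin1 = raw.decode("iso-8859-1")
    # A few Windows-1252 positions (for example 0x81) are unassigned in
    # Python's codec, so substitute U+FFFD there instead of raising.
    cp1252 = raw.decode("cp1252", errors="replace")
    return latin1, cp1252


# In ISO-8859-1 every code from 127 to 159 has general category "Cc" (control).
all_controls = all(
    unicodedata.category(decode_both(b)[0]) == "Cc" for b in range(127, 160)
)

# Code 153 is a control character in ISO-8859-1 but a printable
# character (the trade mark sign) in Windows-1252.
latin1_153, cp1252_153 = decode_both(153)

print("127-159 all controls in ISO-8859-1:", all_controls)
print(f"153 as ISO-8859-1: U+{ord(latin1_153):04X}")
print(f"153 as Windows-1252: U+{ord(cp1252_153):04X}")
```

Under these assumptions, byte 153 maps to different code points in the two encodings, which is the mismatch Steve describes.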


On Wed, 18 Feb 2004, Steve Pugh wrote:

> http://www.physci.org/codes/charset....8859-1&frame=1
> Characters 127 to 159 in ISO-8859-1 (and all other ISO-8859 encodings)
> are control characters. You seem to have some Windows-1252 characters
> in there instead.

Blame your own browser!

> http://www.physci.org/codes/charset....8859-1&frame=2
> There are only 256 characters in ISO-8859-1, so where did these come
> from?

The site is a bit confusing. Only "frame=..." is important for the
displayed characters. One and the same document is then sent with
different charset parameters. That should have no effect - but
actually browsers will take a different typeface for each charset
parameter.

> http://www.physci.org/codes/charset....8859-5&frame=1
> Doesn't actually display any Cyrillic characters.

<http://www.physci.org/codes/charset.jsp?cs=iso-8859-5&frame=5>
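
The point that the charset parameter "should have no effect" on a page built from numeric character references can be illustrated with a minimal Python sketch. The page body below is a made-up example, not the actual output of charset.jsp. It assumes the body is pure ASCII with characters written as `&#NNNN;`, and it contrasts that with a raw byte, whose meaning does depend on the declared charset.

```python
import html

# Hypothetical page body: pure ASCII, with one Cyrillic letter written
# as a numeric character reference.
body = b"<p>&#1044;</p>"

# The same bytes decode to identical markup under each charset label.
decoded = {cs: body.decode(cs) for cs in ("iso-8859-1", "iso-8859-5", "utf-8")}
same_markup = len(set(decoded.values())) == 1

# Resolving the reference yields U+0414 whichever label was used,
# because the number names a Unicode code point.
resolved = html.unescape(decoded["iso-8859-1"])

# A raw byte, by contrast, is interpreted through the declared charset.
raw = b"\xb4"
as_cyrillic = raw.decode("iso-8859-5")  # U+0414, CYRILLIC CAPITAL LETTER DE
as_latin1 = raw.decode("iso-8859-1")    # U+00B4, ACUTE ACCENT

print(same_markup, resolved)
print(f"U+{ord(as_cyrillic):04X}", f"U+{ord(as_latin1):04X}")
```

This matches the observation above: only the numbers in the references decide which characters appear, so changing `cs=` alone cannot turn the table into a different character set.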


Andreas Prilop <nh******@rrzn-user.uni-hannover.de> wrote:

> On Wed, 18 Feb 2004, Steve Pugh wrote:
>
>> http://www.physci.org/codes/charset....8859-1&frame=1
>> Characters 127 to 159 in ISO-8859-1 (and all other ISO-8859 encodings)
>> are control characters. You seem to have some Windows-1252 characters
>> in there instead.
>
> Blame your own browser!

Blame all my own browsers! Every browser I have incorrectly displays,
for example, &#153; as a trademark sign. That's NN4, NN6, NN7, IE5,
IE5.5, IE6, Op5, Op6, Op7, Moz 1.6, Firefox 0.8 and even Lynx.

But the site is claiming that -
"SGML character 153. This is the character "?".
In HTML you would write it:
<p>This is the character "&#153;".</p>"
<http://www.physci.org/codes/char.jsp?char=153>
which is just plain wrong.

>> http://www.physci.org/codes/charset....8859-1&frame=2
>> There are only 256 characters in ISO-8859-1, so where did these come
>> from?
>
> The site is a bit confusing. Only "frame=..." is important for the
> displayed characters. One and the same document is then sent with
> different charset parameters. That should have no effect - but
> actually browsers will take a different typeface for each charset
> parameter.

It really is deeply misleading.

>> http://www.physci.org/codes/charset....8859-5&frame=1
>> Doesn't actually display any Cyrillic characters.
>
> <http://www.physci.org/codes/charset.jsp?cs=iso-8859-5&frame=5>

That is displaying Unicode characters 0401-0500 (rather than the more
useful 0400-04FF).

Steve

--
"My theories appal you, my heresies outrage you,
I never answer letters and you don't like my tie." - The Doctor

Steve Pugh <st***@pugh.net> <http://steve.pugh.net/>
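
Both of Steve's objections can be checked with a small Python sketch. It rests on assumptions that are not confirmed by the site: `frame_range` is a hypothetical helper, and it assumes each frame covers 256 consecutive code points starting at 1, which is consistent with the range 0401-0500 reported for frame=5.

```python
import html
import unicodedata


def frame_range(frame: int, size: int = 256):
    """First and last code point of a frame (assumed layout, see above)."""
    first = (frame - 1) * size + 1
    return first, first + size - 1


# Code point 153 (U+0099) is a control character, not a trade mark sign.
control = chr(153)
control_category = unicodedata.category(control)  # "Cc"

# The trade mark sign is U+2122, so the reference to write is &#8482;.
trademark = chr(0x2122)
trademark_ref = f"&#{ord(trademark)};"

# Assumed frame=5 range compared with the Cyrillic block U+0400..U+04FF.
CYRILLIC_BLOCK = (0x0400, 0x04FF)
shown = frame_range(5)

print(control_category, unicodedata.name(trademark), trademark_ref)
print(f"frame 5: U+{shown[0]:04X}..U+{shown[1]:04X}")
print(f"Cyrillic block: U+{CYRILLIC_BLOCK[0]:04X}..U+{CYRILLIC_BLOCK[1]:04X}")
```

Under this assumed layout each frame is shifted by one against the 256-aligned Unicode blocks, so frame 5 misses U+0400 and picks up U+0500. Python's `html.unescape` follows the later HTML5 parsing rules and maps `&#153;` to U+2122, much as the browsers listed above did. The `chr(153)` line models the reading Steve argues from, in which numeric character references always refer to Unicode.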

