我需要帮助,努力使我的代码在UTF-8编码上标准化 [英] need help with agonizing struggle to standardize my code on UTF-8 encoding

查看:52
本文介绍了我需要帮助,努力使我的代码在UTF-8编码上标准化的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我正在尝试为我的网站提供UTF-8的字符编码。在过去的6年中,

网站以hodge-podge的方式建成。

验证器告诉我,我有很多不属于

UTF-8编码的角色。除了手动更改它们之外,还有谁能想到找到并转换这些字符的聪明方法?

http://validator.w3.org/check?uri=ht...krubner。 com%2F

I''m just now trying to give my site a character encoding of UTF-8. The
site has been built in a hodge-podge way over the last 6 years. The
validator tells me I''ve lots of characters that don''t belong to the
UTF-8 encoding. Other than changing them by hand, can anyone think of
a clever way to find and convert these characters?

http://validator.w3.org/check?uri=ht...krubner.com%2F

推荐答案



" lawrence" < LK ****** @ geocities.com>在消息中写道

news:da ************************** @ posting.google.c om ...

"lawrence" <lk******@geocities.com> wrote in message
news:da**************************@posting.google.c om...
我正在尝试为我的网站提供UTF-8的字符编码。
网站在过去的6年里一直以hodge-podge的方式建造。
验证器告诉我,我有很多不属于
UTF-8编码的角色。除了手动更改它们之外,有没有人会想到找到并转换这些字符的聪明方法?

http://validator.w3.org/check?uri=ht...krubner.com%2F
I''m just now trying to give my site a character encoding of UTF-8. The
site has been built in a hodge-podge way over the last 6 years. The
validator tells me I''ve lots of characters that don''t belong to the
UTF-8 encoding. Other than changing them by hand, can anyone think of
a clever way to find and convert these characters?

http://validator.w3.org/check?uri=ht...krubner.com%2F




您是否正在寻找一种比阅读错误消息更聪明的方式

以上链接给出的你好吗?


对不起,我无法验证这个文件,因为839号线,846号线,

856,929,931,933,935, 937,939,1075,1079,1081,1083,1085,1087,1089,

1091,1139,1159,1296,1298,1416,1456,1464,1502,1508,1512,1522,

1524,1533,1537,1565,1567,1569,1571,1640,1648,1650,1900,1926,2182

它包含一个或多个字节,我不能解释为utf-8(在其他

字中,找到的字节不是指定字符中的有效值
编码)。请检查文件的内容和字符

编码指示。


在查看第839行后,我首先猜测了源代码

问题,在看到第846行和第856行后,我表示肯定。



Are you looking for a way that''s more clever than reading the error message
that the above link gives you?

"Sorry, I am unable to validate this document because on lines 839, 846,
856, 929, 931, 933, 935, 937, 939, 1075, 1079, 1081, 1083, 1085, 1087, 1089,
1091, 1139, 1159, 1296, 1298, 1416, 1456, 1464, 1502, 1508, 1512, 1522,
1524, 1533, 1537, 1565, 1567, 1569, 1571, 1640, 1648, 1650, 1900, 1926, 2182
it contained one or more bytes that I cannot interpret as utf-8 (in other
words, the bytes found are not valid values in the specified Character
Encoding). Please check both the content of the file and the character
encoding indication."

After looking at line 839 I had my first guess as to the source of the
problem and after seeing lines 846 and 856 I was positive.


文章< da **** **********************@posting.google.com>,
lk ****** @ geocities.com (劳伦斯)写道:
In article <da**************************@posting.google.com >,
lk******@geocities.com (lawrence) writes:
我正在尝试为我的网站提供UTF-的字符编码8。


为什么?它目前是否已经破损?

网站在过去的6年里一直是以hodge-podge的方式建造的。


这没有什么不妥。

UTF-8编码。除了手动更改它们之外,还有谁能想到找到并转换这些字符的聪明方法?
I''m just now trying to give my site a character encoding of UTF-8. The
Why? Is it currently broken?
site has been built in a hodge-podge way over the last 6 years. The
Nothing necessarily wrong with that.
UTF-8 encoding. Other than changing them by hand, can anyone think of
a clever way to find and convert these characters?




man iconv。


-

Nick Kew


尼克的宣言: http://www.htmlhelp.com/~nick/


"哈兰·梅辛格(Harlan Messinger) < H ********* @ comcast.net>在消息新闻中写道:< 2s ************* @ uni-berlin.de> ...
"Harlan Messinger" <h.*********@comcast.net> wrote in message news:<2s*************@uni-berlin.de>...
" lawrence" < LK ****** @ geocities.com>在消息中写道
新闻:da ************************** @ posting.google.c om ...
"lawrence" <lk******@geocities.com> wrote in message
news:da**************************@posting.google.c om...
我正在尝试为我的网站提供UTF-8的字符编码。
网站在过去的6年里一直以hodge-podge的方式建造。
验证器告诉我,我有很多不属于
UTF-8编码的角色。除了手动更改它们之外,有没有人会想到找到并转换这些字符的聪明方法?

http://validator.w3.org/check?uri=ht...krubner.com%2F
你是否正在寻找一种比阅读上述链接给你的错误信息更聪明的方式?
I''m just now trying to give my site a character encoding of UTF-8. The
site has been built in a hodge-podge way over the last 6 years. The
validator tells me I''ve lots of characters that don''t belong to the
UTF-8 encoding. Other than changing them by hand, can anyone think of
a clever way to find and convert these characters?

http://validator.w3.org/check?uri=ht...krubner.com%2F
Are you looking for a way that''s more clever than reading the error message
that the above link gives you?




我不确定我理解你。阅读错误消息并没有告诉我如何解决问题。我正在寻找某种类型的脚本,

可能会将当前编码中的字符大量转换为UTF-8。



"对不起,我无法验证此文档,因为在第839,846行,
856,929,931,933,935,937,939,1075,1079,1081,1083,1085,1087,1089,
......

看完第839行之后,我第一次猜到了问题的根源,在看到第846行和第856行后,我感到非常积极。



I''m not sure I understand you. Reading the error message doesn''t tell
me how to fix the problem. I''m looking for some kind of script that
might mass convert characters from the current encodings to UTF-8.


"Sorry, I am unable to validate this document because on lines 839, 846,
856, 929, 931, 933, 935, 937, 939, 1075, 1079, 1081, 1083, 1085, 1087, 1089, ......

After looking at line 839 I had my first guess as to the source of the
problem and after seeing lines 846 and 856 I was positive.




这很好,但你为什么不告诉我? 839行是这样的:


< / div>


第846行以单词This开头。


你看到了什么模式?你的回复中有点神秘。



That''s nice, but why don''t you tell me? Line 839 is this:

</div>

Line 846 starts with the word "This".

What pattern are you seeing? You''re being a bit cryptic in your reply.


这篇关于我需要帮助,努力使我的代码在UTF-8编码上标准化的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆