Perl 的使用编码编译指示破坏 UTF 字符串 [英] Perl's use encoding pragma breaking UTF strings

查看：63 发布时间：2021/6/15 20:48:59 perl

本文介绍了Perl 的使用编码编译指示破坏 UTF 字符串的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

我对 Perl 和编码编译指示有疑问.

I have a problem with Perl and Encoding pragma.

(我在任何地方都使用 utf-8，在输入、输出、perl 脚本本身.我不想使用其他编码，永远不会.)

(I use utf-8 everywhere, in input, output, the perl scripts themselves. I don't want to use other encoding, never ever.)

不过.当我写

binmode(STDOUT, ':utf8');
use utf8;
$r = "\x{ed}";
print $r;

我看到字符串 "í"(这是我想要的 - 并且什么是 U+00ED Unicode 字符).但是当我像这样添加使用编码"编译指示时

I see the string "í" (which is what I want - and what is U+00ED unicode char). But when I add the "use encoding" pragma like this

binmode(STDOUT, ':utf8');
use utf8;
use encoding 'utf8';
$r = "\x{ed}";
print $r;

我看到的只是一个盒子字符.为什么?

all I see is a box character. Why?

此外，当我添加 Data::Dumper 并让 Dumper 像这样打印新字符串时

Moreover, when I add Data::Dumper and let the Dumper print the new string like this

binmode(STDOUT, ':utf8');
use utf8;
use encoding 'utf8';
$r = "\x{ed}";
use Data::Dumper;
print Dumper($r);

我看到 perl 将字符串 更改为 "\x{fffd}".为什么?

I see that perl changed the string to "\x{fffd}". Why?

Perl 的使用编码编译指示破坏 UTF 字符串 [英] Perl&#39;s use encoding pragma breaking UTF strings