基于 Sublime Text 3 的 Python 2.7 不打印“\uFFFD"字符 [英] Python 2.7 build on Sublime Text 3 doesn't print the '\uFFFD' character

查看：86 发布时间：2021/6/26 19:08:43 python python-2.7 unicode sublimetext3 stdout

本文介绍了基于 Sublime Text 3 的 Python 2.7 不打印“\uFFFD"字符的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我使用的是基于 Sublime Text 3 的 Python 2.7，但在打印时遇到问题.
在某些情况下，对于 '\uFFFD' - 'REPLACEMENT CHARACTER' 的输出非常混乱.

I'm using Python 2.7 build on Sublime Text 3 and have an issue with printing out.
In some cases I get a pretty confusing output for '\uFFFD' - the 'REPLACEMENT CHARACTER'.

例如:

print u'\ufffd' # should be '�' - the 'REPLACEMENT CHARACTER'
print u'\u0061' # should be 'a'
-----------------------------------------------------
[Finished in 0.1s]

倒序后:

print u'\u0061' 
print u'\ufffd'
-----------------------------------------------------
a
�
[Finished in 0.1s]

所以，Sublime 可以打印出 ' ' 字符，但由于某种原因在第一种情况下不能这样做.
而且输出对语句顺序的依赖似乎很奇怪.

So, Sublime can printout the '�' character, but for some reason doesn't do it in the 1st case.
And the dependence of the output on the order of statements seems quite strange.

替换字符的问题通常会导致非常不可预测的打印输出行为.
例如，我想打印出带有错误替换的解码字节:

The problem with replacement char leads to very unpredictable printout behavior in general.
For example, I want to printout decoded bytes with error replacement:

cp1251_bytes = '\xe4\xe0' # 'да' in cp1251 
print cp1251_bytes.decode('utf-8', errors='replace')
-----------------------------------------------------
��
[Finished in 0.1s]

让我们替换字节:

cp1251_bytes = '\xed\xe5\xf2' # 'нет' in cp1251
print cp1251_bytes.decode('utf-8', errors='replace')
-----------------------------------------------------
[Finished in 0.1s]

再添加一个打印语句:

cp1251_bytes = '\xed\xe5\xf2' # 'нет' in cp1251 
print cp1251_bytes.decode('cp1251') 
print cp1251_bytes.decode('utf-8', errors='replace')
-----------------------------------------------------
нет
���
[Finished in 0.1s]

<小时>

以下是一些其他测试用例的实现说明:

Below is the illustration of implementation some other test cases:

总结，在描述的打印输出行为中有以下模式:

这取决于打印语句中 '\ufffd' 字符的偶数/奇数

这取决于打印语句的顺序

这取决于具体的构建运行

Summarizing, there are the following patterns in the described printout behavior:

it depends on the even/odd number of '\ufffd' chars in print statement

it depends on the order of print statements

it depends on the specific build run

为什么会发生这种情况?

如何解决问题?

Why does this happen?

How to fix the problem?

我的 Python 2.7 sublime-build 文件:

My Python 2.7 sublime-build file:

{   
    "cmd": ["C:\\_Anaconda3\\envs\\python27\\python", "-u", "$file"],
    "file_regex": "^[ ]*File \"(...*?)\", line ([0-9]*)",
    "selector": "source.python",
    "env": {"PYTHONIOENCODING": "utf-8"}
}

Python 2.7 与 Anaconda 分开安装，行为完全相同.

With Python 2.7 installed separately from Anaconda the behavior is exactly the same.

基于 Sublime Text 3 的 Python 2.7 不打印“\uFFFD"字符 [英] Python 2.7 build on Sublime Text 3 doesn't print the '\uFFFD' character

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录关闭

基于 Sublime Text 3 的 Python 2.7 不打印“\uFFFD"字符 [英] Python 2.7 build on Sublime Text 3 doesn&#39;t print the &#39;\uFFFD&#39; character

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录 关闭

基于 Sublime Text 3 的 Python 2.7 不打印“\uFFFD"字符 [英] Python 2.7 build on Sublime Text 3 doesn't print the '\uFFFD' character

登录关闭