解码编码UTF-8不会导致原始unicode [英] decode-encode UTF-8 doesn't lead to the original unicode

查看：128 发布时间：2020/7/13 4:51:13 python unicode encoding utf-8 decoding

本文介绍了解码编码UTF-8不会导致原始unicode的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

当我尝试通过对两个Unicode字符进行解码和再次编码来分隔两个Unicode字符时，我没有得到相同的Unicode，但是得到了一个不同的Unicode字符.

When I am trying to separate two Unicode characters by decoding and encoding them again I do not get the same Unicode in return but I get a different one.

我尝试这样做时附有答复.

Attached are the responses when I try to do so.

>>> s ='\xf0\x9f\x93\xb1\xf0\x9f\x9a\xac'
>>> u = s.decode("utf-8")
>>> u
u'\U0001f4f1\U0001f6ac'
>>> u[0].encode("utf-8")
'\xed\xa0\xbd'
>>> u[1].encode("utf-8")
'\xed\xb3\xb1'
>>> u[0]
u'\ud83d'
>>> u[1]
u'\udcf1'

解码编码UTF-8不会导致原始unicode [英] decode-encode UTF-8 doesn't lead to the original unicode

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录关闭

解码编码UTF-8不会导致原始unicode [英] decode-encode UTF-8 doesn&#39;t lead to the original unicode

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录 关闭

解码编码UTF-8不会导致原始unicode [英] decode-encode UTF-8 doesn't lead to the original unicode

登录关闭