我如何 .decode('string-escape') 在 Python3 中? [英] how do I .decode('string-escape') in Python3?

查看:127
本文介绍了我如何 .decode('string-escape') 在 Python3 中?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我有一些需要转义的转义字符串.我想在 Python 中执行此操作.

例如,在python2.7中我可以这样做:

<预><代码>>>>"\123something special".decode('string-escape')'特别的东西'>>>

我如何在 Python3 中做到这一点?这不起作用:

<预><代码>>>>b"\123something special".decode('string-escape')回溯(最近一次调用最后一次):文件<stdin>",第 1 行,在 <module> 中LookupError:未知编码:字符串转义>>>

我的目标是成为 abel 接受这样的字符串:

s00u00p00p00o00r00t00@00p00s00i00l00o00c00.00c00o00m00

然后变成:

"support@psiloc.com"

在我进行转换后,我会检查我所拥有的字符串是用 UTF-8 还是 UTF-16 编码的.

解决方案

如果你想str-to-str 解码转义序列,那么输入和输出都是统一码:

def string_escape(s, encoding='utf-8'):return (s.encode('latin1') # 到字节,'unicode-escape'需要.decode('unicode-escape') # 执行实际的八进制转义解码.encode('latin1') # 1:1 映射回字节.decode(encoding)) # 解码原始编码

测试:

>>>string_escape('\123something special')'特别的东西'>>>string_escape(r's00u00p00p00o00r00t00@'r'00p00s00i00l00o00c00.00c00o00m00','utf-16-le')'support@psiloc.com'

I have some escaped strings that need to be unescaped. I'd like to do this in Python.

For example, in python2.7 I can do this:

>>> "\123omething special".decode('string-escape')
'Something special'
>>> 

How do I do it in Python3? This doesn't work:

>>> b"\123omething special".decode('string-escape')
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
LookupError: unknown encoding: string-escape
>>> 

My goal is to be abel to take a string like this:

s00u00p00p00o00r00t00@00p00s00i00l00o00c00.00c00o00m00

And turn it into:

"support@psiloc.com"

After I do the conversion, I'll probe to see if the string I have is encoded in UTF-8 or UTF-16.

解决方案

If you want str-to-str decoding of escape sequences, so both input and output are Unicode:

def string_escape(s, encoding='utf-8'):
    return (s.encode('latin1')         # To bytes, required by 'unicode-escape'
             .decode('unicode-escape') # Perform the actual octal-escaping decode
             .encode('latin1')         # 1:1 mapping back to bytes
             .decode(encoding))        # Decode original encoding

Testing:

>>> string_escape('\123omething special')
'Something special'

>>> string_escape(r's00u00p00p00o00r00t00@'
                  r'00p00s00i00l00o00c00.00c00o00m00',
                  'utf-16-le')
'support@psiloc.com'

这篇关于我如何 .decode('string-escape') 在 Python3 中?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆