HTMLParser.HTMLParser().unescape()不起作用 [英] HTMLParser.HTMLParser().unescape() doesn't work

查看：52 发布时间：2021/5/14 20:36:32 python html unicode

本文介绍了HTMLParser.HTMLParser().unescape()不起作用的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

我想将HTML实体转换回其可读格式，例如'& pound;'转换为'£'，'& deg;'转换为'°'等.

I would like to convert HTML entities back to its human readable format, e.g. '£' to '£', '°' to '°' etc.

我已经阅读了有关此问题的几篇文章

I've read several posts regarding this question

根据他们的说法，我选择使用未记录的函数unescape()，但它对我不起作用...

and according to them, I chose to use the undocumented function unescape(), but it doesn't work for me...

我的代码示例如下:

import HTMLParser

htmlParser = HTMLParser.HTMLParser()
decoded = htmlParser.unescape('&copy; 2013')
print decoded

当我运行此python脚本时，输出仍然是:

When I ran this python script, the output is still:

&copy; 2013

代替

© 2013

我正在使用Python 2.X，可在Windows 7和Cygwin控制台上使用.我用Google搜索，没有发现任何类似的问题.有人可以帮助我吗?

I'm using Python 2.X, working on Windows 7 and Cygwin console. I googled and didn't find any similar problems..Could anyone help me with this?

HTMLParser.HTMLParser().unescape()不起作用 [英] HTMLParser.HTMLParser().unescape() doesn&#39;t work