Why does printing HTML code as a string give hexadecimal numbers as output in Python?

Views: 76 | Published: 2020/11/24 2:34:31 | Tags: python html regex


Problem description

I wrote some Python code to modify the contents of my HTML file. But when I write the modified contents back to the HTML file, I get weird hexadecimal numbers.

import re

search = "www.abc.com"
description = "blah blah"

f = open('myhtml.html', 'r+')
content = f.read()

# Extract the middle part of the domain ("abc" from "www.abc.com")
exp_keyword = re.compile(r'\.(\S+)\.')
reducedSearch = exp_keyword.findall(search)[0]

# Match from "abc." up to the last </a> that follows it (greedy .+ with DOTALL)
regexLink = re.compile(reducedSearch + r'\.' + r'.+' + '</a>', re.DOTALL)
matchregexLink = regexLink.search(content)
endOfMatch = matchregexLink.span()[1]

# Slice the string at the end of the match and insert the description there
s1 = content[:endOfMatch]
s2 = content[endOfMatch:]

content = s1 + description + s2
print(content)
f.truncate(0)
f.write(content)

<html>
 <head>
 </head>
 <body>
  <div id="phy">
   <p>
    ett
   </p>
   <div class="links">
    <ul>
     <a href="www.abcd.com">
      Link
     </a>
     <a href="www.abc.com">
      Link
     </a>
    </ul>
   </div>
  </div>
 </body>
</html>

0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 003c 6874 6d6c 3e0a
203c 6865 6164 3e0a 203c 2f68 6561 643e
0a20 3c62 6f64 793e 0a20 203c 6469 7620
6964 3d22 7068 7922 3e0a 2020 203c 703e
0a20 2020 2065 7474 0a20 2020 3c2f 703e
0a20 2020 3c64 6976 2063 6c61 7373 3d22
6c69 6e6b 7322 3e0a 2020 2020 3c75 6c3e
0a20 2020 2020 3c61 2068 7265 663d 2277
7777 2e61 6263 642e 636f 6d22 3e0a 2020
2020 2020 4c69 6e6b 0a20 2020 2020 3c2f
613e 0a20 2020 2020 3c61 2068 7265 663d
2277 7777 2e61 6263 2e63 6f6d 223e 0a20
2020 2020 204c 696e 6b0a 2020 2020 203c
2f61 3e62 6c61 6820 626c 6168 0a20 2020
203c 2f75 6c3e 0a20 2020 3c2f 6469 763e
0a20 203c 2f64 6976 3e0a 203c 2f62 6f64
793e 0a3c 2f68 746d 6c3e 0a

These weird hexadecimal numbers are what I get as output in the file. However, when I print content in the code, it gives the correct result. Why is that? My expected result is blah blah written right after the closing </a> tag of the link containing www.abc.com.
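A likely explanation, offered as a sketch rather than a confirmed diagnosis: f.read() leaves the file position at the end of the file, and f.truncate(0) shrinks the file without moving that position. The following f.write() then starts at the old offset, so the gap before it is filled with NUL bytes. That fits the dump above, where the leading 0000 groups are zero bytes and 3c68 746d 6c3e decodes to <html>. Some editors fall back to a hex view when a file contains NUL bytes, which would explain why the file looks "hexadecimal" while print(content) looks fine. The example below reproduces this on a temporary file and compares it with calling f.seek(0) before truncating. The helper names rewrite and run, the seek_first flag, and the one-line stand-in for myhtml.html are assumptions made for illustration.

```python
import os
import tempfile

# Hypothetical stand-in for the contents of myhtml.html
original = '<a href="www.abc.com">Link</a>\n'
description = "blah blah"


def rewrite(path, seek_first):
    """Read the file, append `description`, and write it back in place."""
    with open(path, "r+", encoding="utf-8", newline="") as f:
        content = f.read()       # the file position is now at end of file
        if seek_first:
            f.seek(0)            # move back to the start before truncating
        f.truncate(0)            # resizes the file; does not move the position
        f.write(content + description)


def run(seek_first):
    """Create a temp file, rewrite it, and return the resulting raw bytes."""
    fd, path = tempfile.mkstemp(suffix=".html")
    os.close(fd)
    try:
        with open(path, "w", encoding="utf-8", newline="") as f:
            f.write(original)
        rewrite(path, seek_first)
        with open(path, "rb") as f:
            return f.read()
    finally:
        os.remove(path)


buggy = run(seek_first=False)    # same order of calls as in the question
fixed = run(seek_first=True)     # with f.seek(0) added before f.truncate(0)

print(buggy)
print(fixed)
```

Applied to the code in the question, this would mean adding f.seek(0) just before f.truncate(0). Another option is to read the file in one open() call and write it back in a second one opened with mode 'w', which avoids sharing a file position between the read and the write.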
