在python中获取html标签值 [英] Getting html tag value in python
本文介绍了在python中获取html标签值的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!
问题描述
import urllib2
import sys
url =mydomain.com
usock = urllib2.urlopen(url)
data = usock.read()
usock.close()
打印数据
获得这样的HTML标记并且它可以工作。
我想要做的是从< font class =big>< / font>
标签。为前。我需要这个例子的数据值:
< font class =big>数据< / font>
如何操作?
解决方案
您可以使用HTML解析器模块,例如 BeautifulSoup
:
from bs4导入BeautifulSoup为BS
url = mydomain.com
usock = urllib2.urlopen(url)
data = usock.read()
usock.close()
soup = BS(data)
打印soup.find('font',{'class':'big'})。text
这发现一个标记< font>
,其中 class =big
。然后打印它的内容。
I'm newbie to python. Here is my code working on python 2.7.5
import urllib2
import sys
url ="mydomain.com"
usock = urllib2.urlopen(url)
data = usock.read()
usock.close()
print data
Getting HTML markup like that and it works.
What I want to do is, to get value from inside <font class="big"></font>
tag. for ex. I need data value from this example:
<font class="big">Data</font>
How to do it?
解决方案
You can use a HTML parser module such as BeautifulSoup
:
from bs4 import BeautifulSoup as BS
url ="mydomain.com"
usock = urllib2.urlopen(url)
data = usock.read()
usock.close()
soup = BS(data)
print soup.find('font', {'class':'big'}).text
This finds a tag <font>
with a class="big"
. It then prints its content.
这篇关于在python中获取html标签值的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!
查看全文