Python lxml.html XPath“属性不等于"操作员未按预期工作 [英] Python lxml.html XPath "attribute not equal" operator not working as expected

查看：60 发布时间：2020/5/4 8:26:14 python html xpath screen-scraping lxml

本文介绍了Python lxml.html XPath“属性不等于"操作员未按预期工作的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我正在尝试运行以下脚本:

I'm trying to run the following script:

#!python

from urllib import urlopen #urllib.request for python3
from lxml import html

url =   'http://mpk.lodz.pl/rozklady/1_11_D2D3/00d2/00d2t001.htm?r=KOZINY'+\
        '%20-%20Srebrzy%F1ska,%20Cmentarna,%20Legion%F3w,%20pl.%20Wolno%B6ci'+\
        ',%20Pomorska,%20Kili%F1skiego,%20Przybyszewskiego%20-%20LODOWA'

raw_html = urlopen(url).read()
tree = html.fromstring(raw_html) #need to .decode('windows-1250') in python3
ret = tree.xpath('//td [@class!="naglczas"]')
print ret
assert(len(ret)==1)

我希望它选择一个未将其类别设置为"naglczas"的td.相反，它向我返回了一个空列表.这是为什么?我想这是有一些愚蠢的原因，但我尝试使用Google谷歌搜索，但没有发现任何可以解释的原因.

I expect it to select the one td that doesn't have its class set to 'naglczas'. Instead, it returns me an empty list. Why is that? I guess there's some silly reason, but I tried googling and found nothing that would explain it.

推荐答案

您的xpath表达式将找到

Your xpath expression will find

td元素的类不是"naglczas"

a td element that has a class which is not "naglczas"

您似乎想要(因为同一个班级的仅有3个td-s具有您不想要的班级)

You seem to want(since the only 3 td-s with a class have the same class you don't want)

一个没有"naglczas"类的td元素

a td element which does not have a class of "naglczas"

这些听起来可能很相似，但是却有所不同. 像

Those might sound similar, but they are different. Something like

tree.xpath('//td[not(@class="naglczas")]')

应该为您带来想要的东西.

should get you what you want.

另外，您不需要使用urllib打开url，lxml可以使用lxml.html.parse()为您完成此操作.

Also, you don't need to use urllib to open the url, lxml can do that for you, using lxml.html.parse().

这篇关于Python lxml.html XPath“属性不等于"操作员未按预期工作的文章就介绍到这了，希望我们推荐的答案对大家有所帮助，也希望大家多多支持IT屋！

查看全文

Python lxml.html XPath“属性不等于"操作员未按预期工作 [英] Python lxml.html XPath "attribute not equal" operator not working as expected

问题描述

推荐答案

相关文章

前端开发最新文章

热门教程

热门工具

登录关闭

Python lxml.html XPath“属性不等于"操作员未按预期工作 [英] Python lxml.html XPath &quot;attribute not equal&quot; operator not working as expected

问题描述

推荐答案

相关文章

前端开发最新文章

热门教程

热门工具

登录 关闭

Python lxml.html XPath“属性不等于"操作员未按预期工作 [英] Python lxml.html XPath "attribute not equal" operator not working as expected

登录关闭