scrapy and xpath function 'matches' syntax

Views: 70 | Published: 2021/7/6 20:10:09 | Tags: regex, xpath, scrapy

Problem description

I'm running Scrapy 0.20.2.

$ scrapy shell "http://newyork.craigslist.org/ata/"

I would like to build a list of all links to advertisement pages, leaving out the index pages (such as index100.html).

$ sel.xpath('//a[contains(@href,"html")]')
... 
<Selector xpath='//a[contains(@href,"html")]' data=u'<a href="/mnh/atq/4243973984.html">Wicke'>,
<Selector xpath='//a[contains(@href,"html")]' data=u'<a href="/mnh/atd/4257230057.html" class'>,
<Selector xpath='//a[contains(@href,"html")]' data=u'<a href="/mnh/atd/4257230057.html">Recla'>,
<Selector xpath='//a[contains(@href,"html")]' data=u'<a href="/ata/index100.html" class="butt'>]

I would like to use the XPath matches function to select the links whose href has the form of the regex [0-9]+.html.

$ sel.xpath('//a[matches(@href,"[0-9]+.html")]')
...
ValueError: Invalid XPath: //a[matches(@href,"[0-9]+.html")]

What's wrong? Thanks.
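A note on the likely cause, with a small sketch: matches() is defined in XPath 2.0, while Scrapy selectors are built on lxml, which implements XPath 1.0, so the expression is rejected as invalid. One possible workaround is to extract the href values and filter them with Python's re module. The sketch below uses only the standard library so it can run without Scrapy. The sample markup, the names SAMPLE_HTML, AD_LINK, HrefCollector and ad_links, and the tightened pattern are all assumptions made for illustration, not part of the original question. Note that the pattern as written in the question, [0-9]+.html, would also match /ata/index100.html, because the dot is unescaped and the match is not anchored.

```python
import re
from html.parser import HTMLParser

# Hypothetical sample markup, modelled on the selector output shown above.
SAMPLE_HTML = """
<a href="/mnh/atq/4243973984.html">first ad</a>
<a href="/mnh/atd/4257230057.html" class="i">second ad</a>
<a href="/ata/index100.html" class="button">next</a>
<a href="/about/help">help</a>
"""

# Assumed pattern: escape the dot and require that the last path segment
# consists only of digits followed by ".html".
AD_LINK = re.compile(r"/[0-9]+\.html$")


class HrefCollector(HTMLParser):
    """Collect the href attribute of every <a> start tag."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href is not None:
                self.hrefs.append(href)


def ad_links(html):
    """Return the hrefs that look like advertisement pages, in document order."""
    collector = HrefCollector()
    collector.feed(html)
    return [h for h in collector.hrefs if AD_LINK.search(h)]


print(ad_links(SAMPLE_HTML))
```

Inside the Scrapy shell, the same filtering idea could be applied to the strings returned by sel.xpath('//a/@href').extract(). lxml also supports the EXSLT regular-expression function re:test when the namespace http://exslt.org/regular-expressions is registered. Whether that namespace is already registered on selectors in Scrapy 0.20.2 is not something this sketch assumes.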
