Nokogiri 可以解释 javascript 吗?- 网页抓取 [英] Can Nokogiri interpret javascript? - Web Scraping
问题描述
我们正在尝试抓取此页面上的可用性:http://www.equityapartments.com/new-york/new-york-city-apartments/midtown-west/mantena-apartments.aspx
We are trying to scrape the availabilities on this page: http://www.equityapartments.com/new-york/new-york-city-apartments/midtown-west/mantena-apartments.aspx
我需要使用我的蜘蛛来选择所有平面图"并获取所有可用性.但我相信数据实际上是通过 Javascript 请求发送的.有没有办法让我的 Nokogiri 蜘蛛渲染它?或者模拟点击按钮的过程?
I need to use my spider to select on the "All Floorplans" and fetch all the availabilities. But the data are actually sent through Javascript request I believe. Is there a way for my Nokogiri spider to render it? Or maybe simulate the process of clicking on buttons?
推荐答案
Nokogiri 只是一个
Nokogiri is just a parser. It also allows to search content.
要与网页交互,您需要使用其他东西,例如Watir 和 PhantomJS>.
To interact with web pages you need to use something else, e.g. Watir and PhantomJS.
将它们组合在一起:
browser = Watir::Browser.new(:phantomjs)
browser.goto(your_url_above)
browser.link(text: 'All floorplans').click
document = Nokogiri::HTML(browser.html)
document.search(...)
这篇关于Nokogiri 可以解释 javascript 吗?- 网页抓取的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!