有哪些不错的基于 Ruby 的网络爬虫? [英] What are some good Ruby-based web crawlers?
问题描述
我正在考虑自己编写,但我想知道是否有任何用 Ruby 编写的优秀网络爬虫.
I am looking at writing my own, but I am wondering if there are any good web crawlers out there which are written in Ruby.
除了成熟的网络爬虫之外,任何可能有助于构建网络爬虫的宝石都会很有用.我知道这部分问题在几个地方都涉及到,但适用于构建网络爬虫的宝石列表也将是一个很好的资源.
Short of a full-blown web crawler, any gems that might be helpful in building a web crawler would be useful. I know this part of the question is touched upon in a couple of places, but a list of gems applicable to building a web crawler would be a great resource as well.
推荐答案
我正在构建 wombat,这是一种用于抓取网页和提取内容的 Ruby DSL.在 github 上查看 https://github.com/felipecsl/wombat
I am building wombat, a Ruby DSL to crawl web pages and extract content. Check it out on github https://github.com/felipecsl/wombat
它仍处于早期阶段,但已经具备基本功能.很快就会添加更多内容.
It is still in an early stage but is already functional with basic functionality. More stuff will be added really soon.
这篇关于有哪些不错的基于 Ruby 的网络爬虫?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!