什么是最好的屏幕抓取语言? [英] Whats the best screen scraping language?

查看:40
本文介绍了什么是最好的屏幕抓取语言?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我想创建一个桌面应用程序 (c# prob),它可以抓取或操作 3rd 方网页上的表单.基本上我在桌面应用程序的表单中输入我的数据,它会转到第 3 方网站,并使用脚本或后台的任何内容,在那里输入我的数据(包括我的登录名)并为我单击提交按钮.我只是想避免加载浏览器!

Hi I want to create a desktop app (c# prob) that scrapes or manipulates a form on a 3rd party web page. Basically I enter my data in the form in the desktop app, it goes away to the 3rd party website and, using the script or whatever in the background, enters my data there (incl my login) and clicks the submit button for me.I just want to avoid loading up the browser!

在这方面没有做太多(任何!)工作,我想知道像 perl、python、ruby 等脚本语言是否允许我这样做?或者只是使用 c# 和 .net 完成所有抓取工作?哪个是最好的 IYO?

Not having done much (any!) work in this area I was wondering would a scripting language like perl, python, ruby etc allow me to do such? Or simply do it all the scraping using c# and .net? Which one is best IYO?

我认为脚本可能需要从不同平台上的应用程序(例如,symbian mobile 中我不能像桌面版本那样在 c# 中开发它)中的某些东西挂接到同一个脚本中.

I was thinking script as may need to hook into the same script something from applications on different platforms (eg symbian mobile where I wouldnt be able to develop it in c# as I would the desktop version).

它不是网络应用程序,否则我也可以使用原始站点.我意识到这一切听起来毫无意义,但这种特定形式的自动化对我来说真的可以节省时间.

Its not a web app otherwise I may as well use the original site. I realise it all sounds pointless but the automation for this specific form would be a real time saver for me.

推荐答案

别忘了看看 BeautifulSoup,强烈推荐.

Do not forget to look at BeautifulSoup, comes highly recommended.

例如,请参阅options-for-html-scraping.如果您需要为此任务选择一种编程语言,我会说 Python.

See, for example, options-for-html-scraping. If you need to select a programming language for this task, I'd say Python.

对您的问题更直接的解决方案,请参阅 twill,一种用于网络浏览的简单脚本语言.

A more direct solution to your question, see twill, a simple scripting language for Web browsing.

这篇关于什么是最好的屏幕抓取语言?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆