在python中打开网站框架或图像 [英] Opening a website frame or image in python

查看:178
本文介绍了在python中打开网站框架或图像的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

所以我相当流利的python和已经使用urllib2和饼干很多的网站自动化。我只是偶然发现了webbrowser模块,可以在默认浏览器中打开一个网址。我想知道是否可能从该URL只选择一个对象,并打开它。具体来说,我想打开一个验证码,以便用户可以输入它,并继续做别的事情。



这是包含html中的验证码的行,我认为:

  script type =text / javascriptsrc =http://api.recaptcha.net/challenge?k=6LcZ-AAAAAAAANX-xwVtzow1f4RpSrbSViRUx9Js>< / script> < input type =submitname =submitBtnvalue =Submit

该api链接打开:

  var RecaptchaState = {
site:'6LcZ-AAAAAAAANX-xwVtzow1f4RpSrbSViRUx9Js',
的挑战:03AHJ_VuvoUHPdfoXLsVHGa7a26GR9s9Y5dkyKmqk2XsJ1SdiwF_2u0SV_sKnr1artkpc-5MjUe7SYD40xr7sAyvikKwpFCQTBdKUFfl76UP6EbDhezoTC8B1X8fjixuIJ4wJhI6yTc8vlX4ioh6je9lwFbPXllbGh2w',
is_incorrect:假的,
programming_error:'',
ERROR_MESSAGE:'',
服务器:http://www.google .com / recaptcha / api /',
timeout:18000
};

document.write('< scr'+'ipt type =text / javascripts'+'rc ='+ RecaptchaState.server +'js / recaptcha.js> ; / scr'+'ipt>');

任何信息都有助于此。

解决方案

这不可能与webbrowser模块。所有的webbrowser都提供了一个简单的方法来识别默认的web浏览器并且给它一个URL。



如果你想渲染页面的一部分,你需要它可以获取任意的HTML片段,或者可以在加载页面后注入一些Javascript来删除不需要的元素。



为此,你需要的是构建一个目的



这可以使用以下库的任何组合来完成:





p>


  • 如果GPL许可正确,请使用PyQt。

  • 如果GPL许可不正确:


    • 对于Linux,使用PySide或PyGTK和PyWebKitGTK(GTKMozEmbed是重的)

    • 对于Windows,使用wxPython with wxIEHtmlWindow

    • 对于OSX,您必须询问其他人。



So i am fairly fluent with python and have used urllib2 and Cookies a lot for website automation. I just stumbled upon the "webbrowser" module which can open a url in your default browser. Im wondering if its possible to select just one object from that url and open that up. Specifically i want to open a "captcha" so that the user can input it, and continue doing something else.

this is line containing the captcha in the html, i think:

script type="text/javascript" src="http://api.recaptcha.net/challenge?k=6LcZ-AAAAAAAANX-xwVtzow1f4RpSrbSViRUx9Js"></script>    <input type="submit" name="submitBtn" value="Submit" 

and clicking on that api link opens this:

var RecaptchaState = {
    site : '6LcZ-AAAAAAAANX-xwVtzow1f4RpSrbSViRUx9Js',
    challenge : '03AHJ_VuvoUHPdfoXLsVHGa7a26GR9s9Y5dkyKmqk2XsJ1SdiwF_2u0SV_sKnr1artkpc-5MjUe7SYD40xr7sAyvikKwpFCQTBdKUFfl76UP6EbDhezoTC8B1X8fjixuIJ4wJhI6yTc8vlX4ioh6je9lwFbPXllbGh2w',
    is_incorrect : false,
    programming_error : '',
    error_message : '',
    server : 'http://www.google.com/recaptcha/api/',
    timeout : 18000
};

document.write('<scr'+'ipt type="text/javascript" s'+'rc="' + RecaptchaState.server + 'js/recaptcha.js"></scr'+'ipt>');

any info would help with this.

解决方案

It's not possible with the webbrowser module. All webbrowser does is provide a simple way to identify the default web browser and feed a URL to it.

If you want to render just a portion of a page, you need something that can either take arbitrary HTML fragments or can inject some Javascript after loading a page to strip out the unwanted elements.

For that, what you need is to build a purpose-specific web browser that's nothing more than a dialog box containing a web widget.

That can be done using any of the following combinations of libraries:

My advice:

  • If GPL licensing is OK, use PyQt.
  • If GPL licensing isn't OK:
    • For Linux, use PySide or PyGTK with PyWebKitGTK (GTKMozEmbed is heavy)
    • For Windows, use wxPython with wxIEHtmlWindow
    • For OSX, you'll have to ask someone else.

这篇关于在python中打开网站框架或图像的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆