Scrape HTML generated by JavaScript with Python

Views: 109 Published: 2016/8/15 12:52:13 Tags: javascript python browser screen-scraping

Problem description

I need to scrape a site with Python. I obtain the HTML source with the urllib module, but I also need to scrape some HTML that is generated by a JavaScript function (which is included in the HTML source). On the site, when you press a button, this function outputs some HTML. How can I "press" this button from Python code? Can Scrapy help me? I captured the POST request with Firebug, but when I try to pass it in the URL I get a 403 error. Any suggestions?
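One possible reading of the 403 is that the captured request was replayed as a GET with the parameters in the URL, or without the headers the browser sent. The sketch below shows how the captured POST might be rebuilt with the standard library instead. It assumes Python 3 (`urllib.request`); the URL, the form field, and the header values are hypothetical placeholders that would have to be copied from the request seen in Firebug, and the server may still require cookies or tokens not shown here.

```python
from urllib.parse import urlencode
from urllib.request import Request, urlopen


def build_post_request(url, form_fields, referer):
    """Build a POST request resembling the one the browser sent.

    All header values are assumptions; copy the real ones from the
    request captured in the browser's network panel.
    """
    # Form fields go in the request body, not in the URL.
    body = urlencode(form_fields).encode("utf-8")
    headers = {
        "User-Agent": "Mozilla/5.0",
        "Referer": referer,
        "X-Requested-With": "XMLHttpRequest",
        "Content-Type": "application/x-www-form-urlencoded",
    }
    return Request(url, data=body, headers=headers, method="POST")


def fetch_generated_html(request):
    """Send the request and return the response body as text."""
    with urlopen(request) as response:
        return response.read().decode("utf-8", errors="replace")


# Hypothetical usage (performs a network call, so it is left commented out):
# req = build_post_request(
#     "http://example.com/ajax/endpoint",   # hypothetical endpoint
#     {"action": "show"},                   # hypothetical form field
#     "http://example.com/page.html",       # page that hosts the button
# )
# print(fetch_generated_html(req)[:200])
```

If the server returns the generated HTML for such a request, no button press needs to be simulated. If it does not, driving a real browser is the usual fallback.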
