用 Python 下载网页及其所有资源文件 [英] Downloading a web page and all of its resource files in Python

查看：33 发布时间：2021/9/15 18:37:39 python urllib2 wget

本文介绍了用 Python 下载网页及其所有资源文件的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我希望能够使用 Python 下载页面及其所有相关资源(图像、样式表、脚本文件等).我(有点)熟悉 urllib2 并且知道如何下载单个 url，但是在我开始在 BeautifulSoup + urllib2 上进行黑客攻击之前，我想确保还没有等效于wget --page-requisites http://www.google.com".

I want to be able to download a page and all of its associated resources (images, style sheets, script files, etc) using Python. I am (somewhat) familiar with urllib2 and know how to download individual urls, but before I go and start hacking at BeautifulSoup + urllib2 I wanted to be sure that there wasn't already a Python equivalent to "wget --page-requisites http://www.google.com".

我特别感兴趣的是收集有关下载整个网页(包括所有资源)所需时间的统计信息.

Specifically I am interested in gathering statistical information about how long it takes to download an entire web page, including all resources.

谢谢标记

用 Python 下载网页及其所有资源文件 [英] Downloading a web page and all of its resource files in Python

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录关闭

用 Python 下载网页及其所有资源文件 [英] Downloading a web page and all of its resource files in Python

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录 关闭

登录关闭