如何刮去网站上的所有内容? [英] How to scrape all content from a website?

查看:98
本文介绍了如何刮去网站上的所有内容?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我开发网站,有时客户已经拥有网站,但需要完全修改网站,但大部分内容和图片需要保持不变。我正在寻找软件,即使它花钱或是一个桌面应用程序,它可以轻松地让我输入一个URL并将所有内容都刮到我本地机器上的指定文件夹。任何帮助将不胜感激。

I develop websites and sometimes clients already have websites but need them totally revamped but most of the content and images need to stay the same. I'm looking for software, even if it costs or is a desktop application that will easily allow me to enter a URL and scrape all content to a designated folder on my local machine. Any help would be much appreciated.

推荐答案

htttrack 将为你工作得很好。这是一个离线浏览器,将拉下网站。您可以根据需要进行配置。这显然不会拉下PHP,因为php是服务器端代码。唯一可以拉下的是html和javascript以及任何推送到浏览器的图像。

htttrack will work just fine for you. It is an offline browser that will pull down websites. You can configure it as you wish. This will not pull down PHP obviously since php is server side code. The only thing you can pull down is html and javascript and any images pushed to the browser.

这篇关于如何刮去网站上的所有内容?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆