How to bulk download files from the Internet Archive


Question

I checked the Internet Archive's own site, which describes a few steps for bulk downloading, including running the wget utility under Cygwin on Windows. I followed those steps: I ran an advanced search, exported the results as a CSV file, converted it to .txt, and then tried to run the following command:

wget -r -H -nc -np -nH --cut-dirs=1 -A .pdf,.epub -e robots=off -l1 -i ./itemlist.txt -B 'http://archive.org/download/

After that the terminal emulator hangs: no log message, or even an error message, appears to indicate any progress. What have I done wrong?

Recommended answer

After some time I figured out how to resolve this. The commands posted on the Internet Archive help blog are general-purpose examples meant to show how to use the wget utility. The options actually needed here are just the following:

--cut-dirs=1

-A .pdf,.epub

-e robots=off

-i ./itemlist.txt

and, of course, the source URL:

-B 'http://archive.org/download/'
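
For reference, here is a minimal sketch of how the options listed in this answer might be combined into a single call. It uses the `--cut-dirs` spelling and the `-B 'http://archive.org/download/'` form from the command in the question, and it assumes `itemlist.txt` holds one item identifier per line. The helper name `build_wget_args` and the `RUN` switch are hypothetical and only serve the illustration. The recursion flags from the question (`-r -H -nc -np -nH -l1`) are left out because the answer does not list them; whether they are still needed is not settled here.

```shell
#!/usr/bin/env bash
# Sketch only: assembles the options listed in the answer into one wget call.
# Assumption: the item list has one Internet Archive identifier per line.

# build_wget_args is a hypothetical helper name, not part of wget.
build_wget_args() {
  local itemlist="$1"
  WGET_ARGS=(
    --cut-dirs=1                        # skip the first remote directory component when saving
    -A .pdf,.epub                       # accept list of file suffixes
    -e robots=off                       # do not honor robots.txt
    -i "$itemlist"                      # read the entries to fetch from this file
    -B 'http://archive.org/download/'   # base URL each entry in the list is resolved against
  )
}

build_wget_args ./itemlist.txt

# Print the command by default; set RUN=1 to actually start the download.
if [ "${RUN:-0}" = "1" ]; then
  wget "${WGET_ARGS[@]}"
else
  printf 'wget'
  printf ' %s' "${WGET_ARGS[@]}"
  printf '\n'
fi
```

Running the script as-is only prints the assembled command, which makes it easy to check the quoting before starting a long download with `RUN=1`.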
