How to download a dataset from Amazon AWS

Question

I am trying to get a large dataset of around 1 GB. Most of the large datasets I found are hosted on Amazon Web Services, for example https://aws.amazon.com/datasets/google-books-ngrams/, which states that the data is available at:

s3://datasets.elasticmapreduce/ngrams/books/

Now I am trying to download this data, so I created a virtual machine in AWS. How can I download this 2 GB of data to my AWS machine so that I can later transfer it to my local machine with scp?

Recommended answer

As Mark B commented, you don't need to spin up a server in AWS to download these datasets. They are publicly hosted in S3, so you can use the cp command of the AWS CLI's s3 tool to download them straight to your local machine by running:

aws s3 cp --recursive s3://datasets.elasticmapreduce/ngrams/books/ local

Relevant documentation can be found here.
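
Because a recursive copy pulls everything under the prefix, it can be worth checking how much data is there before starting. The following is a minimal sketch, not part of the original answer: the run helper and the RUN variable are hypothetical names introduced here so that the commands are only printed by default, while the aws s3 ls options (--recursive, --human-readable, --summarize) and the --dryrun option of aws s3 cp are standard AWS CLI options. If no AWS credentials are configured, adding the global --no-sign-request option may help, assuming the bucket permits anonymous reads.

```shell
#!/usr/bin/env bash
# Sketch: inspect the public prefix before copying it.
# By default the commands are only printed; set RUN=1 to execute them.

SRC="s3://datasets.elasticmapreduce/ngrams/books/"
DEST="local"   # destination directory, as in the answer's command

# Hypothetical helper: print a command, and run it only when RUN=1.
run() {
  echo "+ $*"
  if [ "${RUN:-0}" = "1" ]; then
    "$@"
  fi
}

# 1. List the objects under the prefix with a total count and size.
run aws s3 ls --recursive --human-readable --summarize "$SRC"

# 2. Preview which files a recursive copy would transfer.
run aws s3 cp --recursive --dryrun "$SRC" "$DEST"

# 3. Perform the actual download (same command as in the answer).
run aws s3 cp --recursive "$SRC" "$DEST"
```

Saved as, say, fetch_ngrams.sh (an arbitrary name), running RUN=1 bash fetch_ngrams.sh executes all three steps; without RUN=1 it only shows what would be run.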
