AWS:饲养成本下降的方式,而备份S3文件到冰川? [英] AWS: Ways of keeping cost down while backing up S3 files to Glacier?

查看:217
本文介绍了AWS:饲养成本下降的方式,而备份S3文件到冰川?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

由于我们的项目的一部分,我们已经创造了相当的S3浓密的文件夹/文件树中的所有文件占用了有关数据的6TB。目前,我们有这样的数据,是坏的没有备份。我们想要做的定期备份。好像冰川是要走的路。

As part of our project, we have created quite a bushy folder/file tree on S3 with all the files taking up about 6TB of data. We currently have no backup of this data which is bad. We want to do periodic back ups. Seems like Glacier is the way to go.

现在的问题是:什么样的方法来保持的总成本备份下来

The question is: what are the ways to keep the total cost of a back up down?

我们的大多数文件是文本,所以我们可以COM presses他们并上传整个ZIP压缩文件。这将需要处理(在EC2上),所以我很好奇是否有拇指任何规则,为压缩和解与刚刚上传uncom pressed文件进行比较运行的EC2实例的额外费用。

Most of our files are text so we can compresses them and upload whole ZIP archives. This will require processing (on EC2) so I am curious whether there is any rule of thumb to compare extra cost of running an EC2 instance for zipping versus just uploading uncompressed files.

此外,我们将不得不支付的数据传输,所以我想知道是否有任何方式备份不是(我)从S3下载文件到一个实例除外; (二)在其原始形式上传文件或压缩了冰川。

Also, we would have to pay for data transfer so I am wondering if there is any way of backing up other than (i) download file from S3 to an instance; (ii) upload file in its raw form or zipped up to Glacier.

推荐答案

我普遍认为冰川作为的替代的存储S3,不是的其他的存储。也就是说,数据将经常被储存在S3或冰川,但很少两者。

I generally think of Glacier as an alternative storage to S3, not an additional storage. I.e., data would most often be stored either in S3 or Glacier, but rarely both.

如果您信任S3的广告11个9的持久性,那么你就不会备份,因为S3本身就容易丢失数据。

If you trust S3's advertised eleven nines of durability, then you're not backing up because S3 itself is likely to lose the data.

您可能需要备份的数据,因为(像我这样做),你看你的亚马逊帐户作为单点故障(例如,证书遭到破坏或亚马逊阻止您的帐户,因为他们认为你正在做的事情滥用)。然而,在这种情况下,冰川是不是足够的备份,因为它仍然在亚马逊伞下降。

You might want to back up the data because (like I do) you see your Amazon account as a single point of failure (e.g., credentials are compromised or Amazon blocks your account because they believe you are doing something abusive). However, in that case, Glacier is not a sufficient backup as it still falls under the Amazon umbrella.

我建议备份S3数据亚马逊之外,如果你是担心失去在S3中的数据,由于用户的错误,损害凭据,等等。

I recommend backing up S3 data outside of Amazon if you are concerned about losing the data in S3 due to user error, compromised credentials, and the like.

我建议使用冰川作为一个地方存档长期,当你知道你不会需要太多访问它,即使有便宜的存储数据。当事情被转换到冰川,你会再从S3删除它们。

I recommend using Glacier as a place to archive data for long term, cheap storage when you know you're not going to need to access it much, if ever. When things are transitioned to Glacier, you would then delete them from S3.

亚马逊提供了从S3到冰川伟大的工程自动存档,但额外的费用要小心,如果你的文件的平均大小小。这里有一个文章,我写了这种危险:

Amazon provides automatic archival from S3 to Glacier which works great, but beware of the extra costs if the average size of your files is small. Here's an article I wrote on that danger:

过渡S3的成本对象冰川
  <一href="http://alestic.com/2012/12/s3-glacier-costs">http://alestic.com/2012/12/s3-glacier-costs

Cost of Transitioning S3 Objects to Glacier
http://alestic.com/2012/12/s3-glacier-costs

如果您仍然要复制从S3到冰川,这里是与你的问题的一些要点:

If you still want to copy from S3 to Glacier, here are some points related to your questions:

  • 您presumably保持数据的冰川很长一段时间,所以COM pressing它可能是值得的短期CPU使用率。确切的权衡取决于像数据的COM pressibility因素,需要多长时间,以COM preSS,你需要多长时间来执行COM pression。

  • You will presumably leave the data in Glacier a long time, so compressing it is probably worth the short term CPU usage. The exact trade off depends on factors like the compressibility of your data, how long it takes to compress, and how often you need to perform the compression.

不收取任何费用从S3下载数据到一个EC2实例。没有上传数据到冰川的数据传输费用。

There is no charge for downloading data from S3 to an EC2 instance. There is no data transfer charge for uploading data into Glacier.

如果您上传许多小文件到冰川,每件收费上载能积少成多。您可以通过将许多小文件进行存档,并上传它节省成本。

If you upload many small files to Glacier, the upload per item charges can add up. You can save on cost by combining many small files into an archive and uploading it.

另一个S3的功能,可以帮助防止由于用户的错误或攻击的意外丢失是打开S3版本并启用MFA(多因素认证)。这prevents有人从能够永久删除对象,除非它们在你的身上的凭据加上一个物理设备。

Another S3 feature that can help protect against accidental loss through user error or attacks is to turn on S3 versioning and enable MFA (multi-factor authentication). This prevents anybody from being able to permanently delete objects unless they have the credentials plus a physical device in your possession.

这篇关于AWS:饲养成本下降的方式,而备份S3文件到冰川?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆