将AWS Dynamodb备份到S3 [英] Backup AWS Dynamodb to S3
问题描述
在 Amazon文档上已建议使用 http://aws.amazon.com/dynamodb/ 等,您可以使用Elastic Map Reduce来备份dynamodb表,
我对此有一个大致的了解,但对此却找不到任何指南或教程,
It has been suggested on Amazon docs http://aws.amazon.com/dynamodb/ among other places, that you can backup your dynamodb tables using Elastic Map Reduce,
I have a general understanding of how this could work but I couldn't find any guides or tutorials on this,
所以我的问题是如何使dynamodb备份自动化(使用EMR)?
So my question is how can I automate dynamodb backups (using EMR)?
到目前为止,我想我需要使用一个映射函数创建一个流作业,该函数从dynamodb中读取数据,并将reducemodb的数据写入到S3中,我相信这些可以用Python(或Java或其他几种语言)编写。
So far, I think I need to create a "streaming" job with a map function that reads the data from dynamodb and a reduce that writes it to S3 and I believe these could be written in Python (or java or a few other languages).
任何注释,澄清,代码示例,更正均受到赞赏。
Any comments, clarifications, code samples, corrections are appreciated.
推荐答案
随着AWS Data Pipeline的引入以及用于dynamodb到S3备份的现成模板,最简单的方法是在Data Pipeline中安排备份 [链接] ,
With introduction of AWS Data Pipeline, with a ready made template for dynamodb to S3 backup, the easiest way is to schedule a back up in the Data Pipeline [link],
需求(数据转换,非常精细的谷物控制...)考虑@greg
In case you have special needs (data transformation, very fine grain control ...) consider the answer by @greg
这篇关于将AWS Dynamodb备份到S3的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!