Hadoop备份和恢复工具和指导 [英] Hadoop backup and recovery tool and guidance

查看:198
本文介绍了Hadoop备份和恢复工具和指导的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我是hadoop需要了解备份和恢复的详细信息的新手。我已经修改了oracle备份和恢复,它可以帮助hadoop吗?我应该从哪里开始

和恢复。正如s.singh指出的那样,数据复制不是DR。

HDFS支持快照。这可以用来防止用户错误,恢复文件等。就是说,这不是Hadoop集群发生故障时的DR。 ( http://hadoop.apache.org/ docs / current / hadoop-project-dist / hadoop-hdfs / HdfsSnapshots.html



你最好的选择是保持非现场备份。这可以是另一个Hadoop集群,S3等,并可以使用distcp执行。 ( http://hadoop.apache.org/docs/stable1/distcp2.html),( https://wiki.apache.org/hadoop/AmazonS3



这是Cloudera在讨论DR时的Slideshare( http://www.slideshare.net/cloudera/hadoop-backup-and-disaster-recovery


I am new to hadoop need to learn details about backup and recovery. I have revised oracle backup and recovery will it help in hadoop?From where should I start

解决方案

There are a few options for backup and recovery. As s.singh points out, data replication is not DR.

HDFS supports snapshotting. This can be used to prevent user errors, recover files, etc. That being said, this isn't DR in the event of a total failure of the Hadoop cluster. (http://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-hdfs/HdfsSnapshots.html)

Your best bet is keeping off-site backups. This can be to another Hadoop cluster, S3, etc and can be performed using distcp. (http://hadoop.apache.org/docs/stable1/distcp2.html), (https://wiki.apache.org/hadoop/AmazonS3)

Here is a Slideshare by Cloudera discussing DR (http://www.slideshare.net/cloudera/hadoop-backup-and-disaster-recovery)

这篇关于Hadoop备份和恢复工具和指导的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆