亚马逊EC2与亚马逊电子病历 [英] Amazon EC2 vs. Amazon EMR

查看:135
本文介绍了亚马逊EC2与亚马逊电子病历的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我已经实现了在蜂巢的任务。目前,它正在罚款我的单节点集群。 现在,我计划在AWS上部署它。

I have implemented a task in Hive. Currently it is working fine on my single node cluster. Now I am planning to deploy it on AWS.

我不知道关于AWS什么。如果我打算部署它那么我应该选择在Amazon EC2或Amazon EMR。

I don't know anything about the AWS. If I plan to to deploy it then what should I choose Amazon EC2 or Amazon EMR.

我想提高我的工作表现。哪一个是更好的,可靠的我吗?如何处理对他们。听说我们还可以注册我们的虚拟机的设置,因为它是在AWS上。这可能吗?

I want to improve the performance of my task. Which one is better and reliable for me? How to approach towards them. I heard that we can also register our VM setting as it is on AWS. Is it possible?

PLS建议我尽快。

非常感谢。

推荐答案

EMR是EC2实例中使用Hadoop(以及可选配置单元和/或猪)的集合安装并对其进行配置。如果您正在使用群集运行Hadoop的/蜂房/猪的工作,EMR是要走的路。相比于一个EC2实例的电子病历实例的成本一点点额外的费用。目前在亚马逊的价格快速检查表明,小EC2实例费用$为0.08 /小时,而一个小的电子病历实例成本为$ 0.015 /小时的额外费用。 在我看来,这是完全值得付出额外的钱去拯救自己安装和设置的Hadoop(以及蜂巢和猪),创建并维护和AMI和使用它的麻烦。此外,电子病历的Hadoop版本和蜂巢有一些补丁不可用(ATLEAST,目前还没有)在Apache蜂巢。如果你使用EC2,你可能会使用Apache Hadoop和配置单元(或者可能是,在Cloudera的分布),并且不会有机会获得这些补丁(如像的S3或命令的本机支持ALTER TABLE my_table的修复分区

EMR is a collection of EC2 instances with Hadoop (and optionally Hive and/or Pig) installed and configured on them. If you are using your cluster for running Hadoop/Hive/Pig jobs, EMR is the way to go. An EMR instance costs a little bit extra as compared to an EC2 instance. A quick check on Amazon prices today reveals that a small EC2 instances costs $0.08/hour while a small EMR instance costs $0.015/hour extra. In my opinion, it's totally worth paying that extra money to save yourself the hassle of installing and setting up Hadoop (along with Hive and Pig), creating and maintaining and AMI and using it. Moreover, EMR's version of Hadoop and Hive has some patches that are not available (atleast, not yet) on Apache Hive. If you use EC2, you will probably be using Apache Hadoop and Hive (or may be, the cloudera distributions) and wouldn't have access to those patches (like native support for S3 or commands like ALTER TABLE my_table RECOVER PARTITIONS

参考文献:

  • http://aws.amazon.com/ec2/pricing/
  • http://aws.amazon.com/elasticmapreduce/pricing/

这篇关于亚马逊EC2与亚马逊电子病历的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆