HBase and Hadoop


Question


From what I have read so far, HBase requires a Hadoop installation. It looks like HBase can be set up either to use an existing Hadoop cluster (shared with other users) or to use a dedicated Hadoop cluster. I guess the latter would be the safer configuration, but I am wondering whether anybody has experience with the former. (I am also not sure my understanding of the HBase setup is correct.)

Answer

I know that Facebook and other large organizations separate their HBase cluster (real-time access) from their Hadoop cluster (batch analytics) for performance reasons. Large MapReduce jobs running on the same cluster can degrade the performance of the real-time interface, which can be problematic.

In a smaller organization, or in a situation where your HBase response time doesn't necessarily need to be consistent, you can just use the same cluster.

Other than performance, there aren't many (or any) concerns with coexistence.
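
If you do go the shared-cluster route, pointing HBase at the existing HDFS is mostly a matter of configuration. Here is a minimal sketch of an `hbase-site.xml`, not a complete or tuned setup. `hbase.rootdir` and `hbase.cluster.distributed` are standard HBase properties, but the NameNode host `namenode.example.com`, the port `8020`, and the `/hbase` path are placeholders I made up for illustration; substitute whatever your existing cluster actually uses.

```xml
<!-- hbase-site.xml: sketch only. Host, port, and path are placeholders. -->
<configuration>
  <!-- Directory on the existing (shared) HDFS where HBase stores its data -->
  <property>
    <name>hbase.rootdir</name>
    <value>hdfs://namenode.example.com:8020/hbase</value>
  </property>
  <!-- Run in distributed mode instead of standalone on the local filesystem -->
  <property>
    <name>hbase.cluster.distributed</name>
    <value>true</value>
  </property>
</configuration>
```

A dedicated cluster would use the same properties; only the HDFS address would point at a NameNode that serves HBase alone.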
