什么是ElasticSearch-Hadoop(es-hadoop)及其对HBase的实时Web应用程序的好处? [英] What is ElasticSearch-Hadoop (es-hadoop) and its benefit over HBase for a live web application?

查看:740
本文介绍了什么是ElasticSearch-Hadoop(es-hadoop)及其对HBase的实时Web应用程序的好处?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我不清楚es-hadoop是从说明

It is not entirely clear to me what es-hadoop is from the description.

这只是一个连接器,将数据从您的ES集群移动到HDFS进行Hadoop分析?如果是这样,为什么不用HBase进行低延迟的文本查询?

Is this merely a "connector" that will move data over from your ES cluster to HDFS for Hadoop analytics? If so, why not just go with HBase for low-latency text queries?

es-Hadoop与常规ES不同的安装?

Is es-Hadoop a different installation than regular ES?

请详细说明。

谢谢。

推荐答案

ES-Hadoop更接近于Hadoop生态系统与ES之间的连接。它不是ES的单独版本。

ES-Hadoop is way closer to be a connector between Hadoop eco-system to ES. It is not a separate release of ES.

基本上它改进了Hadoop生态系统应用程序与ES之间的集成。在我的组织中,我们将此功能用于两个目的:

Basically it improves the integration between Hadoop eco-system application to ES. In my organisation we use this feature for 2 purposes:


  1. 在将数据索引到ES之前,我们使用Spark来分析数据并执行相关的聚合,以减少在ES上执行的索引量。 ES-Hadoop帮助我们直接从Spark数据结构索引到ES。我们正在使用一行代码开始索引过程,不需要自己编写索引程序。 (该功能是可配置的,您可以灵活地索引数据,但您喜欢)。

  1. Before indexing the data to ES, we use Spark to analyse the data and perform relevant aggregation to reduce the amount of indexing that should be performed on ES. ES-Hadoop help us to index directly from Spark data structures to ES. We are starting the indexing process with a single line of code and don't need to write the indexing program ourselves. (The feature is configurable, and you have the flexibility to index the data however you like).

在我们的组织中,我们使用ES作为我们的近实时分析簇。 ES中的数据将以一种能够为我们的客户产生最佳性能的方式。有时(通常当我们有一些新功能的想法)时,我们必须从ES获取数据,并对数据执行一些复杂的处理。在这些情况下,我们可以在一行代码中从ES数据创建Spark数据结构。

In our organisation we use ES as our near real time analytics cluster. The data in ES is placed in a way that will produce the best performance for our clients. Sometimes (usually when we have ideas about some new features) we have to get the data from ES and perform some complex processing on the data. In those cases we can create Spark data structure from ES data in a single line of code as well.

ES-Hadoop更接近于一个写得很好的连接器。您仍然必须将数据从您的ES集群传输到Hadoop。

So, ES-Hadoop is closer to be a well written connector. You still have to transport the data from your ES cluster to Hadoop.

我不确定与HBase的比较,您不能真正比较HBase的功能这是与ES相比的一个关键价值存储,它是通用搜索引擎+在最后版本中实现了非常好的分析功能。正如我所看到的,我们正在处理访问不同问题的不同工具。

I'm not sure about the comparison to HBase, You can't really compare the features of HBase which is a key value store in compare to ES which is a general purpose search engine + implemented very nice analytics capabilities in the last versions. As I see it, we are dealing with different tools that access different set of problems.

这篇关于什么是ElasticSearch-Hadoop(es-hadoop)及其对HBase的实时Web应用程序的好处?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆