如何在Kafka中同时实现分布式处理和高可用性? [英] How to achieve distributed processing and high availability simultaneously in Kafka?

查看：167 发布时间：2020/4/25 8:32:59 message-queue scalability apache-kafka high-availability kafka-consumer-api

本文介绍了如何在Kafka中同时实现分布式处理和高可用性?的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我有一个包含n个分区的主题.为了进行分布式处理，我创建了两个在不同计算机上运行的进程.他们使用相同的分组ID订阅该主题，并分配n/2个线程，每个线程处理一个流(每个进程n/2个分区).

I have a topic consisting of n partitions. To have distributed processing I create two processes running on different machines. They subscribe to the topic with same groupd id and allocate n/2 threads, each of which processes single stream(n/2 partitions per process).

这样，我将实现负载分配，但是现在，如果进程1崩溃，则进程2将无法使用分配给进程1的分区中的消息，因为它在开始时仅侦听n/2个流.

With this I will have achieved load distribution, but now if process 1 crashes, than process 2 cannot consume messages from partitions allocated to process 1, as it listened only on n/2 streams at the start.

否则，如果我为HA配置并在两个进程上启动n个线程/流，则当一个节点发生故障时，所有分区将由另一节点处理.但是在这里，我们已经破坏了分配，因为所有分区一次将由一个节点处理.

Or else, if I configure for HA and start n threads/streams on both processes, then when one node fails, all partitions will be processed by other node. But here, we have compromised distribution, as all partitions will be processed by a single node at a time.

有没有办法同时实现这两个目标?

Is there a way to achieve both simultaneously and how?

如何在Kafka中同时实现分布式处理和高可用性? [英] How to achieve distributed processing and high availability simultaneously in Kafka?

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

如何在Kafka中同时实现分布式处理和高可用性? [英] How to achieve distributed processing and high availability simultaneously in Kafka?

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

登录关闭