How to achieve distributed processing and high availability simultaneously in Kafka?

Views: 25 Published: 2021/11/12 2:51:04 Tags: message-queue scalability apache-kafka high-availability kafka-consumer-api


Problem description

I have a topic consisting of n partitions. To get distributed processing, I create two processes running on different machines. They subscribe to the topic with the same group id and each allocates n/2 threads, each of which processes a single stream (n/2 partitions per process).

With this I achieve load distribution, but if process 1 crashes, then process 2 cannot consume messages from the partitions allocated to process 1, as it listened on only n/2 streams at the start.

Alternatively, if I configure for HA and start n threads/streams on both processes, then when one node fails, all partitions will be processed by the other node. But this compromises distribution, since all partitions are processed by a single node at a time.

Is there a way to achieve both simultaneously, and if so, how?
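
One way to reason about the scenario above: in a Kafka consumer group, partitions are generally not pinned to whichever threads happened to exist at startup. When a group member disappears, the group rebalances and the surviving members share out its partitions. The sketch below is a simplified, pure-Python model of that idea (contiguous, range-style assignment). It is not Kafka's actual assignor code, and the function name `assign_partitions` and the thread names such as `p1-t0` are hypothetical, chosen only for illustration.

```python
from typing import Dict, List


def assign_partitions(num_partitions: int, consumers: List[str]) -> Dict[str, List[int]]:
    """Spread partitions 0..num_partitions-1 over the live consumers.

    Simplified model: consumers are sorted by name and each receives a
    contiguous range; earlier consumers absorb any remainder.
    """
    if not consumers:
        return {}
    ordered = sorted(consumers)
    base, extra = divmod(num_partitions, len(ordered))
    assignment: Dict[str, List[int]] = {}
    start = 0
    for index, consumer in enumerate(ordered):
        count = base + (1 if index < extra else 0)
        assignment[consumer] = list(range(start, start + count))
        start += count
    return assignment


# Assumed setup: n = 4 partitions, two processes with n/2 = 2 threads each.
n = 4
threads = ["p1-t0", "p1-t1", "p2-t0", "p2-t1"]

# Both processes alive: every thread owns one partition.
before = assign_partitions(n, threads)

# Process 1 crashes: its threads leave the group, and a rebalance
# hands all partitions to the threads of process 2.
survivors = [t for t in threads if not t.startswith("p1-")]
after = assign_partitions(n, survivors)

print(before)
print(after)
```

In this model, process 2 keeps running only its n/2 threads, yet after the simulated rebalance each of them owns two partitions instead of one. Under that assumption, load distribution while both nodes are healthy and failover when one node dies do not require starting n threads on each process.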
