配置以加速分布式模式下的拓扑 [英] Configuration to speed up the topology in distributed mode

查看：69 发布时间：2019/6/11 15:26:16 Apache hadoop

本文介绍了配置以加速分布式模式下的拓扑的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

I have a topology running with parallelism as (1,8,1)(spout,logic bolt, write bolt) with number of ackers set as 12( 12 are available slots in my cluster). The max spout pending is 200 and timeout.secs is 200. I have to process 14 lac inputs.

My cluster consist of 1 nimbus & 3 supervisors ( dual core , 4 GB each) SO currently it takes 44 hours to give the output for 14 lac inputs.

My application is running separately as an application on another server. I did this to separate the application from the storm jar because I suspected that application was reserving some memory. I ran 50 instances of the application so it can process 50 tuples at a time But even this didnt help.

with this configuration, the topology processes around 6000/6500 tuples in 10 mins Messages are not failing anywhere but overall latency 16000 ms. If i try increasing the paralelism the rate of topology reduces(4000/ 3000 in 10 mins).

The behaviour is not constanst so kindly help me to do the math.

我的尝试：

我已经在我的问题中提到了一切

What I have tried:

I have mentioned everything in my question

配置以加速分布式模式下的拓扑 [英] Configuration to speed up the topology in distributed mode

问题描述

推荐答案

相关文章

其他开发语言最新文章

热门教程

热门工具

登录关闭

配置以加速分布式模式下的拓扑 [英] Configuration to speed up the topology in distributed mode

问题描述

推荐答案

相关文章

其他开发语言最新文章

热门教程

热门工具

登录 关闭

登录关闭