Why does worker node not see updates to accumulator on another worker nodes?


Problem Description


I'm using a LongAccumulator as a shared counter in map operations. But it seems that I'm not using it correctly because the state of the counter on the worker nodes is not updated. Here's what my counter class looks like:

import java.io.Serializable;

import org.apache.spark.api.java.JavaSparkContext;
import org.apache.spark.util.LongAccumulator;
import org.slf4j.Logger;
import org.slf4j.LoggerFactory;

public class Counter implements Serializable {

   private static final Logger log = LoggerFactory.getLogger(Counter.class);

   private LongAccumulator counter;

   public Long increment() {
      log.info("Incrementing counter with id: " + counter.id() + " on thread: " + Thread.currentThread().getName());
      counter.add(1);
      Long value = counter.value();
      log.info("Counter's value with id: " + counter.id() + " is: " + value + " on thread: " + Thread.currentThread().getName());
      return value;
   }

   public Counter(JavaSparkContext javaSparkContext) {
      counter = javaSparkContext.sc().longAccumulator();
   }
}


As far as I can understand the documentation this should work fine when the application is run within multiple worker nodes:


Accumulators are variables that are only "added" to through an associative and commutative operation and can therefore be efficiently supported in parallel. They can be used to implement counters (as in MapReduce) or sums. Spark natively supports accumulators of numeric types, and programmers can add support for new types.


But here is the result when the counter is incremented on 2 different workers, and it looks like the state is not shared between the nodes:


INFO Counter: Incrementing counter with id: 866 on thread: Executor task launch worker-6
INFO Counter: Counter's value with id: 866 is: 1 on thread: Executor task launch worker-6
INFO Counter: Incrementing counter with id: 866 on thread: Executor task launch worker-0
INFO Counter: Counter's value with id: 866 is: 1 on thread: Executor task launch worker-0


Do I misunderstand the accumulator concept, or is there a setting I must start the task with?

Answer


Tasks running on a cluster can then add to it using the add method. However, they cannot read its value. Only the driver program can read the accumulator’s value, using its value method.


Each task has its own accumulator, which is updated locally and merged with the "shared" copy on the driver once the task has finished and its result has been reported.
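The merge model above can be sketched in plain Java, without a Spark dependency (the `LocalCounter` class and `runTasks` helper are hypothetical, for illustration only): each task increments its own local copy, so reading the value inside a task only ever shows that task's partial count, and the real total appears only after the driver merges the per-task copies.

```java
import java.util.ArrayList;
import java.util.List;

public class AccumulatorModel {

    // Stand-in for a per-task accumulator copy (hypothetical, for illustration).
    static class LocalCounter {
        long count = 0;
        void add(long n) { count += n; }
        long value()     { return count; }   // sees only this task's increments
    }

    // Each "task" increments its own local copy; the driver merges afterwards.
    public static long runTasks(int[] incrementsPerTask) {
        List<LocalCounter> taskCopies = new ArrayList<>();
        for (int increments : incrementsPerTask) {
            LocalCounter local = new LocalCounter();
            for (int i = 0; i < increments; i++) {
                local.add(1);
            }
            // Inside the task, value() is only the local partial count: with
            // one increment per task, every task logs "value is: 1", which is
            // exactly the output shown in the question.
            taskCopies.add(local);
        }
        // "Driver side": merge the per-task copies to get the real total.
        long total = 0;
        for (LocalCounter copy : taskCopies) {
            total += copy.value();
        }
        return total;
    }

    public static void main(String[] args) {
        // Two tasks, one increment each: each task sees 1 locally,
        // but the merged driver-side total is 2.
        System.out.println(runTasks(new int[]{1, 1}));
    }
}
```

Applied to the question's `Counter` class, the consequence is that `counter.value()` should only be called on the driver, after the action that performs the `add` calls has completed.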


The old Accumulator API (now wrapping AccumulatorV2) actually threw an exception when value was used from within a task, but for some reason this check has been omitted in AccumulatorV2.


What you experience is actually similar to the old behavior described here: How to print accumulator variable from within task (seem to "work" without calling value method)?

