Spark Accumulator value not read by task
Problem Description
I'm initializing an accumulator:
final Accumulator<Integer> accum = sc.accumulator(0);
Then, inside a map function, I'm trying to increment the accumulator and then use the accumulator's value to set a variable:
JavaRDD<UserSetGet> UserProfileRDD1 = temp.map(new Function<String, UserSetGet>() {
    @Override
    public UserSetGet call(String arg0) throws Exception {
        UserSetGet usg = new UserSetGet();
        accum.add(1);
        usg.setPid(accum.value().toString()); // this value() call is what throws
        return usg;
    }
});
But I'm getting the following error:
16/03/14 09:12:58 ERROR executor.Executor: Exception in task 0.0 in stage 2.0 (TID 2) java.lang.UnsupportedOperationException: Can't read accumulator value in task
EDITED - As per the answer from Avihoo Mamka, getting the accumulator value in tasks is not possible.
So is there any way I can achieve the same thing in parallel, such that the Pid value gets set each time a variable (e.g. a static counter) is incremented in my map function?
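Since tasks cannot read the accumulator, one common workaround for generating a per-record Pid in parallel is to drop the shared counter entirely and let Spark assign the index. The sketch below is not the original poster's code: the class and method names are made up for illustration, and it assumes the id only needs to be unique, which `zipWithIndex()` guarantees by computing a stable per-element `Long` index across all partitions.

```java
import java.util.Arrays;
import java.util.List;
import org.apache.spark.SparkConf;
import org.apache.spark.api.java.JavaRDD;
import org.apache.spark.api.java.JavaSparkContext;

public class ZipWithIndexExample {

    // Assigns each input record a unique id ("record:index") without any
    // shared mutable counter: zipWithIndex() derives a stable index for
    // every element from the partition sizes, so it is safe in parallel.
    public static List<String> assignPids(JavaSparkContext sc, List<String> users) {
        JavaRDD<String> temp = sc.parallelize(users);
        return temp.zipWithIndex()                  // JavaPairRDD<String, Long>
                   .map(t -> t._1() + ":" + t._2()) // e.g. "u1:0"
                   .collect();
    }

    public static void main(String[] args) {
        SparkConf conf = new SparkConf().setAppName("pid-demo").setMaster("local[*]");
        JavaSparkContext sc = new JavaSparkContext(conf);
        System.out.println(assignPids(sc, Arrays.asList("u1", "u2", "u3")));
        sc.stop();
    }
}
```

Note that `zipWithIndex()` triggers an extra Spark job to compute partition sizes; if the ids only need to be unique rather than consecutive, `zipWithUniqueId()` avoids that job.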
Recommended Answer
From the Spark documentation:
Accumulators are variables that are only "added" to through an associative operation and can therefore be efficiently supported in parallel. They can be used to implement counters (as in MapReduce) or sums
...
Only the driver program can read the accumulator's value, using its value method.
Therefore, trying to read an accumulator's value from within a task in Spark means you are trying to read it from a worker, which goes against the design that the accumulator's value can only be read from the driver.
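A minimal sketch of the supported pattern (assuming the Spark 1.x `Accumulator` API the question uses; the class and app names here are made up): tasks only ever call `add`, and the driver reads `value()` after an action has completed.

```java
import java.util.Arrays;
import java.util.List;
import org.apache.spark.Accumulator;
import org.apache.spark.SparkConf;
import org.apache.spark.api.java.JavaSparkContext;

public class AccumulatorDemo {

    // Counts records with an accumulator. Tasks only ADD to it; the value
    // is read back on the driver, after the action (foreach) has finished.
    public static int countRecords(JavaSparkContext sc, List<String> records) {
        final Accumulator<Integer> accum = sc.accumulator(0);
        sc.parallelize(records).foreach(s -> accum.add(1));
        return accum.value(); // legal here: this line runs on the driver
    }

    public static void main(String[] args) {
        SparkConf conf = new SparkConf().setAppName("accum-demo").setMaster("local[*]");
        JavaSparkContext sc = new JavaSparkContext(conf);
        System.out.println("count = " + countRecords(sc, Arrays.asList("a", "b", "c")));
        sc.stop();
    }
}
```

Also note that the value is only reliable after an action: transformations like map are lazy, so an accumulator incremented inside a map is not updated until something like `count()` or `foreach()` forces the computation.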