Spark Structure Streaming作业在集群模式下失败 [英] Spark Structure Streaming job failing in cluster mode

查看：188 发布时间：2020/9/4 2:43:38 apache-spark hadoop apache-spark-sql

本文介绍了Spark Structure Streaming作业在集群模式下失败的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我在应用程序中使用spark-sql-2.4.1 v.

I am using spark-sql-2.4.1 v in my application.

在将数据写入hdfs文件夹时，我在Spark流应用程序中遇到了这个问题

While writing data on to hdfs folder I am facing this issue in spark-streaming application

错误:

    yarn.Client: Deleted staging directory hdfs://dev/user/xyz/.sparkStaging/application_1575699597805_47
    20/02/24 14:02:15 ERROR yarn.Client: Application diagnostics message: User class threw exception: org.apache.hadoop.security.AccessControlException: Permission denied: user= xyz, access=WRITE, inode="/tmp/hadoop-admin":admin:supergroup:drwxr-xr-x
    .
    .
    .
    Caused by: org.apache.hadoop.ipc.RemoteException(org.apache.hadoop.security.AccessControlException): Permission denied: user=xyz, access=WRITE, inode="/tmp/hadoop-admin":admin:supergroup:drwxr-xr-x
            at org.apache.hadoop.hdfs.server.namenode.FSPermissionChecker.check(FSPermissionChecker.java:350)
            at org.apache.hadoop.hdfs.server.namenode.FSPermissionChecker.checkPermission(FSPermissionChecker.java:251)

在将数据写入HDFS文件夹时，当我在yarn-cluster模式下运行时，我在spark-streaming应用程序中遇到此问题时即

While writing data on to HDFS folder I am facing this issue in spark-streaming application when I run in yarn-cluster mode I face this issue i.e.

    --master yarn \
    --deploy-mode cluster \

但是当我在"yarn-client"模式下运行时，它运行良好即

But when I run in "yarn-client" mode its run fine i.e.

    --master yarn \
    --deploy-mode client \

任何人都遇到此错误，并且知道此问题的根源是什么?

Anyone come across this error and know what is the root-case of this problem?

这里的基本问题，为什么要尝试用"/tmp/hadoop-admin/"而不是相应的用户目录(即hdfs://qa2/user/xyz/ ??

Fundamental question here , why it is trying to write in "/tmp/hadoop-admin/" instead of respective user directory i.e. hdfs://qa2/user/xyz/ ??

我已经解决了这个问题

https://issues.apache.org/jira/browse/SPARK-26825

但是不确定如何在我的spark-sql应用程序中实现它. 任何帮助都将受到高度赞赏.

But not sure how to implement it in my spark-sql application. Any help is highly appreciated.

Spark Structure Streaming作业在集群模式下失败 [英] Spark Structure Streaming job failing in cluster mode

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

Spark Structure Streaming作业在集群模式下失败 [英] Spark Structure Streaming job failing in cluster mode

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

登录关闭