Apache Spark: Yarn logs Analysis
Question
I have a Spark Streaming application, and I want to analyse the job's logs using Elasticsearch-Kibana. My job runs on a YARN cluster, so the logs are written to HDFS, as I have set yarn.log-aggregation-enable to true. But when I try to do this:
hadoop fs -cat ${yarn.nodemanager.remote-app-log-dir}/${user.name}/logs/<application ID>
I see some encrypted/compressed data. What file format is this? How can I read the logs from this file? Can I use Logstash to read it?
Also, if there is a better approach to analyse Spark logs, I am open to your suggestions.
Thanks.
Answer
The format is called a TFile, and it is a compressed file format.
Yarn, however, chooses to write the application logs into a TFile! For those of you who don't know what a TFile is (and I bet a lot of you don't), you can learn more about it here, but for now this basic definition should suffice: "A TFile is a container of key-value pairs. Both keys and values are type-less bytes".
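In practice you usually don't need to parse the TFile yourself: the YARN CLI can decode aggregated logs back into plain text. A minimal sketch (the application ID below is a placeholder for your own job's ID):

```shell
# Dump the aggregated (TFile-encoded) logs of a finished application
# as plain text; run as the user who submitted the job.
yarn logs -applicationId application_1234_0001 > app_logs.txt
```

The resulting plain-text file can then be shipped to Elasticsearch, for example by pointing a Logstash file input at it.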
There may be a way to edit YARN's and Spark's log4j.properties to send messages to Logstash using SocketAppender.
However, that approach has been deprecated.
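As a rough sketch of that idea, a log4j 1.x configuration fragment using SocketAppender might look like the following; the hostname and port are placeholders for your Logstash instance:

```properties
# Hypothetical example: route log events to a remote Logstash listener.
log4j.rootLogger=INFO, console, logstash

log4j.appender.logstash=org.apache.log4j.net.SocketAppender
log4j.appender.logstash.RemoteHost=logstash.example.com
log4j.appender.logstash.Port=4560
log4j.appender.logstash.ReconnectionDelay=10000
```

Note that SocketAppender sends serialized Java LoggingEvent objects rather than plain text, so the receiving end must understand that format; this is part of why the approach fell out of favour on the Logstash side.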