How to get the partitionId or taskContext of Spark in a Hive UDF when SQL is executed with the Spark engine?

Views: 128  Posted: 2021/6/25 18:37:02  Tags: apache-spark hive apache-spark-sql user-defined-functions

Problem description

For example, suppose we execute the SQL below with the Spark engine and need my_udf(row) to return the ID of the Spark partition that the row is processed in.

add jar hdfs:///dir/udf/udf.jar; 
create temporary function my_udf as 'com.my.MyUDF';

select row, my_udf(row) from table;

I already know how to get the taskId in a Hive UDF executed with the MR engine (see "How to get the taskID or mapperID (something like partitionID in Spark) in a Hive UDF?"), but that approach does not work when the query is executed with the Spark engine. How can I get the Spark partitionID or taskContext inside a Hive UDF?
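One possible direction, offered as a sketch rather than a confirmed answer: Spark exposes the running task's context through the thread-local accessor org.apache.spark.TaskContext.get(), whose partitionId() method returns the partition index. The example below assumes the UDF is evaluated on the Spark task thread in the executor JVM, which may not hold in every deployment. The class name SparkPartitionProbe and the helper currentPartitionId are hypothetical, and reflection is used only so the sketch compiles and runs without Spark or Hive on the classpath. In a real UDF, the evaluate method would live in the class registered as 'com.my.MyUDF', which extends Hive's UDF base class.

```java
import java.lang.reflect.Method;

// Hypothetical helper: looks up Spark's TaskContext reflectively so that this
// file compiles and runs even when Spark is not on the classpath.
final class SparkPartitionProbe {

    /** Returns the partition id of the current Spark task, or -1 if unavailable. */
    static int currentPartitionId() {
        try {
            Class<?> taskContextClass = Class.forName("org.apache.spark.TaskContext");
            // TaskContext.get() is a static accessor; it returns null when the
            // calling thread is not running inside a Spark task (e.g. on the driver).
            Object ctx = taskContextClass.getMethod("get").invoke(null);
            if (ctx == null) {
                return -1;
            }
            // Resolve partitionId() on the public TaskContext class rather than
            // on the runtime implementation class.
            Method partitionId = taskContextClass.getMethod("partitionId");
            return (Integer) partitionId.invoke(ctx);
        } catch (ReflectiveOperationException e) {
            // Spark is not on the classpath, or the call failed.
            return -1;
        }
    }

    // Assumption: in a real Hive UDF this method would belong to a class that
    // extends org.apache.hadoop.hive.ql.exec.UDF; the argument is ignored here.
    public int evaluate(Object row) {
        return currentPartitionId();
    }

    public static void main(String[] args) {
        // Outside a Spark task (or without Spark on the classpath) this prints -1.
        System.out.println(currentPartitionId());
    }
}
```

If the query is run by Spark SQL itself, the built-in function spark_partition_id() may be a simpler alternative, for example: select row, spark_partition_id() from table. Whether it is available depends on which engine actually parses and executes the statement.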
