spark: What is the difference between Aggregator and UDAF?


Problem Description

In Spark's documentation, Aggregator:


abstract class Aggregator[-IN, BUF, OUT] extends Serializable

A base class for user-defined aggregations, which can be used in Dataset operations to take all of the elements of a group and reduce them to a single value.
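
For reference, here is a minimal sketch of what implementing this contract looks like, loosely following the averaging example in the Spark SQL documentation; the Employee and Average case classes and the MyAverage name are illustrative, not part of the question:

import org.apache.spark.sql.{Encoder, Encoders}
import org.apache.spark.sql.expressions.Aggregator

// Illustrative input (IN) and buffer (BUF) types
case class Employee(name: String, salary: Long)
case class Average(var sum: Long, var count: Long)

object MyAverage extends Aggregator[Employee, Average, Double] {
  // The neutral element of the aggregation
  def zero: Average = Average(0L, 0L)
  // Fold one input object into the buffer and return the updated buffer
  def reduce(buffer: Average, employee: Employee): Average = {
    buffer.sum += employee.salary
    buffer.count += 1
    buffer
  }
  // Combine two intermediate buffers
  def merge(b1: Average, b2: Average): Average = {
    b1.sum += b2.sum
    b1.count += b2.count
    b1
  }
  // Transform the final buffer into the output (OUT) value
  def finish(reduction: Average): Double = reduction.sum.toDouble / reduction.count
  // Encoders for the intermediate and output types
  def bufferEncoder: Encoder[Average] = Encoders.product[Average]
  def outputEncoder: Encoder[Double] = Encoders.scalaDouble
}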

UserDefinedAggregateFunction is:


abstract class UserDefinedAggregateFunction extends Serializable

The base class for implementing user-defined aggregate functions (UDAF).
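
And a corresponding sketch of the untyped UDAF contract, again adapted from the standard averaging example in the Spark documentation; the MyAverageUDAF name is illustrative:

import org.apache.spark.sql.Row
import org.apache.spark.sql.expressions.{MutableAggregationBuffer, UserDefinedAggregateFunction}
import org.apache.spark.sql.types._

object MyAverageUDAF extends UserDefinedAggregateFunction {
  // Data types of the input arguments of this aggregate function
  def inputSchema: StructType = StructType(StructField("inputColumn", LongType) :: Nil)
  // Data types of the values in the aggregation buffer
  def bufferSchema: StructType =
    StructType(StructField("sum", LongType) :: StructField("count", LongType) :: Nil)
  // The data type of the returned value
  def dataType: DataType = DoubleType
  // Whether this function always returns the same output for identical input
  def deterministic: Boolean = true
  // Initializes the externally supplied aggregation buffer
  def initialize(buffer: MutableAggregationBuffer): Unit = {
    buffer(0) = 0L
    buffer(1) = 0L
  }
  // Updates the buffer with a new input Row
  def update(buffer: MutableAggregationBuffer, input: Row): Unit = {
    if (!input.isNullAt(0)) {
      buffer(0) = buffer.getLong(0) + input.getLong(0)
      buffer(1) = buffer.getLong(1) + 1
    }
  }
  // Merges two aggregation buffers
  def merge(buffer1: MutableAggregationBuffer, buffer2: Row): Unit = {
    buffer1(0) = buffer1.getLong(0) + buffer2.getLong(0)
    buffer1(1) = buffer1.getLong(1) + buffer2.getLong(1)
  }
  // Calculates the final result from the buffer
  def evaluate(buffer: Row): Double = buffer.getLong(0).toDouble / buffer.getLong(1)
}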

According to Dataset Aggregator - Databricks, "an Aggregator is similar to a UDAF, but the interface is expressed in terms of JVM objects instead of as a Row."

These two classes seem very similar. Apart from the types in the interface, what other differences are there?

A similar question is: Performance of UDAF versus Aggregator in Spark

Recommended Answer

A fundamental difference, apart from types, is the external interface:


  • Aggregator takes a complete Row (it is intended for the "strongly" typed API).
  • UserDefinedAggregateFunction takes a set of Columns.

This makes Aggregator less flexible, although the overall API is far more user friendly.
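
A short usage sketch of the two interfaces, assuming the illustrative MyAverage and MyAverageUDAF objects defined above, an active SparkSession named spark, and a hypothetical employees.json file with name and salary fields:

import org.apache.spark.sql.Dataset
import spark.implicits._

// Aggregator: turned into a TypedColumn with toColumn and used on a typed Dataset[Employee]
val ds: Dataset[Employee] = spark.read.json("employees.json").as[Employee]
val typedAvg = ds.select(MyAverage.toColumn.name("average_salary"))

// UDAF: registered by name and applied to Columns of an untyped DataFrame
spark.udf.register("myAverage", MyAverageUDAF)
val untypedAvg = spark.read.json("employees.json").selectExpr("myAverage(salary) AS average_salary")

The Aggregator works against the object type of the Dataset, while the UDAF only sees the Columns it is given, which is the interface difference described in the bullets above.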

Handling of state is also different:


  • Aggregator is stateful; it depends on the mutable internal state of its buffer field.
  • UserDefinedAggregateFunction is stateless; the state of the buffer is external.

