Impala GROUP BY partitioned column

Published: 2020/11/27 4:56:13. Tags: hadoop2, impala


A theoretical question.

Let's say I have a table with four columns: A, B, C, D. The values of A and D are equal in every row, and the table is partitioned by column A.

Performance-wise, would it make any difference whether I issue this query: SELECT SUM(B) GROUP BY A; or this one: SELECT SUM(B) GROUP BY D;

In other words, is there any performance gain from using GROUP BY on the partition column rather than on a regular column that holds the same values?
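
To make the setup concrete, here is a minimal sketch of what I mean. The table name t and the INT column types are assumptions for illustration only, since the real table may differ. Running EXPLAIN on each query prints the plan Impala would use, which is one way the two could be compared side by side.

```sql
-- Hypothetical table: the name t and the INT types are assumptions.
-- Column a is the partition column; d is a regular column with the same values.
CREATE TABLE t (b INT, c INT, d INT) PARTITIONED BY (a INT);

-- Query 1: group by the partition column.
EXPLAIN SELECT SUM(b) FROM t GROUP BY a;

-- Query 2: group by the regular column d.
EXPLAIN SELECT SUM(b) FROM t GROUP BY d;
```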

Thanks.