为什么 Flink SQL 对所有表都使用 100 行的基数估计? [英] Why does Flink SQL use a cardinality estimate of 100 rows for all tables?

查看：22 发布时间：2021/11/12 0:59:48 java scala apache-flink apache-calcite flink-sql

本文介绍了为什么 Flink SQL 对所有表都使用 100 行的基数估计?的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我更深入地查看了 Flink 基本代码，并在 calcite 评估/估计对象中查询的行数时检查了这一点.出于某种原因，对于任何表源，它总是返回100.

I looked more deeply in the Flink base code and I checked that when calcite evaluate/estimate the number of rows for the query in object. For some reason it returns always 100 for any table source.

实际上在 Flink 中，在程序计划创建的过程中，对于每个转换后的规则，它被称为 VolcanoPlanner 类由 TableEnvironment.runVolcanoPlanner.规划器尝试通过调用 RelMetadataQuery.getRowCount

In Flink in fact, during the process of the program plan creation, for each transformed rule it is called the VolcanoPlanner class by the TableEnvironment.runVolcanoPlanner. The planner try to optimise and calculate some estimation by calling RelMetadataQuery.getRowCount

我通过创建一个失败的test 应该断言 0 作为关系表 'S' 的行数，但它总是返回 100.

I reproduced the error by creating a failing test which should assert 0 as row count for relation table 'S' but it returns always 100.

为什么会这样?有人对这个问题有答案吗?

Why this is happening? Does anyone has an answer to this issue?

为什么 Flink SQL 对所有表都使用 100 行的基数估计? [英] Why does Flink SQL use a cardinality estimate of 100 rows for all tables?

问题描述

推荐答案

相关文章

Java开发最新文章

热门教程

热门工具

登录关闭

为什么 Flink SQL 对所有表都使用 100 行的基数估计? [英] Why does Flink SQL use a cardinality estimate of 100 rows for all tables?

问题描述

推荐答案

相关文章

Java开发最新文章

热门教程

热门工具

登录 关闭

登录关闭