Postgresql使用IN vs NOT IN时的巨大性能差异 [英] Postgresql huge performance difference when using IN vs NOT IN

查看：681 发布时间：2020/5/30 0:13:25 postgresql sql-execution-plan explain

本文介绍了Postgresql使用IN vs NOT IN时的巨大性能差异的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我有2张桌子， transaksi和 buku。 transaksi大约有25万行，而buku大约有17万行。两个表的列均称为 k999a，并且两个表均不使用索引。现在我检查这两个语句。

I have 2 tables, "transaksi" and "buku". "transaksi" has around ~250k rows, and buku has around ~170k rows. Both tables have column called "k999a", and both tables use no indexes. Now I check these 2 statements.

声明1：

explain select k999a from transaksi where k999a not in (select k999a from buku);

陈述1输出：

 Seq Scan on transaksi  (cost=0.00..721109017.46 rows=125426 width=9)
   Filter: (NOT (SubPlan 1))
   SubPlan 1
     ->  Materialize  (cost=0.00..5321.60 rows=171040 width=8)
           ->  Seq Scan on buku  (cost=0.00..3797.40 rows=171040 width=8)

声明2：

explain select k999a from transaksi where k999a in (select k999a from buku);

陈述2输出：

Hash Semi Join  (cost=6604.40..22664.82 rows=250853 width=9)
   Hash Cond: (transaksi.k999a = buku.k999a)
   ->  Seq Scan on transaksi  (cost=0.00..6356.53 rows=250853 width=9)
   ->  Hash  (cost=3797.40..3797.40 rows=171040 width=8)
         ->  Seq Scan on buku  (cost=0.00..3797.40 rows=171040 width=8)

为什么在不在查询中，postgresql会循环联接，使查询花费很长时间吗？

Why in the NOT IN query, postgresql does loop join, making the query takes a long time?

PS：Windows 10上的postgresql版本9.6.1

PS: postgresql version 9.6.1 on windows 10

推荐答案

这是预料之中的。使用 WHERE NOT EXISTS 可能会获得更好的性能：

This is to be expected. You may get better performance using WHERE NOT EXISTS instead:

SELECT k999a
FROM transaksi
WHERE NOT EXISTS (
    SELECT 1 FROM buku WHERE buku.k999a = transaksi.k999a LIMIT 1
);

这里很好地解释了每种方法的原因： https://explainextended.com/2009 / 09/16 / not-in-vs-not-exists-vs-left-join-is-null-postgresql /

Here is a good explanation as to why for each of the methods: https://explainextended.com/2009/09/16/not-in-vs-not-exists-vs-left-join-is-null-postgresql/

这篇关于Postgresql使用IN vs NOT IN时的巨大性能差异的文章就介绍到这了，希望我们推荐的答案对大家有所帮助，也希望大家多多支持IT屋！

查看全文

Postgresql使用IN vs NOT IN时的巨大性能差异 [英] Postgresql huge performance difference when using IN vs NOT IN

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

Postgresql使用IN vs NOT IN时的巨大性能差异 [英] Postgresql huge performance difference when using IN vs NOT IN

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

登录关闭