如何删除 SQL Server 中的重复行? [英] How to delete duplicate rows in SQL Server?
问题描述
如何删除不存在唯一行ID
的重复行?
How can I delete duplicate rows where no unique row id
exists?
我的桌子是
col1 col2 col3 col4 col5 col6 col7
john 1 1 1 1 1 1
john 1 1 1 1 1 1
sally 2 2 2 2 2 2
sally 2 2 2 2 2 2
我想在删除重复项后留下以下内容:
I want to be left with the following after the duplicate removal:
john 1 1 1 1 1 1
sally 2 2 2 2 2 2
我尝试了一些查询,但我认为它们取决于行 ID,因为我没有得到想要的结果.例如:
I've tried a few queries but I think they depend on having a row id as I don't get the desired result. For example:
DELETE
FROM table
WHERE col1 IN (
SELECT id
FROM table
GROUP BY id
HAVING (COUNT(col1) > 1)
)
推荐答案
我喜欢 CTE 和 ROW_NUMBER
因为两者结合起来让我们可以看到哪些行被删除(或更新),因此只需更改DELETE FROM CTE...
到 SELECT * FROM CTE
:
I like CTEs and ROW_NUMBER
as the two combined allow us to see which rows are deleted (or updated), therefore just change the DELETE FROM CTE...
to SELECT * FROM CTE
:
WITH CTE AS(
SELECT [col1], [col2], [col3], [col4], [col5], [col6], [col7],
RN = ROW_NUMBER()OVER(PARTITION BY col1 ORDER BY col1)
FROM dbo.Table1
)
DELETE FROM CTE WHERE RN > 1
DEMO(结果不同;我假设这是由于您的拼写错误)
DEMO (result is different; I assume that it's due to a typo on your part)
COL1 COL2 COL3 COL4 COL5 COL6 COL7
john 1 1 1 1 1 1
sally 2 2 2 2 2 2
由于 PARTITION BY col1
,此示例通过单列 col1
确定重复项.如果您想包含多个列,只需将它们添加到 PARTITION BY
:
This example determines duplicates by a single column col1
because of the PARTITION BY col1
. If you want to include multiple columns simply add them to the PARTITION BY
:
ROW_NUMBER()OVER(PARTITION BY Col1, Col2, ... ORDER BY OrderColumn)
这篇关于如何删除 SQL Server 中的重复行?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!