SQL SERVER – Delete Duplicate Records – Rows

March 1, 2007

Following code is useful to delete duplicate records. The table must have identity column, which will be used to identify the duplicate records. Table in example is has ID as Identity Column and Columns which have duplicate data are DuplicateColumn1, DuplicateColumn2 and DuplicateColumn3.

DELETE FROM MyTable WHERE ID NOT IN ( SELECT MAX(ID) FROM MyTable GROUP BY DuplicateColumn1, DuplicateColumn2, DuplicateColumn3)

Watch the view to see the above concept in action:

[youtube=http://www.youtube.com/watch?v=ioDJ0xVOHDY]

Reference : Pinal Dave (https://blog.sqlauthority.com)

Duplicate Records, SQL Scripts

SQL SERVER – T-SQL Script to find the CD key from Registry

SQL SERVER – QUOTED_IDENTIFIER ON/OFF and ANSI_NULL ON/OFF Explanation

SQL SERVER – Using MaxTransferSize parameter with SQL Server Backups

March 30, 2015

SQL SERVER – Difference Between EXEC and EXECUTE vs EXEC() – Use EXEC/EXECUTE for SP always

September 13, 2007

SQL SERVER – Encrypted Stored Procedure and Activity Monitor

October 10, 2010

450 Comments. Leave new

preveen
November 20, 2007 8:43 pm
hi
how to find 3rd maximum salary, like that how delete duplicate values in a table
Thanks
Praveen
Reply
- Madhivanan
  May 17, 2010 1:51 pm
  I have already replied to many people
  Refer this post and choose the effecient method
  Reply
suresh
November 29, 2007 2:52 pm
sir,
i am very much imperssed with u r ans. but if no of columes will be more than 100 then what is the procedure
please repaly to this one to my mail id if possible
Reply
Ramesh
December 8, 2007 4:39 pm
Hi,
Among Joins and Subquery, which one is better approach?
Can u pls tell me this.
Thanks
Ramesh
Reply
- Madhivanan
  May 17, 2010 1:52 pm
  It depnds on the specific case you are using
  Usually joins are effecient
  Reply
Kannan
December 13, 2007 5:01 pm
very helpful for me to delete duplicate records
Reply
Jim
December 18, 2007 10:42 pm
Hi,
I need to delete all the duplicate records with MAPID being the duplicated FIELD where ADDRESSCOUNT = 0 being the other defining criteria. My table looks like;
ID MAPID ADDRESSCOUNT
111 54560 4
132 54560 0
198 23429 1
240 29584 1
248 29584 0
Any help appreciated.
Using MS SQL Server 2000.
Regards,
Jim
Reply
- Madhivanan
  May 17, 2010 1:52 pm
  Post the expected result so that it is easy for us to write the query
  Reply
milind more
December 19, 2007 11:29 am
how i find recent row updated i would like to use this row in trigger after update in table
Reply
Arun
December 19, 2007 7:35 pm
hi Mr. Dave,
This question was asked to me in an interview and I was unable to answer.
Now I got the solution.
ThankU Very Much
Reply
Anand
December 21, 2007 2:02 am
Hi jim(54),
this should work for you.
DELETE
FROM duptest
WHERE MAPID IN
(select MAPID
from duptest
group by mapid
having count(mapid) > 1) and AddressCount = 0
cheers,
anand.
Reply
- sandeep
  April 21, 2010 2:09 pm
  hello ur solution is rite but all duplicate record is deleted i want to keep one record from duplicacy row in table
  Reply
  - jymy solunki
    June 30, 2012 3:06 pm
    This logic is delete all record only not be douplicate
jaya
December 25, 2007 3:49 pm
very nice logic…u r great
Reply
ATIN
December 26, 2007 11:57 am
–For finding second highest salary
select max(salary) from emp where salary<max(salary)
Reply
sandy
December 31, 2007 11:54 pm
How to recover deleted records from any perticular table
Reply
ronak
January 1, 2008 3:18 pm
in case of no id column in table one can delete duplicate rows as below
create view abc as select *,row_number() OVER (PARTITION BY dupcol1,dupcol2,… ORDER BY dupcol1,dupcol2,…) as rnum from
table
delete from abc where rnum > 1
drop view abc
Reply
Kanhaiya
January 2, 2008 3:42 pm
Hi All,
I would like to share one suggestion that is :
should’t We conclude each topic with one best answer (if we can).
Reply
Priya
January 4, 2008 12:35 pm
Hi,
Thank you so much for sharing your knowledge.
Great Work.
Reply
satish
January 4, 2008 6:18 pm
hi,
realy this is super i have not found any where very nice.
cheers,
Satish
Reply
Suchita
January 5, 2008 1:43 pm
Hello Sir,
I recently joined your site, and found it really very helpful.
How about using ‘ROWID’ to delete the duplicate rows.
Please check this query.
DELETE
FROM MyTable
WHERE ROWID NOT IN
(SELECT MIN(ROWID)
FROM MyTable
GROUP BY DUPL_COL1,DUPL_COL2,DUPL_COL3)…
–All col names
Now, my question is that if I have more than two duplicate records I want to keep 2 of them and to remove rest.
How can I do it?
Please help out.
Thank you.
Reply
ssatish kumar
January 5, 2008 2:01 pm
Hi
its a great thing to share knowledge
thanks for your help
ssatish kumar
Reply
Angadi Doddappa
January 8, 2008 11:49 am
Hi ATIN(59),
For finding only second highest salary – –
select * from
(select * from employee orderby salary desc
where rownum>=2)
minus
select * from
(select * from employee orderby salary desc
where rownum>=1) ;
And to get Only Top 2 Highest salary —
select * from
(select * from employee orderby salary desc
where rownum>=2);
Thanks & Regards
Angadi Doddappa
Reply
Sameer Bhatnagar
January 12, 2008 1:19 pm
Thanks Pinal Dave….
U are doing a great Job…
All the Best all Of u…
Jai Hind…
Reply
Ranjith
January 15, 2008 12:01 pm
Hi,
This page looks really cool, hope I will get answer for my question, I have a table with 35 columns and have duplicate rows based on 6 columns. So how do I remove duplicates and keep the original rows in the table, keep in mind table has around 500,000 rows.
Reply
- Madhivanan
  May 17, 2010 1:54 pm
  You need to use thos six columns in the GROUP BY clause
  Reply