SQL SERVER – Count Duplicate Records – Rows

July 11, 2007

In my previous article SQL SERVER – Delete Duplicate Records – Rows, we have seen how we can delete all the duplicate records in one simple query. In this article we will see how to find count of all the duplicate records in the table. Following query demonstrates usage of GROUP BY, HAVING, ORDER BY in one query and returns the results with duplicate column and its count in descending order.

SELECT YourColumn, COUNT(*) TotalCount FROM YourTable GROUP BY YourColumn HAVING COUNT(*) > 1 ORDER BY COUNT(*) DESC

Watch the view to see the above concept in action:

[youtube=http://www.youtube.com/watch?v=ioDJ0xVOHDY]

Reference : Pinal Dave (https://blog.sqlauthority.com) , SQL SERVER – Delete Duplicate Records – Rows

Duplicate Records, SQL Scripts

SQL SERVER – 2005 – List All Stored Procedure Modified in Last N Days

SQL SERVER – 2008 – Scheduled Launch at an Event in Los Angeles on Feb. 27, 2008

187 Comments. Leave new

Michael
April 23, 2011 7:26 pm
Thank you for the duplicate sql query you supplied. It was almost exactly what I was looking for.
Reply
Ashish
June 5, 2011 3:10 pm
regarding duplicate records:
i want to fetch only duplicate record from a table. suppose in a table there is 2000 records , out of which 10 records are duplicate. so how we can get only 10 duplicate records.
Reply
- madhivanan
  June 10, 2011 4:19 pm
  select col from table
  group by col
  having count(*)>1
  Reply
Barrington
June 21, 2011 11:30 pm
Hi here,
Apologies if this has been asked before, but I’ve not been able to see it. The query at the top to count duplicate records and order them in descending order is great, however, this appears to only work with fields from other tables.
I have set up an expression that combines a few fields together (eg concatenating surname, forname and DOB from a table into a new field in the query and called it “Fuzzy_ID”).
I wish to find duplicates of the “Fuzzy_ID”, but get errors regarding “Fuzzy_ID” not being an identifier in the GROUP BY line and can’t get the HAVING COUNT(*) to work either.
Must I create two queries? IE would there be one query (QUERY_A) to create FUZZY_ID field and then create a second query (QUERY_B) to use the SQL above basing the count on FUZZY_ID from QUERY_A?
Many thanks
Reply
Krishna
June 27, 2011 8:37 am
Hi,
How can i get the particular character location/position from table column.
ex. select * from TABLENAME Functionname(table_column,’?’)
Reply
- madhivanan
  June 27, 2011 2:27 pm
  Use charindex function
  charindex(‘?’,col)
  Reply
  - Krishna
    June 28, 2011 1:29 pm
    Thank you
sikha
July 2, 2011 10:12 am
hi ,
in my query i got 2 rows .how to count the row from this select statement.
my selected query is ds
select PSCInspectionID,SUM(ISMDeficiencies + NonISMDeficiencies ) from tPSCInspection
join mPSCMoU on mPSCMoU.PSCMoUID =tPSCInspection.PSCMoUID where mPSCMoU.IsParisMoU=1
group by PSCInspectionID having SUM(ISMDeficiencies + NonISMDeficiencies )>=5
the out put is
i got two rows
if i run ds query to count the row
SELECT COUNT(*) as total_duplicates FROM
(
select PSCInspectionID,SUM(ISMDeficiencies + NonISMDeficiencies ) from tPSCInspection
join mPSCMoU on mPSCMoU.PSCMoUID =tPSCInspection.PSCMoUID where mPSCMoU.IsParisMoU=1
group by PSCInspectionID having SUM(ISMDeficiencies + NonISMDeficiencies )>=5
) as t
i got an error
like ds
Msg 8155, Level 16, State 2, Line 15
No column name was specified for column 2 of ‘t’.
wht is wrong in my query
can u pls help me
Reply
Krishna
July 27, 2011 10:58 am
Hi,
IN SQL 2005 – Some Stored procedures have “modify” enabled some are having “modify” disabled on Right click.
while checking the properties of that StoredProcedures
in Options “encrypted = True”
How to change cncrypted=FALSE.
Reply
URLShortener
September 2, 2011 9:20 pm
Thanks has helped alot
Reply
Derek
September 13, 2011 6:28 pm
Thank you. I’d forgotten how to do this and your post pushed me back in the right direction.
Reply
Javed Ahmad Qureshi
October 12, 2011 4:59 pm
I have a question, could anybody guide me to have a WHILE … LOOP query to run the duplication check on first two columns of all tables in a specified database?
Reply
- madhivanan
  October 12, 2011 6:58 pm
  Why do you want to use while loop? What is wrong with the example given in this post? Using While loop is ineffecient
  Reply
  - Javed Ahmad Qureshi
    October 13, 2011 9:31 am
    I am into an upgrade process of my software application, and it gives me error of duplicate records found in tables. My database contains more than 1000 tables and I’m unable to execute the query on each table manually. Is there any possibility that a single query can check all tables in the whole database for duplication?
  - madhivanan
    October 13, 2011 5:11 pm
    Duplicate on what basis? Are the columns fixed?
Suresh Kumar VC
October 25, 2011 12:08 pm
How can I get a count query result for the following email table where direction column contains 1 for incoming and 2 for outgoing :
id | email data | direction
1 | abcded |1
1 | sdfdfdf |2
1 | asdadsf |1
1 | dfdfdf |2
I want the out put as :Email Report:
id | incoming | outgoing
1 | 2 | 2
Can any one please advise me…Thanks a lot in advance.
God Bless you all for a happy,peaceful and non violent , humanly world
Reply
- madhivanan
  November 3, 2011 3:23 pm
  select id, sum(case when direction=1 then 1 else 0 end) as incoming,sum(case when direction=2 then 1 else 0 end) as incoming from table
  group by id
  Reply
  - Suresh Kumar VC
    November 16, 2011 10:17 pm
    Hello Madhivanan,
    Thanks a lot and all the best wishes…Regards, Suresh Kumar VC
uday
October 25, 2011 3:34 pm
how can i sum alternate rows in table
Example: 12 AM
3AM
6AM
10AM
1st record is IN Time
2nd record is out Time
now i need to sum the alternate rows and this the time where a Employee comes in and goes out so i need to get his working hrs
Now result should be as below
3 HRs (12 to 3 AM)
4 hrs (6 to 10 AM)
Reply
RAHUL
November 2, 2011 8:36 am
i have one table empid. day1,day2,day3,da4; and its value is e001,pp,pp,pp,aa…….
now iwant to know how many columns having value pp of empid e001………….plz tell me fast…
Reply
- madhivanan
  November 2, 2011 5:16 pm
  select empid,case when day1=’pp’ then 1 end +case when day2=’pp’ then 1 end +case when day3=’pp’ then 1 end +case when day4=’pp’ then 1 end +case when day5=’pp’ then 1 end from table where empid=’e001′
  Reply
RAHUL
November 2, 2011 8:43 am
RESULT SHOULD BE 3…..TELL ME
Reply
guna
November 2, 2011 4:20 pm
Hello sir … 100 rows in a table i want split first 25 and second 25 rows only … 25 wise
how do please help me sir…
Reply
- madhivanan
  November 2, 2011 5:12 pm
  You need pagination. Refer method 4 here
  Reply
guna
November 3, 2011 12:01 pm
sir .. i have 2 variables … a and b … a=25, b=148, and divide b/a=c,
this balance 0 then print c value otherwise balance available then add 1 to c ..
c+=1;
how to write query please help me sir…
Reply
- madhivanan
  November 3, 2011 1:47 pm
  you need to use
  select a*1.0/b
  For more informations refer this
  Reply
Ben
November 26, 2011 5:36 am
Hi,
I’m looking for a similar but different query that I’ve been twisting my brain trying to figure out. Your post is the closest I’ve come to finding an answer.
I’m querying a table where multiple people might be selling the same item (ItemID). What I want to do is get all unique items, and for each unique item, get a count of how many duplicates there are for that unique item. I want a list of all unique items, not just the ones that have duplicates. (For ones with no duplicates, the duplicate count can can be one or zero, I don’t care.)
A simplified version of the ForSale table is:
[ID] [uniqueidentifier] NOT NULL CONSTRAINT [DF_Selling_ID] DEFAULT (newsequentialid()), — This is the primary key
[ItemID] [uniqueidentifier] NOT NULL,
[Description] [nvarchar](max) NULL
There is a second table “Items” that contains the information about the item in question. I’m trying to link the output of the
I’ve gotten almost all the way with your above query:
SELECT ItemID, Count(1) TotalCount
FROM ForSale
Group BY ItemID
HAVING COUNT(1) > 0
ORDER BY COUNT(1) DESC
But I can’t figure out how to get the Item info also brought into the results of the query. I thought an inner join would do the trick, but then I end up with a record for each ForSale, defeating the unique.
Any ideas?
Reply
Arman
December 5, 2011 5:28 pm
Thanks. Just as exactly what I am looking for. It worked as expected.
Reply
abc
December 12, 2011 5:27 pm
thanx….
Reply
Pablo
December 16, 2011 9:31 pm
hi, thank you, very well.
Reply
Waqar Ismail
January 18, 2012 5:16 pm
Hi pinal ,
I have some problem regarding sqlserver which is that i have db in which i have to put condition like user can save five time data and when they do six time it will not add and mention the error,.
Thanx
Waqar Isamil
Reply