EXISTS Conditions: A Condition That Tests for the Existence of Rows in a Subquery

The document describes an efficient method for deleting duplicate rows from a table using a PL/SQL stored procedure. The procedure works by:

1. Selecting duplicate rows into a cursor, sorted by the duplicate key columns.
2. Looping through the cursor rows, comparing the current row's key to the previous one, and deleting any rows with matching keys except the first.

This allows controlling which duplicate row is kept for each group, and performs the deletion more efficiently than a NOT IN clause by avoiding multiple table scans. On a test table with 500k rows and 45k duplicates, the stored procedure completed the deletion much faster than an alternative SQL method.

Uploaded by

yalamandu4358

EXISTS Conditions

An EXISTS condition tests for existence of rows in a subquery.

exists_condition::=

EXISTS (subquery)

Table 5-10 shows the EXISTS condition.

Table 5-10 EXISTS Conditions

Condition:  EXISTS
Operation:  TRUE if a subquery returns at least one row.
Example:

    SELECT department_id
      FROM departments d
      WHERE EXISTS
        (SELECT * FROM employees e
          WHERE d.department_id = e.department_id);

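The negated form is often as useful as the condition itself. As a minimal sketch, assuming the same departments and employees tables used in the example above, NOT EXISTS returns the departments for which the subquery finds no rows at all:

```sql
-- Departments that currently have no employees.
-- The subquery is correlated on department_id; SELECT 1 is used because
-- EXISTS only tests whether a row comes back, not what it contains.
SELECT d.department_id
  FROM departments d
  WHERE NOT EXISTS
    (SELECT 1
       FROM employees e
      WHERE e.department_id = d.department_id);
```
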
Description: This follows up on your tip of the week for 08/04/2002, submitted by Madan Patil, and shows
another way of deleting duplicate rows from a table. The difference is speed: this method deletes the
duplicate rows many times faster than the earlier one.

Take a table containing three columns, col1, col2, and col3. The following command deletes the duplicate
rows from the table (replace table_name with the name of your table):

delete from table_name where rowid in (
  SELECT rowid FROM table_name
  group by rowid,col1,col2,col3
  minus
  SELECT min(rowid) FROM table_name
  group by col1,col2,col3);

To show the difference let's consider a table:

Table: EMP
EMPNO NUMBER
ENAME VARCHAR2(20)
JOB VARCHAR2(20)

CREATE TABLE EMP (
  EMPNO NUMBER,
  ENAME VARCHAR2(20),
  JOB   VARCHAR2(20)
);

-- 20 copies of the first row
begin
  for i in 1..20 loop
    insert into emp values (1,'xx','clerk');
  end loop;
  commit;
end;
/
-- 20 copies of the second row
begin
  for i in 1..20 loop
    insert into emp values (2,'yy','accountant');
  end loop;
  commit;
end;
/
-- 20000 copies of the third row
begin
  for i in 1..20000 loop
    insert into emp values (3,'zz','manager');
  end loop;
  commit;
end;
/
-- 10000 copies of the fourth row
begin
  for i in 1..10000 loop
    insert into emp values (4,'ab','accountant');
  end loop;
  commit;
end;
/
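
Before timing either delete, it can help to confirm what the test data looks like. The query below is a small sanity check, not part of the original tip; it assumes only the EMP table and the four insert loops shown above, and lists each duplicate group with its size.

```sql
-- List every (empno, ename, job) combination that occurs more than once,
-- together with the number of copies.
SELECT empno, ename, job, COUNT(*) AS copies
  FROM emp
 GROUP BY empno, ename, job
HAVING COUNT(*) > 1
 ORDER BY empno;
```

With the data loaded as above, this should return four groups, with 20, 20, 20000, and 10000 copies respectively.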

Using the previous method as in your TIP for the Week 08/04/2002

------------------------------------------------------------------------
SQL> select count(*) from emp;

COUNT(*)
----------
30040

SQL> set timing on;

SQL> DELETE FROM EMP E
2 WHERE E.ROWID > ANY (SELECT ROWID
3 FROM EMP M
4 WHERE M.EMPNO = E.EMPNO
5 AND M.ENAME = E.ENAME
6 AND M.JOB = E.JOB );

30036 rows deleted.

Elapsed: 00:03:207.48

SQL> select count(*) from emp;

COUNT(*)
----------
4

Elapsed: 00:00:00.10

Using the NEW suggested method:

-------------------------------------------
SQL> select count(*) from emp;

COUNT(*)
----------
30040

SQL> delete from emp where rowid in (
2 SELECT rowid FROM emp
3 group by rowid,empno,ename,job
4 minus
5 SELECT min(rowid) FROM emp
6 group by empno,ename,job);

30036 rows deleted.

Elapsed: 00:00:02.94

SQL> select count(*) from emp;

COUNT(*)
----------
4

Elapsed: 00:00:00.10
--------------------------------------------------------------------

As the timings show, the new method achieves the same result many times faster. This is because it uses
a set operator (MINUS) to compute the list of duplicate rows instead of a correlated subquery. The bigger
the table, the more noticeable the difference.
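
The set operation can also be run on its own as a dry run before anything is deleted. The sketch below assumes the EMP test table from above and simply counts the rowids that the delete would target. The first branch is written without the GROUP BY used in the tip, since grouping by rowid yields one row per rowid anyway.

```sql
-- All rowids, minus the one rowid kept per (empno, ename, job) group,
-- leaves exactly the rows the delete would remove.
SELECT COUNT(*) AS rows_to_delete
  FROM (SELECT rowid FROM emp
        MINUS
        SELECT MIN(rowid) FROM emp
         GROUP BY empno, ename, job);
```

On the test data (30040 rows in 4 groups), this count should be 30036, matching the "30036 rows deleted" message in the transcript.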

MIN() allows you to select one row per group (duplicates and non-duplicates alike), so that you
get a list of all the rows you want to keep:

SELECT MIN(ID) AS ID, LastName, FirstName
FROM Customers
GROUP BY LastName, FirstName;

Listing 5 shows the output of the above code.

Now you just need to delete rows that are not in this list, using the last query as a subquery inside
an antijoin (the NOT IN clause):

DELETE FROM Customers
WHERE ID NOT IN
  (SELECT MIN(ID)
   FROM Customers
   GROUP BY LastName, FirstName);

However, an antijoin query with the NOT IN clause is an inefficient way to do this. In our case,
two (!) full table scans need to be performed to resolve this SQL statement, which leads to a
substantial performance loss on big data sets. For performance testing I created a Customers
data set with 500,000 rows and 45,000 duplicates (9 percent of the total). The above command
ran for more than one hour with no results, except that it exhausted my patience, so I killed the
process.
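
One way to check the table-scan claim on your own system is to ask the optimizer for its plan rather than running the statement. This is a hedged sketch: it assumes a release that ships the DBMS_XPLAN package and an accessible PLAN_TABLE, and the plan you see depends on your version, indexes, and statistics, so it may differ from what the author observed.

```sql
-- Record the execution plan for the NOT IN delete without executing it.
EXPLAIN PLAN FOR
DELETE FROM Customers
 WHERE ID NOT IN
       (SELECT MIN(ID)
          FROM Customers
         GROUP BY LastName, FirstName);

-- Display the plan that was just recorded.
SELECT * FROM TABLE(DBMS_XPLAN.DISPLAY);
```

Each access to Customers appears as its own line in the plan output, so two full scans would show up as two TABLE ACCESS FULL operations.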

Another disadvantage of this syntax is that you can't control which row in each group of duplicates
is kept in the database.

A PL/SQL Solution: Deleting Duplicate Data with a Stored Procedure

Let me give you an example of a PL/SQL stored procedure, called DeleteDuplicates (see Listing
6), that cleans up duplicates. The algorithm for this procedure is pretty straightforward:

1. It selects the duplicate data in the cursor, sorted by duplicate key (LastName, FirstName
in our case), as shown in Listing 4.

2. It opens the cursor and fetches each row, one by one, in a loop.

3. It compares the duplicate key value with the previously fetched one.

4. If this is the first fetch, or the value is different, then that's the first row in a new group, so it
skips it and fetches the next row. Otherwise, it's a duplicate row within the same group,
so it deletes it.
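
Listing 6 is not reproduced here, so the following is only a sketch of how the four steps above could be written, not the author's actual procedure. The names DeleteDuplicates, csr_Duplicates, Customers, ID, LastName, and FirstName come from the text; the variable names, the use of rowid to identify the row to delete, the ORDER BY ID tie-breaker, and the final COMMIT are assumptions. It also assumes the key columns are never NULL.

```sql
CREATE OR REPLACE PROCEDURE DeleteDuplicates IS
  -- Step 1: all rows that belong to a duplicate group, sorted by the
  -- duplicate key. ORDER BY ID (an assumption) decides which row comes
  -- first in each group and is therefore kept.
  CURSOR csr_Duplicates IS
    SELECT c.rowid AS rid, c.LastName, c.FirstName
      FROM Customers c
     WHERE (c.LastName, c.FirstName) IN
           (SELECT LastName, FirstName
              FROM Customers
             GROUP BY LastName, FirstName
            HAVING COUNT(*) > 1)
     ORDER BY c.LastName, c.FirstName, c.ID;

  v_first     BOOLEAN := TRUE;              -- TRUE until the first fetch
  v_prevLast  Customers.LastName%TYPE;      -- key of the previous row
  v_prevFirst Customers.FirstName%TYPE;
BEGIN
  -- Step 2: the cursor FOR loop opens the cursor and fetches row by row.
  FOR rec IN csr_Duplicates LOOP
    -- Step 3: compare the key with the previously fetched one.
    IF v_first
       OR rec.LastName <> v_prevLast
       OR rec.FirstName <> v_prevFirst THEN
      -- Step 4a: first row of a new group, keep it and remember its key.
      v_first     := FALSE;
      v_prevLast  := rec.LastName;
      v_prevFirst := rec.FirstName;
    ELSE
      -- Step 4b: same key as the previous row, so this is a duplicate.
      DELETE FROM Customers WHERE rowid = rec.rid;
    END IF;
  END LOOP;
  COMMIT;
END DeleteDuplicates;
/
```

Changing the last ORDER BY column (for example to c.ID DESC) changes which row of each group survives, which is the control the NOT IN form lacks.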

Let's run the stored procedure and check it against the Customers data:

BEGIN
DeleteDuplicates;
END;
/

SELECT LastName, FirstName, COUNT(*)
FROM Customers
GROUP BY LastName, FirstName
HAVING COUNT(*) > 1;

The last SELECT statement returns no rows, so the duplicates are gone.

The main job of extracting duplicates in this procedure is done by a SQL statement, which is
defined in the csr_Duplicates cursor. The PL/SQL procedural code is used only to implement the
logic of deleting all rows in the group except the first one. Could it all be done by one SQL
statement?
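
One possible single-statement form, offered as an illustration and not necessarily the answer the original article goes on to give, uses the ROW_NUMBER analytic function. It assumes the same Customers table with ID, LastName, and FirstName columns, and a release that supports analytic functions.

```sql
-- Number the rows inside each (LastName, FirstName) group, lowest ID first,
-- then delete every row that is not number 1 in its group.
DELETE FROM Customers
 WHERE rowid IN
       (SELECT rid
          FROM (SELECT rowid AS rid,
                       ROW_NUMBER() OVER (PARTITION BY LastName, FirstName
                                          ORDER BY ID) AS rn
                  FROM Customers)
         WHERE rn > 1);
```

The ORDER BY inside the OVER clause decides which row is kept: ORDER BY ID keeps the lowest ID in each group, and ORDER BY ID DESC would keep the highest.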
