


How Can I Efficiently Delete Duplicate Rows in PostgreSQL While Preserving a Single Instance?
Jan 06, 2025 am 10:04 AMPreserving Unique Instances in Duplicate Row Deletion
When working with large datasets, it is sometimes necessary to eliminate duplicate rows. However, in certain scenarios, it may be desirable to retain a single copy of each duplicate row. In such cases, a targeted approach is required to perform selective deletion.
Understanding the Problem
In PostgreSQL, the situation described involves deleting all but one instance of a set of duplicate rows. For example, if there are five records with the same values, the goal is to delete four of them, leaving one intact.
Finding a Solution
A comprehensive explanation of this issue is provided in the article "Removing duplicates from a PostgreSQL database." The authors address the specific challenge of dealing with vast amounts of data that cannot be grouped effectively.
A Simple Solution
The article recommends a straightforward solution:
DELETE FROM foo WHERE id NOT IN (SELECT min(id) --or max(id) FROM foo GROUP BY hash)
In this query, "hash" represents the field or combination of fields that is being used to determine duplicates. By using either the minimum or maximum value of the "id" field for each duplicate group, one instance can be preserved.
This targeted approach allows for the efficient deletion of duplicate rows while maintaining a single copy for reference or analysis.
The above is the detailed content of How Can I Efficiently Delete Duplicate Rows in PostgreSQL While Preserving a Single Instance?. For more information, please follow other related articles on the PHP Chinese website!

Hot Article

Hot tools Tags

Hot Article

Hot Article Tags

Notepad++7.3.1
Easy-to-use and free code editor

SublimeText3 Chinese version
Chinese version, very easy to use

Zend Studio 13.0.1
Powerful PHP integrated development environment

Dreamweaver CS6
Visual web development tools

SublimeText3 Mac version
God-level code editing software (SublimeText3)

Hot Topics

Reduce the use of MySQL memory in Docker

How do you alter a table in MySQL using the ALTER TABLE statement?

How to solve the problem of mysql cannot open shared library

Run MySQl in Linux (with/without podman container with phpmyadmin)

What is SQLite? Comprehensive overview

Running multiple MySQL versions on MacOS: A step-by-step guide

How do I configure SSL/TLS encryption for MySQL connections?

How do I secure MySQL against common vulnerabilities (SQL injection, brute-force attacks)?
