Oracle 查询与删除表中的重复记录sql语句
不管我使用什么数据库都碰到有重复数据的存在了,下面我来给大这详细介绍Oracle 删除表中的重复记录的各种方法与性能分析,有需要了解的朋友可参考。
不管我使用什么数据库都碰到有重复数据的存在了,下面我来给大这详细介绍Oracle 删除表中的重复记录的各种方法与性能分析,有需要了解的朋友可参考。方法:
代码如下 | 复制代码 |
XX having count(*)>1,rowid,distinct,temporary table,procedure |
下面语句可以查询出那些数据是重复的:
代码如下 | 复制代码 |
字段1,字段2,count(*) from 表名 group by 字段1,字段2 having count(*) > 1 |
将上面的>号改为=号就可以查询出没有重复的数据了。
想要删除这些重复的数据,可以使用下面语句进行删除
代码如下 | 复制代码 |
delete from 表名 a where 字段1,字段2 in |
例
1、查询表中的重复数据
a.重复一个字段
b.重复多个字段
c.重复一整行
创建测试表:
代码如下 | 复制代码 |
create table cfa (businessid number,customer varchar2(50),branchcode varchar2(10),data_date varchar2(10)); ---------------以下为重复数据---------------------------------------------- |
对于a的情况,只有businessid重复
代码如下 | 复制代码 |
select * from cfa where businessid in (select businessid from cfa group by businessid having count(businessid)>1); |
如果是b的情况,businessid 和name同时存在重复
代码如下 | 复制代码 |
select * from cfa where (businessid,customer) in (select businessid,customer from cfa group by businessid,customer having count(*)>1); |
对于c的情况,重复一整行
参考b的方法:
代码如下 | 复制代码 |
select * from cfa where (businessid,customer,branchcode,data_date) in (select * from cfa group by businessid,customer,branchcode,data_date having count(*)>1); |
2、删除表中的重复数据
a情况,删除表中多余的重复记录,重复记录是根据单个字段(businessid)来判断,只留有rowid最小的记录
也可以只保留rowid不是最小记录,需要把代码中的min改为max这里不再赘述。
代码如下 | 复制代码 |
delete from cfa |
或者,使用下面更简单高效的语句
代码如下 | 复制代码 |
DELETE FROM cfa t |
b情况,删除表中多余的重复记录(多个字段),只留有rowid最小的记录
代码如下 | 复制代码 |
delete from cfa |
或者,使用下面更简单高效的语句
代码如下 | 复制代码 |
DELETE FROM cfa t |
上面的语句非常简单,就是将查询到的数据删除掉。不过这种删除执行的效率非常低,对于大数据量来说,可能会将吊死。所以我建议先将查询到的重复的数据插入到一个临时表中,然后对进行删除,这样,执行删除的时候就不用再进行一次查询了。如下:
CREATE TABLE 临时表 AS
代码如下 | 复制代码 |
(select 字段1,字段2,count(*) from 表名 group by 字段1,字段2 having count(*) > 1) |
上面这句话就是建立了临时表,并将查询到的数据插入其中。
下面就可以进行这样的删除操作了:
代码如下 | 复制代码 |
delete from 表名 a where 字段1,字段2 in (select 字段1,字段2 from 临时表); |
这种先建临时表再进行删除的操作要比直接用一条语句进行删除要高效得多。
c情况,这种情况就比较简单,使用临时表方法
代码如下 | 复制代码 |
create table cfabak as select distinct * from cfa; truncate table cfa;--如果是生产最好对该表backup Insert into cfa select * from cfabak; commit; |
我们要删除重复数据,只保留最新的一条数据,就可以这样写了:
代码如下 | 复制代码 |
delete from 表名 a |

Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

AI Hentai Generator
Generate AI Hentai for free.

Hot Article

Hot Tools

Notepad++7.3.1
Easy-to-use and free code editor

SublimeText3 Chinese version
Chinese version, very easy to use

Zend Studio 13.0.1
Powerful PHP integrated development environment

Dreamweaver CS6
Visual web development tools

SublimeText3 Mac version
God-level code editing software (SublimeText3)

Hot Topics

The retention period of Oracle database logs depends on the log type and configuration, including: Redo logs: determined by the maximum size configured with the "LOG_ARCHIVE_DEST" parameter. Archived redo logs: Determined by the maximum size configured by the "DB_RECOVERY_FILE_DEST_SIZE" parameter. Online redo logs: not archived, lost when the database is restarted, and the retention period is consistent with the instance running time. Audit log: Configured by the "AUDIT_TRAIL" parameter, retained for 30 days by default.

The function in Oracle to calculate the number of days between two dates is DATEDIFF(). The specific usage is as follows: Specify the time interval unit: interval (such as day, month, year) Specify two date values: date1 and date2DATEDIFF(interval, date1, date2) Return the difference in days

The Oracle database startup sequence is: 1. Check the preconditions; 2. Start the listener; 3. Start the database instance; 4. Wait for the database to open; 5. Connect to the database; 6. Verify the database status; 7. Enable the service (if necessary ); 8. Test the connection.

The amount of memory required by Oracle depends on database size, activity level, and required performance level: for storing data buffers, index buffers, executing SQL statements, and managing the data dictionary cache. The exact amount is affected by database size, activity level, and required performance level. Best practices include setting the appropriate SGA size, sizing SGA components, using AMM, and monitoring memory usage.

The INTERVAL data type in Oracle is used to represent time intervals. The syntax is INTERVAL <precision> <unit>. You can use addition, subtraction, multiplication and division operations to operate INTERVAL, which is suitable for scenarios such as storing time data and calculating date differences.

To find the number of occurrences of a character in Oracle, perform the following steps: Get the total length of a string; Get the length of the substring in which a character occurs; Count the number of occurrences of a character by subtracting the substring length from the total length.

The method of replacing strings in Oracle is to use the REPLACE function. The syntax of this function is: REPLACE(string, search_string, replace_string). Usage steps: 1. Identify the substring to be replaced; 2. Determine the new string to replace the substring; 3. Use the REPLACE function to replace. Advanced usage includes: multiple replacements, case sensitivity, special character replacement, etc.

Oracle database server hardware configuration requirements: Processor: multi-core, with a main frequency of at least 2.5 GHz. For large databases, 32 cores or more are recommended. Memory: At least 8GB for small databases, 16-64GB for medium sizes, up to 512GB or more for large databases or heavy workloads. Storage: SSD or NVMe disks, RAID arrays for redundancy and performance. Network: High-speed network (10GbE or higher), dedicated network card, low-latency network. Others: Stable power supply, redundant components, compatible operating system and software, heat dissipation and cooling system.
