MySQL deduplication methods
This article describes several ways to remove duplicate rows from a MySQL table, from the simplest case to the hardest.
[Elementary] Very few duplicate rows
Use DISTINCT (or GROUP BY with HAVING COUNT(*) > 1) to find them, then delete them by hand, one at a time.
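As a minimal sketch of the lookup step, the snippet below lists the id values that occur more than once, using GROUP BY with HAVING. It runs against an in-memory SQLite database from Python's standard library, used here only as a stand-in for a MySQL server. The sample rows and the helper name duplicate_ids are assumptions made for illustration.

```python
import sqlite3


def duplicate_ids(conn):
    """Return (id, count) pairs for every id that occurs more than once."""
    return conn.execute(
        "SELECT id, COUNT(*) FROM a_tmp "
        "GROUP BY id HAVING COUNT(*) > 1 ORDER BY id"
    ).fetchall()


# In-memory SQLite database standing in for MySQL (assumption for this sketch).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE a_tmp (id INTEGER, name TEXT)")
conn.executemany(
    "INSERT INTO a_tmp VALUES (?, ?)",
    [(1, "a"), (1, "b"), (2, "c"), (3, "c")],  # hypothetical sample rows
)

print(duplicate_ids(conn))  # [(1, 2)]
```

With only a handful of ids reported, the extra rows can then be deleted by hand.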
[Intermediate] Deduplication by a single field
For example: deduplicate on the id field.
Usage: find the id values that occur more than once. Among the rows that share an id, compare a field whose values differ, and delete every row except the one with the smallest (or largest) value of that field. The primary key is generally used for this comparison, because its values are guaranteed to be unique.
id  name
1   a
1   b
2   c
2   a
3   c
Result:
id  name
1   a
2   a
3   c
Operation:
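delete from a_tmp
where id in (
        # ids that occur more than once
        select * from (
            select b.id from a_tmp b group by b.id having count(b.id) > 1
        ) bb
    )
  and name not in (
        # smallest name within each duplicated id
        select * from (
            select min(a.name) from a_tmp a group by a.id having count(a.id) > 1
        ) aa
    );
# Caveat: the list of names is not tied to a particular id, so a duplicate row survives
# if its name happens to be the smallest name of another duplicated id.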
Note:
The inner queries must be wrapped in select * from (...) and given an alias (bb and aa above); otherwise MySQL reports an error:
[Err] 1093 - You can't specify target table 'a_tmp' for update in FROM clause
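The single-field case can be sketched end to end as follows. This is a variation rather than the exact statement from the article: it compares (id, name) as a pair, so the name to keep is always matched against its own id. It runs on an in-memory SQLite database as a stand-in for MySQL. The helper name dedupe_by_id and the alias keep are assumptions. The select * from (...) wrapper is retained because MySQL needs it to avoid error 1093, although SQLite does not.

```python
import sqlite3


def dedupe_by_id(conn):
    """Keep, for each id, only the row with the smallest name."""
    conn.execute(
        "DELETE FROM a_tmp WHERE (id, name) NOT IN ("
        "  SELECT * FROM ("
        "    SELECT id, MIN(name) FROM a_tmp GROUP BY id"
        "  ) AS keep"  # derived-table alias, required by MySQL
        ")"
    )


# In-memory SQLite database standing in for MySQL (assumption for this sketch).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE a_tmp (id INTEGER, name TEXT)")
conn.executemany(
    "INSERT INTO a_tmp VALUES (?, ?)",
    [(1, "a"), (1, "b"), (2, "c"), (2, "a"), (3, "c")],  # the article's sample table
)

dedupe_by_id(conn)
remaining = conn.execute("SELECT id, name FROM a_tmp ORDER BY id").fetchall()
print(remaining)  # [(1, 'a'), (2, 'a'), (3, 'c')]
```

The row with id 3 is untouched because it was never duplicated.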
[Advanced] Deduplication by multiple fields
For example: deduplicate on id and name together. Rows with the same id and the same name count as duplicates; rows with the same id but different names do not.
Usage: the same as for a single field. The primary key is generally used for the comparison, because its values are guaranteed to be unique.
id  name  rowid
1   a     1
1   a     2
1   b     3
2   b     4
2   b     5
3   c     6
3   d     7
Result:
id  name  rowid
1   a     1
1   b     3
2   b     4
3   c     6
3   d     7
Operation:
First method:
delete from a_tmp
where (id,name) in (
        select * from (
            select b.id,b.name from a_tmp b group by b.id,b.name having count(b.id) > 1
        ) bb
    )
  and rowid not in (
        select * from (
            select min(a.rowid) from a_tmp a group by a.id,a.name having count(a.id) > 1
        ) aa
    );
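To check the first method against the sample table, the sketch below runs the article's delete statement unchanged on an in-memory SQLite database, used as a stand-in for MySQL. The constant name ARTICLE_DELETE and the choice to declare rowid as the primary key are assumptions for this example.

```python
import sqlite3

# The delete statement from the article, unchanged apart from line breaks.
ARTICLE_DELETE = """
delete from a_tmp
where (id,name) in (
        select * from (
            select b.id,b.name from a_tmp b group by b.id,b.name having count(b.id) > 1
        ) bb
    )
  and rowid not in (
        select * from (
            select min(a.rowid) from a_tmp a group by a.id,a.name having count(a.id) > 1
        ) aa
    )
"""

# In-memory SQLite database standing in for MySQL (assumption for this sketch).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE a_tmp (id INTEGER, name TEXT, rowid INTEGER PRIMARY KEY)")
conn.executemany(
    "INSERT INTO a_tmp (id, name, rowid) VALUES (?, ?, ?)",
    [(1, "a", 1), (1, "a", 2), (1, "b", 3), (2, "b", 4),
     (2, "b", 5), (3, "c", 6), (3, "d", 7)],  # the article's sample table
)

conn.execute(ARTICLE_DELETE)
remaining = conn.execute("SELECT id, name, rowid FROM a_tmp ORDER BY rowid").fetchall()
print(remaining)
# [(1, 'a', 1), (1, 'b', 3), (2, 'b', 4), (3, 'c', 6), (3, 'd', 7)]
```

Because rowid is unique, excluding the smallest rowid of each duplicated (id, name) group removes exactly the extra copies.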
Second method:
Concatenate the values of the id and name fields and insert them into a temporary table b_tmp, so that the [Intermediate] single-field method can be applied.
# Insert the concatenated id and name values, together with the unique rowid, from a_tmp into b_tmp
# (a separator, e.g. concat_ws('|',id,name), avoids collisions such as id=1,name='1a' and id=11,name='a')
insert into b_tmp select concat(id,name),rowid from a_tmp;
# Find the rows that need to be kept
select id_name,max(rowid) from b_tmp group by id_name having count(id_name)>1;
# Use the [Intermediate] method, or a stored procedure, to finish the deduplication
[Ultimate] Every row has an exact duplicate
For example: every field of a row is repeated in another row.
Usage: when entire rows are identical, a SQL statement cannot delete just one of them, because no condition distinguishes the row to keep from its copies. Since no field differs, create one: add a new column, make it auto-increment, and set it as the primary key. MySQL then fills in increasing values automatically.
id  name
1   a
1   a
1   b
1   b
2   c
2   c
3   c
3   c
Result:
id  name  rowid
1   a     1
1   b     3
2   c     5
3   c     7
Operation:
Add an auto-increment column and temporarily set it as the primary key.
Then apply the [Intermediate] or [Advanced] method above.
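A rough sketch of these two steps follows, again on an in-memory SQLite database as a stand-in for MySQL. SQLite cannot add an auto-increment primary key to an existing table, so the sketch copies the rows into a hypothetical table a_keyed that has one. On MySQL the column would normally be added in place, for example with ALTER TABLE a_tmp ADD COLUMN rowid INT NOT NULL AUTO_INCREMENT PRIMARY KEY. The table name a_keyed and the alias keep are assumptions.

```python
import sqlite3

# In-memory SQLite database standing in for MySQL (assumption for this sketch).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE a_tmp (id INTEGER, name TEXT)")
conn.executemany(
    "INSERT INTO a_tmp VALUES (?, ?)",
    [(1, "a"), (1, "a"), (1, "b"), (1, "b"),
     (2, "c"), (2, "c"), (3, "c"), (3, "c")],  # the article's sample table
)

# Step 1: give every row a distinguishing value.
# a_keyed is a hypothetical copy of a_tmp with an auto-increment key column.
conn.execute(
    "CREATE TABLE a_keyed ("
    "  id INTEGER, name TEXT,"
    "  rowid INTEGER PRIMARY KEY AUTOINCREMENT)"
)
conn.execute(
    "INSERT INTO a_keyed (id, name) SELECT id, name FROM a_tmp ORDER BY id, name"
)

# Step 2: multi-field deduplication, keeping the smallest rowid of each group.
conn.execute(
    "DELETE FROM a_keyed WHERE rowid NOT IN ("
    "  SELECT * FROM ("
    "    SELECT MIN(rowid) FROM a_keyed GROUP BY id, name"
    "  ) AS keep"  # derived-table alias, required by MySQL
    ")"
)

result = conn.execute("SELECT id, name, rowid FROM a_keyed ORDER BY rowid").fetchall()
print(result)
```

Once the duplicates are gone, the temporary key column can be dropped again if the table should keep its original shape.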
