How to use distinct and group by in MySQL-Mysql Tutorial-php.cn

Table of Contents

Usage of distinct

For example:

For basic deduplication, the use of

Syntax:

The syntax difference between the two is that

In most examples,

For implicit sorting, we can refer to MySQL’s official explanation:

Home

Database

Mysql Tutorial

How to use distinct and group by in MySQL

王林

May 26, 2023 am 10:34 AM

mysql distinct group&amp;amp;amp;nbsp;by

Let’s talk about the general conclusion first:

When the semantics are the same and there is an index: group by## Both # and distinct can use indexes with the same efficiency.
With the same semantics and no index:
distinct is more efficient than group by. The reason is that both distinct and group by will perform grouping operations, but group by may perform sorting and trigger filesort, resulting in low SQL execution efficiency.

Based on this conclusion, you may ask:

Why
group by# when the semantics are the same and there is an index? ## and distinct have the same efficiency?
group by
perform a sorting operation?

distinct

and group by. Usage of distinct

Usage of distinct

SELECT DISTINCT columns FROM table_name WHERE where_conditions;

Copy after login

For example:

mysql> select distinct age from student;
+------+
| age  |
+------+
|   10 |
|   12 |
|   11 |
| NULL |
+------+
4 rows in set (0.01 sec)

Copy after login

DISTINCT

The keyword is used to return uniquely different values. It is used before the first field in the query statement and applies to all columns in the main clause. If a column has a NULL value and you use the

DISTINCT

clause on the column, MySQL will retain one NULL value and delete the other NULL values because the DISTINCT clause statement treats all NULL values as the same value. distinct Multi-column deduplication

distinct

Multi-column deduplication is performed based on the specified deduplication column information, that is, only all specified column information If they are all the same, it will be considered as duplicate information. <div class="code" style="position:relative; padding:0px; margin:0px;"><pre class='brush:php;toolbar:false;'>SELECT DISTINCT column1,column2 FROM table_name WHERE where_conditions; mysql> select distinct sex,age from student; +--------+------+ | sex | age | +--------+------+ | male | 10 | | female | 12 | | male | 11 | | male | NULL | | female | 11 | +--------+------+ 5 rows in set (0.02 sec)</pre><div class="contentsignin">Copy after login</div></div>Usage of group by

For basic deduplication, the use of

group by

is similar to distinct. Single column deduplication

Syntax:

SELECT columns FROM table_name WHERE where_conditions GROUP BY columns;

Copy after login

Execution:

mysql> select age from student group by age;
+------+
| age  |
+------+
|   10 |
|   12 |
|   11 |
| NULL |
+------+
4 rows in set (0.02 sec)

Copy after login

Multiple column deduplication

Syntax:

SELECT columns FROM table_name WHERE where_conditions GROUP BY columns;

Copy after login

Execution:

mysql> select sex,age from student group by sex,age;
+--------+------+
| sex    | age  |
+--------+------+
| male   |   10 |
| female |   12 |
| male   |   11 |
| male   | NULL |
| female |   11 |
+--------+------+
5 rows in set (0.03 sec)

Copy after login

Difference example

The syntax difference between the two is that

group by

can perform single-column deduplication, and the principle of group by The results are grouped and sorted first, and then the first piece of data in each group is returned. And deduplication is performed based on the fields following group by. For example:

mysql> select sex,age from student group by sex;
+--------+-----+
| sex    | age |
+--------+-----+
| male   |  10 |
| female |  12 |
+--------+-----+
2 rows in set (0.03 sec)

Copy after login

distinct and group by principle

In most examples,

DISTINCT

can be regarded as a special GROUP BY, their implementation is based on grouping operations, and they can all be implemented through loose index scan and compact index scan (the content of index scan will be introduced in detail in other articles, so I will not introduce it in detail here).

DISTINCT

and GROUP BY can both be scanned and searched using indexes. For example, the following two SQLs (just look at the content of the extra at the end of the table), we analyze these two SQLs, we can see that in the extra, these two SQLs use compact index scanningUsing index for group -by. So, in general, for

DISTINCT

GROUP BY

, before MYSQL8.0, GROUP Y will be implicitly sorted by fields by default. As you can see, the following sql statement uses a temporary table and also performs filesort.

mysql> explain select int6_bigger_random from test_distinct_groupby GROUP BY int6_bigger_random;
+----+-------------+-----------------------+------------+------+---------------+------+---------+------+-------+----------+---------------------------------+
| id | select_type | table                 | partitions | type | possible_keys | key  | key_len | ref  | rows  | filtered | Extra                           |
+----+-------------+-----------------------+------------+------+---------------+------+---------+------+-------+----------+---------------------------------+
|  1 | SIMPLE      | test_distinct_groupby | NULL       | ALL  | NULL          | NULL | NULL    | NULL | 97402 |   100.00 | Using temporary; Using filesort |
+----+-------------+-----------------------+------------+------+---------------+------+---------+------+-------+----------+---------------------------------+
1 row in set (0.04 sec)

Copy after login

Implicit sorting

For implicit sorting, we can refer to MySQL’s official explanation:

https://dev.mysql.com/doc/refman/5.7 /en/order-by-optimization.html

GROUP BY implicitly sorts by default (that is, in the absence of ASC or DESC designators for GROUP BY columns). However, relying on implicit GROUP BY sorting (that is, sorting in the absence of ASC or DESC designators) or explicit sorting for GROUP BY (that is, by using explicit ASC or DESC designators for GROUP BY columns) is deprecated. To produce a given sort order, provide an ORDER BY clause.

Broad explanation:

GROUP BY defaults to implicit sorting (meaning that sorting will also be performed even if the GROUP BY column does not have an ASC or DESC indicator). However, GROUP BY for explicit or implicit sorting is deprecated. To generate a given sort order, provide an ORDER BY clause.

So, before MySQL8.0,

GROUP BY

will sort the results according to the effect field (the subsequent field of GROUP BY) by default. When the index can be used, GROUP BY does not require additional sorting operations; but when the index cannot be used for sorting, the MySQL optimizer has to choose to use a temporary table and then sort itGROUP BY. And when the size of the result set exceeds the temporary table size set by the system, MySQL will copy the temporary table data to the disk before operating, and the execution efficiency of the statement will become extremely low. This is why MySQL has chosen to deprecate this operation (implicit sorting).

Based on the above reasons, Mysql has optimized and updated this in 8.0:

https://dev.mysql.com/doc/refman/8.0/en/order-by-optimization.html

Previously (MySQL 5.7 and lower), GROUP BY sorted implicitly under certain conditions. In MySQL 8.0, that no longer occurs, so specifying ORDER BY NULL at the end to suppress implicit sorting (as was done previously) is no longer necessary. However, query results may differ from previous MySQL versions. To produce a given sort order, provide an ORDER BY clause.

A rough explanation:

In the past (before MySQL5.7 version), Group by would perform implicit sorting based on certain conditions. In MySQL 8.0, this functionality has been removed, so it is no longer necessary to disable implicit ordering by adding order by null, however, query results may differ from previous MySQL versions. To produce results in a given order, specify the fields to be sorted by ORDER BY.

Therefore, our conclusion also comes out:

In the case of the same semantics and index: group by and distinct Both can use indexes and have the same efficiency. Because group by and distinct are nearly equivalent, distinct can be regarded as a special group by.
In the case of the same semantics and no index: distinct is more efficient than group by. The reason is that both distinct and group by will perform grouping operations, but group by will perform implicit sorting before MySQL8.0, causing filesort to be triggered and sql execution efficiency low. However, starting from MySQL8.0, MySQL has deleted the implicit sorting. Therefore, at this time, with the same semantics and no index, the execution efficiency of group by and distinct is almost the same. equivalent.

Compared with distinct, group by has clear semantics. And since the distinct keyword will take effect on all fields, group by is more flexible when performing composite business processing. group by can update the data according to the grouping situation. For complex processing, such as filtering data through having, or operating on data through aggregate functions.

The above is the detailed content of How to use distinct and group by in MySQL. For more information, please follow other related articles on the PHP Chinese website!

Statement of this Website

The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress images for free

Clothoff.io

AI clothes remover

AI Hentai Generator

Generate AI Hentai for free.

Hot Article

R.E.P.O. Energy Crystals Explained and What They Do (Yellow Crystal)

4 weeks ago By 尊渡假赌尊渡假赌尊渡假赌

R.E.P.O. Best Graphic Settings

4 weeks ago By 尊渡假赌尊渡假赌尊渡假赌

Assassin's Creed Shadows: Seashell Riddle Solution

2 weeks ago By DDD

R.E.P.O. How to Fix Audio if You Can't Hear Anyone

4 weeks ago By 尊渡假赌尊渡假赌尊渡假赌

WWE 2K25: How To Unlock Everything In MyRise

1 months ago By 尊渡假赌尊渡假赌尊渡假赌

Hot Tools

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

God-level code editing software (SublimeText3)

Hot Topics

Where is the login entrance for gmail email?

7514

CakePHP Tutorial

1378

What is the format of the account name of steam

win11 activation key permanent

nyt connections hints and answers

Related knowledge

MySQL: The Ease of Data Management for Beginners Apr 09, 2025 am 12:07 AM

MySQL is suitable for beginners because it is simple to install, powerful and easy to manage data. 1. Simple installation and configuration, suitable for a variety of operating systems. 2. Support basic operations such as creating databases and tables, inserting, querying, updating and deleting data. 3. Provide advanced functions such as JOIN operations and subqueries. 4. Performance can be improved through indexing, query optimization and table partitioning. 5. Support backup, recovery and security measures to ensure data security and consistency.

MySQL: Simple Concepts for Easy Learning Apr 10, 2025 am 09:29 AM

MySQL is an open source relational database management system. 1) Create database and tables: Use the CREATEDATABASE and CREATETABLE commands. 2) Basic operations: INSERT, UPDATE, DELETE and SELECT. 3) Advanced operations: JOIN, subquery and transaction processing. 4) Debugging skills: Check syntax, data type and permissions. 5) Optimization suggestions: Use indexes, avoid SELECT* and use transactions.

How to open phpmyadmin Apr 10, 2025 pm 10:51 PM

You can open phpMyAdmin through the following steps: 1. Log in to the website control panel; 2. Find and click the phpMyAdmin icon; 3. Enter MySQL credentials; 4. Click "Login".

How to create navicat premium Apr 09, 2025 am 07:09 AM

Create a database using Navicat Premium: Connect to the database server and enter the connection parameters. Right-click on the server and select Create Database. Enter the name of the new database and the specified character set and collation. Connect to the new database and create the table in the Object Browser. Right-click on the table and select Insert Data to insert the data.

MySQL and SQL: Essential Skills for Developers Apr 10, 2025 am 09:30 AM

MySQL and SQL are essential skills for developers. 1.MySQL is an open source relational database management system, and SQL is the standard language used to manage and operate databases. 2.MySQL supports multiple storage engines through efficient data storage and retrieval functions, and SQL completes complex data operations through simple statements. 3. Examples of usage include basic queries and advanced queries, such as filtering and sorting by condition. 4. Common errors include syntax errors and performance issues, which can be optimized by checking SQL statements and using EXPLAIN commands. 5. Performance optimization techniques include using indexes, avoiding full table scanning, optimizing JOIN operations and improving code readability.

How to create a new connection to mysql in navicat Apr 09, 2025 am 07:21 AM

You can create a new MySQL connection in Navicat by following the steps: Open the application and select New Connection (Ctrl N). Select "MySQL" as the connection type. Enter the hostname/IP address, port, username, and password. (Optional) Configure advanced options. Save the connection and enter the connection name.

How to recover data after SQL deletes rows Apr 09, 2025 pm 12:21 PM

Recovering deleted rows directly from the database is usually impossible unless there is a backup or transaction rollback mechanism. Key point: Transaction rollback: Execute ROLLBACK before the transaction is committed to recover data. Backup: Regular backup of the database can be used to quickly restore data. Database snapshot: You can create a read-only copy of the database and restore the data after the data is deleted accidentally. Use DELETE statement with caution: Check the conditions carefully to avoid accidentally deleting data. Use the WHERE clause: explicitly specify the data to be deleted. Use the test environment: Test before performing a DELETE operation.

How to use single threaded redis Apr 10, 2025 pm 07:12 PM

Redis uses a single threaded architecture to provide high performance, simplicity, and consistency. It utilizes I/O multiplexing, event loops, non-blocking I/O, and shared memory to improve concurrency, but with limitations of concurrency limitations, single point of failure, and unsuitable for write-intensive workloads.

See all articles