group-by - mysql group by原理？

Question

我们知道，在mysql中执行以下语句会报错： {代码...} 会提示#1062 - Duplicate entry '5.6.171' for key 'group_key' ，主键重复了。 group by 实际是将查询到的每列插入到临时表中，然后再排序。那为什么插入包...

阿神 · Answer

"According to this statement, floor(rand(0)*2) may produce 0 or 1. Then every time the above SQL is executed, there should be a 50% chance of successful execution." This should mean that the rand function is pseudo-random. , so the result of each execution of a given seed is the same. You can use select rand(0) from information_schema to verify. After multiple executions, the result is the same.

The execution process of group by is a scanning process, and a temporary table will be created to verify the key. But I'm also thinking about the original poster's problem, and I'm looking forward to experts answering it.

In addition to the problem you described, there is another phenomenon. For different seeds, the situations of success and failure are also different, as shown below.

迷茫 · Answer

select count(*),concat(version(),'-',floor(rand(0)*100000))x
from information_schema.tables 
group by x

Execution result: [Err] 1062 - Duplicate entry '5.5.20-log-95655' for key 'group_key'
Explanation: The result of executing floor(rand(0)*100000) contains multiple items with a value equal to 95655

Proof

select count(*),concat(version(),'-',floor(rand(0)*1000000))x from information_schema.tables group by x
Execution result: