
How to implement MySQL MVCC multi-version concurrency control

WBOY
Release: 2023-05-31 13:14:01

1. Overview

Multi-version concurrency control (MVCC) is a concurrency control technique. InnoDB implements it by maintaining multiple versions of each data row, and the implementation depends on the version chain kept in the undo log.

Put simply, when the current transaction queries rows that another transaction is modifying, reading the latest value directly would be a dirty read. Instead of locking and waiting, the transaction reads a historical version of the data, which shortens response time.

MVCC is implemented through two mechanisms: the undo log and the Read View.

2. Snapshot read and current read

InnoDB implements MVCC mainly to improve the database's concurrent performance and to handle read-write conflicts in a better way: even when a read-write conflict exists, reads can proceed without locking and without blocking. Such reads are snapshot reads, not current reads; a current read is in fact a locking operation.

1. Current read

To guarantee that the record read is the latest version and to prevent other concurrent transactions from modifying it, the record must be locked while it is read.

A locking read is called a current read. Every insert, delete, or update also has to read the data first, and that read is a current read as well.

SELECT * FROM t LOCK IN SHARE MODE; # shared lock
SELECT * FROM t FOR UPDATE; # exclusive lock
UPDATE t SET ...

2. Snapshot read

A snapshot read is also called a consistent read: it reads a snapshot version of the data row. In MySQL, an ordinary SELECT statement (one without FOR UPDATE or LOCK IN SHARE MODE) is a snapshot read by default and takes no locks.

SELECT * FROM t WHERE ...

Snapshot reads are the default because they avoid locking and so reduce overhead.

Under the SERIALIZABLE isolation level, snapshot reads do not apply: they degrade to current reads.

3. Isolation level and version chain review

Isolation level:

MySQL's default isolation level is REPEATABLE READ (RR), which solves the non-repeatable read problem. MySQL additionally provides special support for the phantom read problem.

How does it solve phantom reads? There are two ways:

  • Gap locks and next-key locks. In short, this is locking: while the locks are held, other transactions cannot insert rows into the locked range.

  • MVCC, which needs no locking and costs little (the drawback is that it does not completely solve the phantom read problem).

undo log version chain:

In InnoDB, each record in the clustered index contains two required hidden columns:

  • trx_id: every time a transaction changes a clustered index record, that transaction's id is written to the trx_id hidden column.

  • roll_pointer: the rollback pointer. Every time a row is modified, the old version is written to the undo log, and the new version points to the old one, forming a version chain. The pointer field is called the rollback pointer; following it leads to the data as it was before the modification.

Example:

Suppose a transaction with ID 8 inserts a row. The trx_id hidden column of the new record is then 8.

Assume that two transactions with IDs 10 and 20 then update this record. The statements run in the following order, top to bottom:

Transaction 10                                 Transaction 20
BEGIN;
                                               BEGIN;
UPDATE student SET name='李四' WHERE id=1;
UPDATE student SET name='王五' WHERE id=1;
COMMIT;
                                               UPDATE student SET name='赵六' WHERE id=1;
                                               UPDATE student SET name='钱七' WHERE id=1;
                                               COMMIT;

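Each modification generates an undo log entry, and the entries are linked to one another to form the version chain. For the row above, the chain from newest to oldest is:

钱七 (trx_id 20) -> 赵六 (trx_id 20) -> 王五 (trx_id 10) -> 李四 (trx_id 10) -> original row (trx_id 8)

Each version also records the id of the transaction that generated it.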
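
The version chain can be pictured as a singly linked list. The following is only a minimal Python sketch of that idea, not InnoDB's actual storage format: the class RowVersion and the helpers update and chain are hypothetical names, and the placeholder name "original" stands in for whatever transaction 8 inserted.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class RowVersion:
    """One version of a row; the fields mirror the two hidden columns."""
    name: str
    trx_id: int                                   # transaction that wrote this version
    roll_pointer: Optional["RowVersion"] = None   # previous version (in the undo log)


def update(current: RowVersion, new_name: str, trx_id: int) -> RowVersion:
    """Create the new latest version; the old one stays reachable via roll_pointer."""
    return RowVersion(name=new_name, trx_id=trx_id, roll_pointer=current)


def chain(latest: RowVersion) -> list:
    """Walk from the latest version back to the oldest one."""
    versions = []
    node = latest
    while node is not None:
        versions.append((node.name, node.trx_id))
        node = node.roll_pointer
    return versions


# Transaction 8 inserts the row; transactions 10 and 20 each update it twice.
row = RowVersion(name="original", trx_id=8)
for new_name, trx in [("李四", 10), ("王五", 10), ("赵六", 20), ("钱七", 20)]:
    row = update(row, new_name, trx)

print(chain(row))
```

Every update prepends a version, so the newest version is always the head of the chain and older versions are found by following roll_pointer.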

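4. Read View

With the undo log we can read the historical versions of a record, but under what circumstances should which version be read? This is where the Read View comes in: it solves the row visibility problem.

A Read View is the view a transaction generates when it performs a snapshot read under MVCC. It is a snapshot of the ids of all transactions that are active (not yet committed) in the database at that moment.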

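1. Implementation principle

Of the four isolation levels, READ UNCOMMITTED and SERIALIZABLE do not use MVCC: READ UNCOMMITTED simply reads the latest version of the data, and SERIALIZABLE reads with locks.

READ COMMITTED and REPEATABLE READ must both guarantee that the data read was committed by other transactions. So when another transaction has modified a record but not yet committed, we cannot read that modification, but we can read a historical version of the record through MVCC. The core problem is deciding which version in the version chain is visible to the current transaction, and that is the problem the Read View solves.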

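A Read View contains four important fields:

  • creator_trx_id: the id of the transaction that created this Read View. Each Read View belongs to exactly one transaction.

A transaction is assigned a transaction id only when it modifies a record in a table; a transaction that only reads has the default id 0.

  • trx_ids: the list of ids of the transactions that were active in the system when the Read View was generated. Committed transactions are not in it.

  • up_limit_id: the smallest id among the active transactions.

  • low_limit_id: the id that the system would assign to the next transaction at the moment the Read View is generated, that is, one more than the largest transaction id assigned so far.

Note that low_limit_id is not derived from the largest value in trx_ids: transaction ids are assigned in increasing order, and the most recent transaction may already have committed. For example, suppose there are three transactions with ids 1, 2, and 5, and the transaction with id 5 has committed. When a new read transaction generates its Read View, trx_ids contains 1 and 2, up_limit_id is 1, and low_limit_id is 6.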
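
To make the four fields concrete, here is a minimal Python sketch that builds a Read View for the example with transactions 1, 2, and 5. The ReadView class and the make_read_view helper are hypothetical names for illustration; the fallback used when no transaction is active is an assumption of this sketch.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class ReadView:
    creator_trx_id: int        # 0 for a transaction that has only read so far
    trx_ids: Tuple[int, ...]   # ids active (uncommitted) when the view was created
    up_limit_id: int           # smallest active id
    low_limit_id: int          # id the next new transaction would receive


def make_read_view(creator_trx_id, active_trx_ids, next_trx_id):
    """Snapshot the set of active transactions into a Read View."""
    active = tuple(sorted(active_trx_ids))
    # Assumption of this sketch: with no active transactions,
    # up_limit_id falls back to low_limit_id.
    up_limit_id = active[0] if active else next_trx_id
    return ReadView(creator_trx_id, active, up_limit_id, next_trx_id)


# Transactions 1, 2 and 5 were started; 5 has committed, so only 1 and 2 are
# active, and the next id to hand out is 6.
view = make_read_view(creator_trx_id=0, active_trx_ids={1, 2}, next_trx_id=6)
print(view)
```

Here low_limit_id is 6 even though the largest id in trx_ids is 2, which is the point of the note above.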


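2. Read View rules

[Figure: version chain]

When a transaction that has a Read View accesses a record, it follows the steps below to decide which version of the record is visible:

  • If the version's trx_id equals the Read View's creator_trx_id, the version was written by the current transaction itself, so the current transaction can access it.

  • If the version's trx_id is less than the Read View's up_limit_id, the transaction that wrote it had already committed when the current transaction generated its Read View, so the current transaction can read this version.

  • If the version's trx_id is greater than or equal to low_limit_id, the transaction that wrote it started after the current transaction generated its Read View, so the current transaction cannot access this version.

  • If the version's trx_id lies between up_limit_id and low_limit_id, check whether it appears in the trx_ids list. The version can be accessed only if it does not appear there; otherwise it cannot.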
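
The four rules above translate almost line for line into a small function. This is a minimal Python sketch; the function name is_visible and its parameter layout are illustrative, not InnoDB's API.

```python
def is_visible(version_trx_id, creator_trx_id, trx_ids, up_limit_id, low_limit_id):
    """Decide whether one row version is visible to the owner of a Read View."""
    if version_trx_id == creator_trx_id:
        return True    # rule 1: written by the reading transaction itself
    if version_trx_id < up_limit_id:
        return True    # rule 2: committed before the Read View was created
    if version_trx_id >= low_limit_id:
        return False   # rule 3: writer started after the Read View was created
    # rule 4: in between, visible only if the writer was no longer active
    return version_trx_id not in trx_ids


# A Read View taken while transactions 10 and 20 are active.
view = dict(creator_trx_id=0, trx_ids=[10, 20], up_limit_id=10, low_limit_id=21)

for trx_id in (8, 10, 15, 20, 21):
    print(trx_id, is_visible(trx_id, **view))
```

With this view, versions written by transactions 8 and 15 are visible (15 lies between the limits but is not in trx_ids, so it had committed), while versions written by 10, 20, and 21 are not.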

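3. Overall flow

With these concepts in place, here is how the system finds a record through MVCC when it is queried:

  • First obtain the transaction's own version number, that is, its transaction id;

  • Obtain the Read View;

  • Fetch the queried data and compare it with the transaction version numbers in the Read View;

  • If the Read View rules are not satisfied, fetch a historical snapshot from the undo log;

  • Finally, return the data that satisfies the rules.

Under READ COMMITTED, every SELECT in a transaction obtains a new Read View, whereas under REPEATABLE READ only the first SELECT generates a Read View and later queries reuse it.

The two levels differ because REPEATABLE READ must guarantee that the same SELECT returns the same content throughout a transaction.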

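5. Examples

1. Under the READ COMMITTED isolation level

Two transactions with ids 10 and 20 are currently executing:

-- transaction with id 10
begin;
update t set name='李四' where id=1;
update t set name='王五' where id=1;
-- transaction with id 20
-- (updates rows other than the row with id=1)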

At this moment, the version chain of the record with id 1, from newest to oldest, is:

王五 (trx_id 10) -> 李四 (trx_id 10) -> 张三 (trx_id 8)

Now a new transaction arrives and executes the following:

begin;
select * from t where id=1;
-- transactions 10 and 20 have not committed

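The query returns 张三.

The detailed process is as follows:

  • Before the select statement executes, a Read View is generated: creator_trx_id is 0, trx_ids is [10, 20], up_limit_id is 10, and low_limit_id is 21.

  • The latest version, whose name is 王五, is checked against the rules. Its trx_id is 10, which is in trx_ids, so this version is invisible to the current transaction. The rollback pointer leads to the next version.

  • The next version, whose name is 李四, fails for the same reason.

  • The search continues to the version whose name is 张三. Its trx_id is 8, which is less than up_limit_id, so this version is visible to the current transaction and is the final result.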

Next, the transaction with id 10 commits. Then the transaction with id 20 updates the record:

begin;
-- transaction with id 20
update t set name='赵六' where id=1;
update t set name='钱七' where id=1;

The version chain is now:

钱七 (trx_id 20) -> 赵六 (trx_id 20) -> 王五 (trx_id 10) -> 李四 (trx_id 10) -> 张三 (trx_id 8)

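Back in the transaction that uses READ COMMITTED, querying the record with id 1 again returns the version whose name is 王五. The process is as follows:

  • A new Read View is generated: creator_trx_id is 0, trx_ids is [20], up_limit_id is 20, and low_limit_id is 21.

  • The first two versions have trx_id 20, which is in trx_ids, so they are skipped.

  • The third version has trx_id 10, which is less than 20, so it can be read. The final result is 王五.

Note: under READ COMMITTED, a new Read View is generated before every read.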
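
The READ COMMITTED walk-through can be replayed in a few lines of Python. This is a minimal sketch under stated assumptions: the helper names is_visible, make_view, and snapshot_read are hypothetical, a version chain is modeled as a list of (name, trx_id) pairs ordered newest first, and the reader is read-only, so its creator_trx_id is 0.

```python
def is_visible(trx_id, view):
    """Apply the Read View rules to one version's trx_id."""
    if trx_id == view["creator_trx_id"] or trx_id < view["up_limit_id"]:
        return True
    if trx_id >= view["low_limit_id"]:
        return False
    return trx_id not in view["trx_ids"]


def make_view(active, next_trx_id, creator_trx_id=0):
    """Build a Read View from the currently active transaction ids."""
    active = sorted(active)
    return {
        "creator_trx_id": creator_trx_id,
        "trx_ids": active,
        "up_limit_id": active[0] if active else next_trx_id,
        "low_limit_id": next_trx_id,
    }


def snapshot_read(chain, view):
    """Return the name stored in the newest version that the view can see."""
    for name, trx_id in chain:
        if is_visible(trx_id, view):
            return name
    return None


# First SELECT: transactions 10 and 20 are both active.
chain = [("王五", 10), ("李四", 10), ("张三", 8)]
first = snapshot_read(chain, make_view({10, 20}, next_trx_id=21))

# Transaction 10 commits, then transaction 20 updates the row twice.
chain = [("钱七", 20), ("赵六", 20)] + chain

# READ COMMITTED: the second SELECT builds a fresh Read View, so only 20 is active.
second = snapshot_read(chain, make_view({20}, next_trx_id=21))

print(first, second)   # 张三 王五
```

The two SELECTs return different names inside one transaction, which is exactly the non-repeatable read that READ COMMITTED permits.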

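2. Under the REPEATABLE READ isolation level

Suppose the transactions with ids 10 and 20 are both modifying data, with the same modifications as before, and neither has committed. The current transaction now runs a query.

The steps are:

  • Generate a Read View: creator_trx_id is 0, trx_ids is [10, 20], up_limit_id is 10, and low_limit_id is 21.

  • The versions with trx_id 10 and 20 do not satisfy the rules.

  • The search ends at the historical version whose name is 张三.

The transaction with id 10 then commits.

The current transaction selects the record with id 1 again. The steps are:

  • Because the level is REPEATABLE READ and the first SELECT already generated a Read View, that Read View is reused rather than regenerated.

  • The versions with trx_id 10 and 20 therefore still fail the rules, and the result is still 张三, as REPEATABLE READ requires.

Note: under REPEATABLE READ, every read reuses the Read View generated by the first read.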
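
The difference between the two levels comes down to when the Read View is created. The Python sketch below makes that the only variable. It rests on assumptions: the Reader class and the helper names are hypothetical, and the version chain is assumed to evolve as in the READ COMMITTED example (transaction 20 writes its two versions after transaction 10 commits).

```python
def is_visible(trx_id, view):
    """Apply the Read View rules to one version's trx_id."""
    if trx_id == view["creator_trx_id"] or trx_id < view["up_limit_id"]:
        return True
    if trx_id >= view["low_limit_id"]:
        return False
    return trx_id not in view["trx_ids"]


def make_view(active, next_trx_id, creator_trx_id=0):
    """Build a Read View from the currently active transaction ids."""
    active = sorted(active)
    return {
        "creator_trx_id": creator_trx_id,
        "trx_ids": active,
        "up_limit_id": active[0] if active else next_trx_id,
        "low_limit_id": next_trx_id,
    }


class Reader:
    """A read-only transaction; isolation is "RC" or "RR"."""

    def __init__(self, isolation):
        self.isolation = isolation
        self.view = None

    def select(self, chain, active, next_trx_id):
        # RC rebuilds the view on every SELECT; RR builds it once and reuses it.
        if self.view is None or self.isolation == "RC":
            self.view = make_view(active, next_trx_id)
        for name, trx_id in chain:          # newest version first
            if is_visible(trx_id, self.view):
                return name
        return None


before = [("王五", 10), ("李四", 10), ("张三", 8)]
after = [("钱七", 20), ("赵六", 20)] + before   # 10 committed, 20 then updated

rr, rc = Reader("RR"), Reader("RC")
rr_reads = [rr.select(before, {10, 20}, 21), rr.select(after, {20}, 21)]
rc_reads = [rc.select(before, {10, 20}, 21), rc.select(after, {20}, 21)]

print(rr_reads)   # ['张三', '张三']
print(rc_reads)   # ['张三', '王五']
```

Because the RR reader keeps the view in which transaction 10 is still listed as active, the commit of transaction 10 never becomes visible to it.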

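3. How phantom reads are handled

Suppose the table currently holds one row, with id 1.

The active transactions are 10 and 20.

The current transaction now starts and executes the following SQL:

begin;
select * from student where id>=1;

Before the query runs, a Read View is generated: creator_trx_id=0, trx_ids=[10,20], up_limit_id=10, low_limit_id=21.

Only one row has id greater than or equal to 1, and its trx_id is 8, which is less than up_limit_id, so it can be read.

After that, the transaction with id 10 inserts a row with id 2 and commits.

The current transaction then queries id>=1 again. Because the level is REPEATABLE READ, it reuses the Read View from before.

The scan finds two rows, but the row with id 2 has trx_id 10, which is in the Read View's trx_ids, so that row is invisible to the current transaction and the query still returns a single row.

If the current transaction now tries to insert a row with id 2, the insert fails, because that row already exists even though the snapshot cannot see it. This is why MVCC is said to solve only half of the phantom read problem.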
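
The range-scan case can be sketched the same way. In this minimal Python model, the table is a dict from id to a version chain (newest first). The helper names, the placeholder name "new row", and the duplicate-key check are assumptions of the sketch, standing in for a primary key on id.

```python
def is_visible(trx_id, view):
    """Apply the Read View rules to one version's trx_id."""
    if trx_id == view["creator_trx_id"] or trx_id < view["up_limit_id"]:
        return True
    if trx_id >= view["low_limit_id"]:
        return False
    return trx_id not in view["trx_ids"]


def range_scan(table, min_id, view):
    """Snapshot read of all rows with id >= min_id under the given Read View."""
    rows = []
    for row_id in sorted(table):
        if row_id < min_id:
            continue
        for name, trx_id in table[row_id]:      # newest version first
            if is_visible(trx_id, view):
                rows.append((row_id, name))
                break
    return rows


# Read View created by the first SELECT while transactions 10 and 20 are active.
view = {"creator_trx_id": 0, "trx_ids": [10, 20],
        "up_limit_id": 10, "low_limit_id": 21}

table = {1: [("张三", 8)]}
first = range_scan(table, 1, view)

# Transaction 10 inserts a row with id 2 and commits.
table[2] = [("new row", 10)]

# REPEATABLE READ reuses the same view, so the new row stays invisible.
second = range_scan(table, 1, view)

# An INSERT is a current read: it checks the latest data, not the snapshot.
can_insert_id_2 = 2 not in table

print(first, second, can_insert_id_2)
```

Both scans return only the row with id 1, yet an insert of id 2 would be rejected. The snapshot hides the phantom row from reads but not from writes.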

source:yisu.com