Personally, I think the first-level cache and the second-level cache of mybatis are not a good design. I basically don’t use the first-level cache and the second-level cache at work. Because improper use can cause many problems, so let’s take a look today at what exactly are the problems?
In the previous section, we introduced that Executor will call StatementHandler to execute SQL, which serves as a link between the previous and the following. The design of Executor is a typical decorator pattern. SimpleExecutor and ReuseExecutor are specific implementation classes, while CachingExecutor is a decorator class.
You can see that the specific component implementation class has a parent class BaseExecutor, and this parent class is a typical application of the template pattern. The operations of operating the first-level cache are all in this class, and the specific functions of operating the database are Let the subclasses implement it.
"The second-level cache is a decorator class. When the second-level cache is turned on, CachingExecutor will be used to decorate the specific implementation class, so when querying, you must first query the second-level cache and then Query the first-level cache""So what is the difference between the first-level cache and the second-level cache?"
// BaseExecutor protected PerpetualCache localCache;
The first-level cache is a member variable localCache in BaseExecutor (a simple encapsulation of HashMap), so the first-level cache The life cycle is the same as SqlSession. If you are not familiar with SqlSession, you can compare it to Connection in JDBC programming, which is a session of the database.
「一级缓存和二级缓存key的构建规则是一致的,都是一个CacheKey对象,因为Mybatis中涉及动态SQL等多方面的因素,缓存的key不能仅仅通过String来表示」
当执行sql的如下4个条件都相等时,CacheKey才会相等
「当查询的时候先从缓存中查询,如果查询不到的话再从数据库中查询」
org.apache.ibatis.executor.BaseExecutor#query当使用同一个SqlSession执行更新操作时,会先清空一级缓存。因此一级缓存中内容被使用的概率也很低
「看到美团技术团队上关于一级缓存和二级缓存的一些测试写的挺不错的,就直接引用过来了」
原文地址:https://tech.meituan.com/2018/01/19/mybatis-cache.html 测试代码github地址:https://github.com/kailuncen/mybatis-cache-demo
接下来通过实验,了解MyBatis一级缓存的效果,每个单元测试后都请恢复被修改的数据。
首先是创建示例表student,创建对应的POJO类和增改的方法,具体可以在entity包和mapper包中查看。
CREATE TABLE `student` ( `id` int(11) unsigned NOT NULL AUTO_INCREMENT, `name` varchar(200) COLLATE utf8_bin DEFAULT NULL, `age` tinyint(3) unsigned DEFAULT NULL, PRIMARY KEY (`id`) ) ENGINE=InnoDB AUTO_INCREMENT=4 DEFAULT CHARSET=utf8 COLLATE=utf8_bin;
在以下实验中,id为1的学生名称是凯伦
「实验1」
开启一级缓存,范围为会话级别,调用三次getStudentById,代码如下所示:
执行结果:我们可以看到,只有第一次真正查询了数据库,后续的查询使用了一级缓存。
「实验2」
增加了对数据库的修改操作,验证在一次数据库会话中,如果对数据库发生了修改操作,一级缓存是否会失效。
执行结果:我们可以看到,在修改操作后执行的相同查询,查询了数据库,一级缓存失效。
「实验3」
开启两个SqlSession,在sqlSession1中查询数据,使一级缓存生效,在sqlSession2中更新数据库,验证一级缓存只在数据库会话内部共享。
输出如下sqlSession1和sqlSession2读的时相同的数据,但是都查询了数据库,说明了「一级缓存只在数据库会话层面共享」
sqlSession2更新了id为1的学生的姓名,从凯伦改为了小岑,但sqlSession1之后的查询中,id为1的学生的名字还是凯伦,出现了脏数据,也证明了之前的设想,一级缓存只在数据库会话层面共享
「MyBatis的一级缓存最大范围是SqlSession内部,有多个SqlSession或者分布式的环境下,数据库写操作会引起脏数据,建议设定缓存级别为Statement,即进行如下配置」
<setting name="localCacheScope" value="STATEMENT"/>
原因也很简单,看BaseExecutor的query()方法,当配置成STATEMENT时,每次查询完都会清空缓存「看到这你可能会想,我用mybatis后没设置这个参数啊,好像也没发生脏读的问题啊,其实是因为你和spring整合了」
当mybatis和spring整合后(整合的相关知识后面还有一节)
「当mybatis和spring整合后,未开启事务的情况下,不会有任何问题,因为一级缓存没有生效。当开启事务的情况下,可能会有问题,由于一级缓存的存在,在事务内的查询隔离级别是可重复读,即使你数据库的隔离级别设置的是提交读」
// Configuration protected final Map<String, Cache> caches = new StrictMap<>("Caches collection");
「而二级缓存是Configuration对象的成员变量,因此二级缓存的生命周期是整个应用级别的。并且是基于namespace构建的,一个namesapce构建一个缓存」
「二级缓存不像一级缓存那样查询完直接放入一级缓存,而是要等事务提交时才会将查询出来的数据放到二级缓存中。」
因为如果事务1查出来直接放到二级缓存,此时事务2从二级缓存中拿到了事务1缓存的数据,但是事务1回滚了,此时事务2不就发生了脏读了吗?
「二级缓存的相关配置有如下3个」
「1.mybatis-config.xml」
<settings> <setting name="cacheEnabled" value="true"/> </settings>
这个是二级缓存的总开关,只有当该配置项设置为true时,后面两项的配置才会有效果
从Configuration类的newExecutor方法可以看到,当cacheEnabled为true,就用缓存装饰器装饰一下具体组件实现类,从而让二级缓存生效「2.mapper映射文件中」mapper映射文件中如果配置了
<cache type="" eviction="FIFO" size="512"></cache>
二级缓存的部分配置如上,type就是填写一个全类名,用来指定二级缓存的实现类,这个实现类需要实现Cache接口,默认是PerpetualCache(你可以利用这个属性将mybatis二级缓存和Redis,Memcached等缓存组件整合在一起)
org.apache.ibatis.builder.MapperBuilderAssistant#useNewCache
这个eviction表示缓存清空策略,可填选项如下
选项 | 解释 | 装饰器类 |
---|---|---|
LRU | 最近最少使用的:移除最长时间不被使用的对象 | LruCache |
FIFO | 先进先出:按对象进入缓存的顺序来移除它们 | FifoCache |
SOFT | 软引用:移除基于垃圾回收器状态和软引用规则的对象 | SoftCache |
WEAK | 弱引用:更积极地移除基于垃圾收集器状态和弱引用规则的对象 | WeakCache |
A typical implementation of the decorator pattern, changing the cache clearing strategy is to change the decorator. 「3.
This attribute indicates whether the results generated by the query should be saved in the second-level cache. The default value of the useCache attribute is true, this configuration can subdivide the second-level cache to the statement level
The second level cache is implemented based on namespace, that is, a mapper mapping file uses a cache.
In this experiment, the student name with id 1 is initialized as Dot.
「Experiment 1」
Test the effect of the second-level cache without submitting the transaction. After sqlSession1 queries the data, whether the same query of sqlSession2 will obtain data from the cache. Execution results:We can see that when sqlsession does not call the commit() method, the second-level cache does not play a role.
「Experiment 2」
Test the effect of the second-level cache. When submitting a transaction, after sqlSession1 queries the data, will the same query of sqlSession2 obtain data from the cache? . As can be seen from the picture, the query of sqlsession2 uses cache, and the cache hit rate is 0.5.
「Experiment 3」
Test whether the update operation will refresh the second-level cache under the namespace.
We can see that after sqlSession3 updates the database and submits the transaction, the query under the StudentMapper namespace of sqlsession2 goes to the database and not to the Cache.
「Experiment 4」
验证MyBatis的二级缓存不适应用于映射文件中存在多表查询的情况。
getStudentByIdWithClassInfo的定义如下
通常我们会为每个单表创建单独的映射文件,由于MyBatis的二级缓存是基于namespace的,多表查询语句所在的namspace无法感应到其他namespace中的语句对多表查询中涉及的表进行的修改,引发脏数据问题。
执行结果:
在这个实验中,我们引入了两张新的表,一张class,一张classroom。class中保存了班级的id和班级名,classroom中保存了班级id和学生id。我们在StudentMapper中增加了一个查询方法getStudentByIdWithClassInfo,用于查询学生所在的班级,涉及到多表查询。在ClassMapper中添加了updateClassName,根据班级id更新班级名的操作。
当sqlsession1的studentmapper查询数据后,二级缓存生效。保存在StudentMapper的namespace下的cache中。当sqlSession3的classMapper的updateClassName方法对class表进行更新时,updateClassName不属于StudentMapper的namespace,所以StudentMapper下的cache没有感应到变化,没有刷新缓存。当StudentMapper中同样的查询再次发起时,从缓存中读取了脏数据。
「实验5」
为了解决实验4的问题呢,可以使用Cache ref,让ClassMapper引用StudenMapper命名空间,这样两个映射文件对应的SQL操作都使用的是同一块缓存了。
mapper文件中的配置如下
<cache-ref namespace="mapper.StudentMapper"/>
执行结果:
不过这样做的后果是,缓存的粒度变粗了,多个Mapper namespace下的所有操作都会对缓存使用造成影响。
Mybatis's first-level cache and second-level cache are based on local, which will inevitably occur in a distributed environment Dirty reading.
The second-level cache can centrally manage the cache and avoid dirty reads by implementing the Cache interface, but there is a certain development cost, and when using multi-table queries, dirty data is likely to appear if used improperly
The above is the detailed content of Why are both the first-level and second-level caches of Mybatis not recommended?. For more information, please follow other related articles on the PHP Chinese website!