关于Pacemaker集群配置的版本_PHP教程-PHP Tutorial-php.cn

Table of Contents

关于Pacemaker集群配置的版本

1. 手册说明

2.实际验证

2.1 环境

2.2 基本验证

2.4 分区验证2

3.总结

Home

Backend Development

PHP Tutorial

关于Pacemaker集群配置的版本_PHP教程

WBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWB

Jul 12, 2016 am 08:54 AM

android

关于Pacemaker集群配置的版本

Pacemaker中CIB有一个由admin_epoch, epoch, num_updates组合而成的版本，当有节点加入集群时，根据版本号的大小，取其中版本最大的作为整个集群的统一配置。

admin_epoch, epoch, num_updates这3者中，admin_epoch通常是不会变的，epoch在每次"配置"变更时累加并把num_updates置0，num_updates在每次"状态"变更时累加。"配置"指持久的CIB中configuration节点下的内容，包括cluster属性，node的forever属性，资源属性等。"状态"指node的reboot属性，node死活，资源是否启动等动态的东西。

"状态"通常是可以通过monitor重新获取的（除非RA脚本设计的有问题），但"配置"出错可能会导致集群的故障，所以我们更需要关心epoch的变更以及节点加入后对集群配置的影响。尤其一些支持主从架构的RA脚本会动态修改配置(比如mysql的mysql_REPL_INFO
和pgsql的pgsql-data-status)，一旦配置处于不一致状态可能会导致集群故障。

1. 手册说明

http://clusterlabs.org/doc/en-US/Pacemaker/1.1-plugin/html-single/Pacemaker_Explained/index.html#idm140225199219024

3.2.Configuration Version When a node joins the cluster, the cluster will perform a check to see who has the best configuration based on the fields below. It then asks the node with the highest (admin_epoch,epoch,num_updates) tuple to replace the configuration on all the nodes - which makes setting them, and setting them correctly, very important.

Table3.1.Configuration Version Properties

Field	Description
admin_epoch	Never modified by the cluster. Use this to make the configurations on any inactive nodes obsolete.Never set this value to zero, in such cases the cluster cannot tell the difference between your configuration and the "empty" one used when nothing is found on disk.
epoch	Incremented every time the configuration is updated (usually by the admin)
num_updates	Incremented every time the configuration or status is updated (usually by the cluster)

2.实际验证

2.1 环境

3台机器，srdsdevapp69，srdsdevapp71和srdsdevapp73
OS: CentOS 6.3
Pacemaker: 1.1.14-1.el6 (Build: 70404b0)
Corosync: 1.4.1-7.el6

2.2 基本验证

0. 初始时epoch="48304"，num_updates="4"

[root@srdsdevapp69 mysql_ha]# cibadmin -Q |grep epoch

1. 更新集群配置导致epoch加1并将num_updates清0

[root@srdsdevapp69 mysql_ha]# crm_attribute --type crm_config -s set1 --name foo1 -v "1"
[root@srdsdevapp69 mysql_ha]# cibadmin -Q |grep epoch

2. 更新值如果和现有值相同epoch不变

[root@srdsdevapp69 mysql_ha]# crm_attribute --type crm_config -s set1 --name foo1 -v "1"
[root@srdsdevapp69 mysql_ha]# cibadmin -Q |grep epoch

3. 更新生命周期为forever的节点属性也导致epoch加1

[root@srdsdevapp69 mysql_ha]# crm_attribute -N `hostname` -l forever -n foo2 -v 2
[root@srdsdevapp69 mysql_ha]# cibadmin -Q |grep epoch

4. 更新生命周期为reboot的节点属性导致num_updates加1

[root@srdsdevapp69 mysql_ha]# crm_attribute -N `hostname` -l reboot -n foo3 -v 2
[root@srdsdevapp69 mysql_ha]# cibadmin -Q |grep epoch

2.3 分区验证

1. 人为造成srdsdevapp69和其它两个节点的网络隔离形成分区,分区前的DC(Designated Controller)为srdsdevapp73

[root@srdsdevapp69 mysql_ha]# iptables -A INPUT -j DROP -s srdsdevapp71
[root@srdsdevapp69 mysql_ha]# iptables -A OUTPUT -j DROP -s srdsdevapp71
[root@srdsdevapp69 mysql_ha]# iptables -A INPUT -j DROP -s srdsdevapp73
[root@srdsdevapp69 mysql_ha]# iptables -A OUTPUT -j DROP -s srdsdevapp73

两个分区上的epoch都没有变，仍是48306,但srdsdevapp69将自己作为了自己分区的DC 。

分区1(srdsdevapp69) : 未取得QUORUM

[root@srdsdevapp69 mysql_ha]# cibadmin -Q |grep epoch

分区2(srdsdevapp71,srdsdevapp73) : 取得QUORUM

[root@srdsdevapp71 ~]# cibadmin -Q |grep epoch

2. 在srdsdevapp69上做2次配置更新，使其epoch增加2

[root@srdsdevapp69 mysql_ha]# crm_attribute --type crm_config -s set1 --name foo4 -v "1"
[root@srdsdevapp69 mysql_ha]# crm_attribute --type crm_config -s set1 --name foo5 -v "1"
[root@srdsdevapp69 mysql_ha]# cibadmin -Q |grep epoch

3.在srdsdevapp71上做1次配置更新，使其epoch增加1

[root@srdsdevapp71 ~]# crm_attribute --type crm_config -s set1 --name foo6 -v "1"
[root@srdsdevapp71 ~]# cibadmin -Q |grep epoch

4.恢复网络再检查集群的配置

[root@srdsdevapp69 mysql_ha]# iptables -F
[root@srdsdevapp69 mysql_ha]# cibadmin -Q |grep epoch
[root@srdsdevapp69 mysql_ha]# crm_attribute --type crm_config -s set1 --name foo5 -q
1
[root@srdsdevapp69 mysql_ha]# crm_attribute --type crm_config -s set1 --name foo4 -q
1
[root@srdsdevapp69 mysql_ha]# crm_attribute --type crm_config -s set1 --name foo6 -q
Error performing operation: No such device or address

可以发现集群采用了srdsdevapp69分区的配置,因为它的版本更大，这时在srdsdevapp71,srdsdevapp73分区上所做的更新丢失了。
这个测试反映了一个问题：取得QUORUM的分区配置可能会被未取得QUORUM的分区配置覆盖。如果自己开发RA的话，这是一个需要注意的问题。

2.4 分区验证2

前一个测试中，产生分区前的DC在取得QUORUM的分区中，现在再试一下产生分区前的DC在未取得QUORUM的分区中的场景。

1. 人为造成DC(srdsdevapp73)和其它两个节点的网络隔离形成分区

[root@srdsdevapp73 ~]# iptables -A INPUT -j DROP -s srdsdevapp69
[root@srdsdevapp73 ~]# iptables -A OUTPUT -j DROP -s srdsdevapp69
[root@srdsdevapp73 ~]# iptables -A INPUT -j DROP -s srdsdevapp71
[root@srdsdevapp73 ~]# iptables -A OUTPUT -j DROP -s srdsdevapp71

srdsdevapp73上epoch没有变

[root@srdsdevapp73 ~]# cibadmin -Q |grep epoch

但另一个分区(srdsdevapp69,srdsdevapp71)上的epoch加1了

[root@srdsdevapp69 ~]# cibadmin -Q |grep epoch

恢复网络后集群采用了版本号更高的配置，DC仍然是分区前的DC(srdsdevapp73)

[root@srdsdevapp73 ~]# iptables -F
[root@srdsdevapp73 ~]# cibadmin -Q |grep epoch

通过这个测试可以发现：

DC协商会导致epoch加1
分区恢复后，Pacemaker倾向于使分区前的DC作为新的DC

3.总结

Pacemaker的行为特征

CIB配置变更会导致epoch加1
DC协商会导致epoch加1
分区恢复后，Pacemaker采取版本号大的作为集群的配置
分区恢复后，Pacemaker倾向于使分区前的DC作为新的DC

开发RA的注意点

尽量避免动态修改集群配置
如果做不到第一点，尽量避免使用多个动态集群配置参数，比如可以把多个参数拼接成一个（mysql的mysql_REPL_INFO就是这么干的）
检查crm_attribute的出错并重试(pgsql就是这么干的)
失去quorum时的资源停止处理(demote，stop)中避免修改集群配置

Statement of this Website

The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress images for free

Clothoff.io

AI clothes remover

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Assassin's Creed Shadows: Seashell Riddle Solution

3 weeks ago By DDD

What's New in Windows 11 KB5054979 & How to Fix Update Issues

2 weeks ago By DDD

Assassin's Creed Shadows - How To Find The Blacksmith And Unlock Weapon And Armour Customisation

1 months ago By DDD

Where to find the Crane Control Keycard in Atomfall

3 weeks ago By DDD

Roblox: Dead Rails - How To Complete Every Challenge

3 weeks ago By DDD

Hot Tools

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

God-level code editing software (SublimeText3)

Hot Topics

Where is the login entrance for gmail email?

7644

CakePHP Tutorial

1392

What is the format of the account name of steam

win11 activation key permanent

nyt connections hints and answers

151

Related knowledge

New report delivers damning assessment of rumoured Samsung Galaxy S25, Galaxy S25 Plus and Galaxy S25 Ultra camera upgrades Sep 12, 2024 pm 12:23 PM

In recent days, Ice Universe has been steadily revealing details about the Galaxy S25 Ultra, which is widely believed to be Samsung's next flagship smartphone. Among other things, the leaker claimed that Samsung only plans to bring one camera upgrade

Samsung Galaxy S25 Ultra leaks in first render images with rumoured design changes revealed Sep 11, 2024 am 06:37 AM

OnLeaks has now partnered with Android Headlines to provide a first look at the Galaxy S25 Ultra, a few days after a failed attempt to generate upwards of $4,000 from his X (formerly Twitter) followers. For context, the render images embedded below h

$IFA 2024 | TCL\'s NXTPAPER 14 won\'t match the Galaxy Tab S10 Ultra in performance, but it nearly matches it in size$ IFA 2024 | TCL\'s NXTPAPER 14 won\'t match the Galaxy Tab S10 Ultra in performance, but it nearly matches it in size Sep 07, 2024 am 06:35 AM

Alongside announcing two new smartphones, TCL has also announced a new Android tablet called the NXTPAPER 14, and its massive screen size is one of its selling points. The NXTPAPER 14 features version 3.0 of TCL's signature brand of matte LCD panels

Vivo Y300 Pro packs 6,500 mAh battery in a slim 7.69 mm body Sep 07, 2024 am 06:39 AM

The Vivo Y300 Pro just got fully revealed, and it's one of the slimmest mid-range Android phones with a large battery. To be exact, the smartphone is only 7.69 mm thick but features a 6,500 mAh battery. This is the same capacity as the recently launc

Samsung Galaxy S24 FE billed to launch for less than expected in four colours and two memory options Sep 12, 2024 pm 09:21 PM

Samsung has not offered any hints yet about when it will update its Fan Edition (FE) smartphone series. As it stands, the Galaxy S23 FE remains the company's most recent edition, having been presented at the start of October 2023. However, plenty of

New report delivers damning assessment of rumoured Samsung Galaxy S25, Galaxy S25 Plus and Galaxy S25 Ultra camera upgrades Sep 12, 2024 pm 12:22 PM

Xiaomi Redmi Note 14 Pro Plus arrives as first Qualcomm Snapdragon 7s Gen 3 smartphone with Light Hunter 800 camera Sep 27, 2024 am 06:23 AM

The Redmi Note 14 Pro Plus is now official as a direct successor to last year'sRedmi Note 13 Pro Plus(curr. $375 on Amazon). As expected, the Redmi Note 14 Pro Plus heads up the Redmi Note 14 series alongside theRedmi Note 14and Redmi Note 14 Pro. Li

iQOO Z9 Turbo Plus: Reservations begin for the potentially beefed-up series flagship Sep 10, 2024 am 06:45 AM

OnePlus'sister brand iQOO has a 2023-4 product cycle that might be nearlyover; nevertheless, the brand has declared that it is not done with itsZ9series just yet. Its final, and possibly highest-end,Turbo+variant has just beenannouncedas predicted. T

See all articles