> 데이터 베이스 > MySQL 튜토리얼 > 打完AIX操作系统补丁,ASM hang磁盘无法mount

打完AIX操作系统补丁,ASM hang磁盘无法mount

WBOY
풀어 주다: 2016-06-07 16:38:05
원래의
1385명이 탐색했습니다.

同事的一套数据库是AIX系统的,打完操作系统补丁之后,ASM实例启动无法mount磁盘组,出现hang的情况。然后我们对这个hang的过程做了一个hang analyze,hang分析的文件如下所示: Chains most likely to have caused the hang: [a] Chain 1 Signature: ='rdbm

同事的一套数据库是AIX系统的,打完操作系统补丁之后,ASM实例启动无法mount磁盘组,出现hang的情况。然后我们对这个hang的过程做了一个hang analyze,hang分析的文件如下所示:

Chains most likely to have caused the hang:
 [a] Chain 1 Signature:  Oracle session identified by:
    {
                instance: 1 (+asm.+asm)
                   os id: 5112446
              process id: 18, oracle@NG-CSDB (TNS V1-V3)
              session id: 1135
        session serial #: 1
    }
    which is waiting for 'rdbms ipc reply' with wait info:
    {
                      p1: 'from_process'=0xd
                      p2: 'timeout'=0x7fffff7b
            time in wait: 1.626263 sec
           timeout after: 0.373737 sec
                 wait id: 86
                blocking: 2 sessions
             current sql: ALTER DISKGROUP UATDG MOUNT  /* asm agent *//* {0:0:324} */
             short stack: ksedsts()+360 Oracle session identified by:
    {
                instance: 1 (+asm.+asm)
                   os id: 4653624
              process id: 13, oracle@NG-CSDB (RBAL)
              session id: 820
        session serial #: 1
    }
    which is not in a wait:
    {
               last wait: 2 min 13 sec ago
                blocking: 3 sessions
             current sql: 
             short stack: ksedsts()+360
<p><font face="幼圆" size="3">可以看到我们的会话在执行磁盘组mount的过程中,出现了等待事件enq: DD – contention,而这个进程被ASM的RBAL进程所阻塞,RBAL的历史等待情况是’CSS initialization’=>’GPnP Get Item’=>’CSS operation: action’,而hang分析文件里面也包含了RBAL进程的堆栈信息。一般遇到这种问题,我们需要把相关堆栈拿到support上做一下搜索,于是,我搜到了一篇文档,和我们等待事件还有堆栈信息如出一辙。文档内容如下:ASM Instance Hangs During The Diskgroup Mount Stage After AIX OS Patch Install (文档 ID 1633273.1)。我们来看一下该文档的一些说明情况。文档上说明这是bug 16288155导致的,但是在11.2.0.3上已经进行了修复。但是奇怪的是,我们的版本就是11.2.0.3的版本。文档上要我们对RBAL进程做一下truss信息搜集,如下所示:</font></p>
<pre class="brush:php;toolbar:false">1. Create a script called rbal.sh on /home/oracle directory. Add the following lines:
#!/bin/csh
/usr/bin/truss -o /tmp/rbal_strace.log -aefldD /bin/oracle
chmod +x /home/oracle/rbal.sh
2.
sqlplus / as sysasm
SQL>create pfile='/tmp/xx.ora' from spfile;
3. Edit xx.ora and add the following parameter
_dbg_proc_startup=TRUE
4. Shutdown ASM
5. Startup ASM using the following commands:
# csh
# setenv RBAL /home/oracle/rbal.sh
# sqlplus / as sysasm
# SQL> startup pfile='/tmp/xx.ora'
로그인 후 복사

我们按照文档的要求对进程做了truss的收集,内容如下:

3866976: 20447683: 0.0002: statx("/oracle/app/grid_base/diag/asm/+asm/+ASM/alert/log.xml", 0x0FFFFFFFFFFF7FD0, 176, 01) = 0 
3866976: 20447683: 0.0002: statx("/oracle/app/grid_base/diag/asm/+asm/+ASM/alert/log.xml", 0x0FFFFFFFFFFF8100, 176, 0) = 0 
3866976: 20447683: 0.0004: kopen("/oracle/app/grid_base/diag/asm/+asm/+ASM/alert/log.xml", O_WRONLY|O_CREAT|O_APPEND|O_LARGEFILE, S_IRUSR|S_IWUSR|S_IRGRP|S_IWGRP) = 7 
3866976: 20447683: 0.0002: kfcntl(7, F_SETFD, 0x0000000000000001) = 0 
3866976: 20447683: kwrite(7, " 
<p><font face="幼圆" size="3">我们可以看到,RBAL已经启动了,然后就一直处于Sleeping的状态。证明RBAL进程没有什么问题。而MOS的文档是显示RBAL访问磁盘出现了问题。我们的情况是不一样的。那么究竟是什么问题?这个确实很诡异,我们只能根据这篇文章最后介绍的办法就是把os的patch回滚掉,然后重新relink软件。对于这个问题开了SR也没有什么特别的进展,怀疑是一个还没有发现的冲突。因为毕竟Oracle软件是先研发出来的,而IBM的新补丁是后开发出来的,这些本身就容易造成一些不兼容的情况。最好是把操作系统补丁都安装好了,然后再重装最新版的数据库软件。</font></p>
<p><font face="幼圆" size="3"><strong>参考文档:ASM Instance Hangs During The Diskgroup Mount Stage After AIX OS Patch Install (文档 ID 1633273.1)</strong></font></p>
    <p class="copyright">
        原文地址:打完AIX操作系统补丁,ASM hang磁盘无法mount, 感谢原作者分享。
    </p>
    
    


로그인 후 복사
관련 라벨:
원천:php.cn
본 웹사이트의 성명
본 글의 내용은 네티즌들의 자발적인 기여로 작성되었으며, 저작권은 원저작자에게 있습니다. 본 사이트는 이에 상응하는 법적 책임을 지지 않습니다. 표절이나 침해가 의심되는 콘텐츠를 발견한 경우 admin@php.cn으로 문의하세요.
최신 이슈
인기 튜토리얼
더>
최신 다운로드
더>
웹 효과
웹사이트 소스 코드
웹사이트 자료
프론트엔드 템플릿