Oracle RAC: one node keeps rebooting (RAC single-node reboot problem)
OS:HP-UX B.11.31
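Oracle: 10.2.0.4.0 - 64bit
Two nodes, RAC + ASM.
The interconnect (heartbeat) links of both nodes go through the same switch.
DB02 rebooted on its own. According to the CSSD log on DB01, DB01 found that DB02 was unreachable and had DB02 rebooted. I don't know the actual cause, and I don't know how to track it down.
We have had the heartbeat link break before, and that also made DB02 reboot. That time, though, the CRS logs on both nodes showed each node competing for the voting disk and trying to reboot the other.
This time only the CSSD on DB01 logged anything. DB02 logged nothing at all before it rebooted.
Could the experts here give me some pointers or suggest an approach? What could be causing this?
The relevant log entries are below.
OLDsyslog.log on DB02: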
Dec 10 11:24:07 db02 syslog: Oracle CSSD failure 134.
Dec 10 11:24:07 db02 syslog: Oracle CRS failure. Rebooting for cluster integrity.
Dec 10 11:24:07 db02 vmunix: User requested reset of the system.
Dec 10 11:24:07 db02 vmunix:
Dec 10 11:24:07 db02 vmunix: Oracle CRS TOC for clusterware integrity...
Relevant syslog.log entry on db01 for the same time window:
Dec 10 11:23:47 db01 vmunix: Dead gateway detection can't ping the last remaining default gateway at 0xc0a82101 .See ndd -h ip_ire_gw_probe for more info
I don't know what this means. Is it saying that the gateway was unreachable?
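The gateway in that message is printed as a hexadecimal number, which makes it hard to tell which network it belongs to. A minimal sketch for decoding it, assuming the value is a 32-bit IPv4 address in network byte order (the helper name `hex_to_ipv4` is made up for this illustration):

```python
import ipaddress


def hex_to_ipv4(hex_str: str) -> str:
    """Convert a hex string such as '0xc0a82101' to dotted-quad notation.

    Assumes the value is a 32-bit IPv4 address in network byte order.
    """
    return str(ipaddress.IPv4Address(int(hex_str, 16)))


if __name__ == "__main__":
    # Gateway address taken from the db01 syslog line above.
    print(hex_to_ipv4("0xc0a82101"))
```

Under that assumption the address is 192.168.33.1. The post does not say whether this gateway sits on the public network or on the interconnect, so that still has to be checked against the interface configuration of db01.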
CSSD log on DB01:
[ CSSD]2009-12-10 11:23:52.710 [14] >WARNING: clssnmPollingThread: node db02 (2) at 50% heartbeat fatal, eviction in 14.084 seconds
[ CSSD]2009-12-10 11:23:52.711 [14] >TRACE: clssnmPollingThread: node db02 (2) is impending reconfig, flag 1037, misstime 15916
[ CSSD]2009-12-10 11:23:52.711 [14] >TRACE: clssnmPollingThread: diskTimeout set to (27000)ms impending reconfig status(1)
[ CSSD]2009-12-10 11:23:59.710 [14] >WARNING: clssnmPollingThread: node db02 (2) at 75% heartbeat fatal, eviction in 7.084 seconds
[ CSSD]2009-12-10 11:24:00.710 [14] >WARNING: clssnmPollingThread: node db02 (2) at 75% heartbeat fatal, eviction in 6.084 seconds
[ CSSD]2009-12-10 11:24:04.710 [14] >WARNING: clssnmPollingThread: node db02 (2) at 90% heartbeat fatal, eviction in 2.084 seconds
[ CSSD]2009-12-10 11:24:05.710 [14] >WARNING: clssnmPollingThread: node db02 (2) at 90% heartbeat fatal, eviction in 1.084 seconds
[ CSSD]2009-12-10 11:24:06.710 [14] >WARNING: clssnmPollingThread: node db02 (2) at 90% heartbeat fatal, eviction in 0.084 seconds
[ CSSD]2009-12-10 11:24:06.800 [14] >TRACE: clssnmPollingThread: Eviction started for node db02 (2), flags 0x040d, state 3, wt4c 0
[ CSSD]2009-12-10 11:24:06.800 [16] >TRACE: clssnmDoSyncUpdate: Initiating sync 11
[ CSSD]2009-12-10 11:24:06.801 [16] >TRACE: clssnmDoSyncUpdate: diskTimeout set to (27000)ms
[ CSSD]2009-12-10 11:24:06.801 [16] >TRACE: clssnmSetupAckWait: Ack message type (11)
[ CSSD]2009-12-10 11:24:06.801 [16] >TRACE: clssnmSetupAckWait: node(1) is ALIVE
[ CSSD]2009-12-10 11:24:06.801 [16] >TRACE: clssnmSendSync: syncSeqNo(11)
[ CSSD]2009-12-10 11:24:06.801 [16] >TRACE: clssnmWaitForAcks: Ack message type(11), ackCount(1)
[ CSSD]2009-12-10 11:24:06.801 [9] >TRACE: clssnmHandleSync: diskTimeout set to (27000)ms
[ CSSD]2009-12-10 11:24:06.801 [6] >TRACE: clssnmReadDskHeartbeat: node(2) is down. rcfg(11) wrtcnt(1176466) LATS(3931462902) Disk lastSeqNo(1176466)
[ CSSD]2009-12-10 11:24:06.801
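
One way to start narrowing this down is to line up the timestamps from the logs above. The sketch below uses only values that appear in the DB01 CSSD log (the first 50% warning with its "eviction in 14.084 seconds", and the `misstime 15916` trace one millisecond later) plus the 11:23:47 dead-gateway message from the db01 syslog. It assumes `misstime` is the number of milliseconds since DB01 last received a network heartbeat from db02. The function and variable names are made up for this illustration.

```python
from datetime import datetime, timedelta

FMT = "%Y-%m-%d %H:%M:%S.%f"


def last_heartbeat(trace_ts: str, misstime_ms: int) -> datetime:
    """Estimate when the last heartbeat arrived (assumes misstime is in ms)."""
    return datetime.strptime(trace_ts, FMT) - timedelta(milliseconds=misstime_ms)


def expected_eviction(warn_ts: str, eviction_in_s: float) -> datetime:
    """Time at which the warning says the eviction will happen."""
    return datetime.strptime(warn_ts, FMT) + timedelta(seconds=eviction_in_s)


def implied_timeout_s(misstime_ms: int, eviction_in_s: float) -> float:
    """Time already missed plus time remaining: the total heartbeat timeout."""
    return round(misstime_ms / 1000 + eviction_in_s, 3)


if __name__ == "__main__":
    hb = last_heartbeat("2009-12-10 11:23:52.711", 15916)
    ev = expected_eviction("2009-12-10 11:23:52.710", 14.084)
    # syslog only has one-second resolution, so this gap is approximate.
    gateway_msg = datetime(2009, 12, 10, 11, 23, 47)

    print("last heartbeat seen  :", hb.time())
    print("expected eviction    :", ev.time())
    print("implied timeout (s)  :", implied_timeout_s(15916, 14.084))
    print("gateway msg after hb :", (gateway_msg - hb).total_seconds(), "s")
```

With these inputs the missed time and the remaining time add up to 30 seconds, which would be consistent with a CSS misscount of 30 (the post does not state the configured value; `crsctl get css misscount` shows it). The estimated last heartbeat is about 11:23:36.8, roughly ten seconds before db01 logged the dead-gateway message. So on DB01's side the network trouble appears to start around 11:23:36, and the switch and NIC logs for that moment are a reasonable place to look next.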
