2019-08-12
11.2.0.4 RAC节点二被驱逐
节点二被驱逐,重启无法加入,启动ohasd,然后节点一也重启了
收藏
复制链接
微信扫码分享
在小程序上查看
分享
8条回答
默认
最新
采纳答案后不可修改和取消
评论
有用 0采纳答案后不可修改和取消
两节点TFA日志及当时系统负载
链接:https://pan.baidu.com/s/1JmElim6O450AjiJ9A9TUdQ
提取码:1siy
评论
有用 0第一次驱逐日志如下IPC Send timeout detected:
Mon Jul 29 06:22:01 2019 IPC Send timeout detected. Receiver ospid 64555 [ Mon Jul 29 06:22:01 2019 Errors in file /oracle/ECP/saptrace/diag/rdbms/ecp/ECP001/trace/ECP001_lms3_64555.trc: IPC Send timeout detected. Receiver ospid 64559 [ Mon Jul 29 06:22:01 2019 Errors in file /oracle/ECP/saptrace/diag/rdbms/ecp/ECP001/trace/ECP001_lms4_64559.trc: IPC Send timeout detected. Receiver ospid 64551 [ Mon Jul 29 06:22:01 2019 Errors in file /oracle/ECP/saptrace/diag/rdbms/ecp/ECP001/trace/ECP001_lms2_64551.trc: IPC Send timeout detected. Receiver ospid 64543 [ Mon Jul 29 06:22:01 2019 Errors in file /oracle/ECP/saptrace/diag/rdbms/ecp/ECP001/trace/ECP001_lms0_64543.trc: Mon Jul 29 06:22:27 2019 IPC Send timeout detected. Receiver ospid 64541 [ Mon Jul 29 06:22:27 2019 Errors in file /oracle/ECP/saptrace/diag/rdbms/ecp/ECP001/trace/ECP001_lmd0_64541.trc: Mon Jul 29 06:22:49 2019
lms日志显示大量DRM操作
*** 2019-07-29 06:16:44.305 * lms 3 finished parallel drm freeze in DRM(1260) window 1, pcount 43 DRM(1260) win(1) lms 3 finished drm freeze 2019-07-29 06:16:44.359174 : 2129 GCS shadows traversed, 0 replayed in drm replay DRM(1260) win(1) lms 3 finished replaying gcs resources *** 2019-07-29 06:16:44.757 DRM(1260) win(1) lms 3 finished fixing gcs write protocol DRM(1260) quiesced basts [131072-262143] * lms 3 finished parallel drm freeze in DRM(1260) window 2, pcount 43 DRM(1260) win(2) lms 3 finished drm freeze 2019-07-29 06:16:45.075140 : 2116 GCS shadows traversed, 0 replayed in drm replay DRM(1260) win(2) lms 3 finished replaying gcs resources *** 2019-07-29 06:21:15.845 2019-07-29 06:21:15.845506 : GSIPC:PING: send PINGREQ[1] to 2.4 (seq 0.134764181) stm 0x31a04fed *** 2019-07-29 06:22:01.147 Received ORADEBUG command (#2) 'dump errorstack 1' from process 'Unix process pid: 64529, image: <none>'
请结合AWR确认是否6点有大量跨节点的批量任务,触发DRM的bug,建议先关闭DRM特性。
评论
有用 0采纳答案后不可修改和取消
在当日九点钟的时候,重启了二号节点,也是不成功的。如果是bug引起, 现象是这样吗?而且重启的时候,还引起了一号节点重启了,请大佬帮忙看看!十分感谢
评论
有用 0采纳答案后不可修改和取消
好的。非常感谢!
评论
有用 0采纳答案后不可修改和取消
问题已关闭: 问题已经得到解决
评论
有用 0回答交流
提交
问题信息
请登录之后查看
附件列表
请登录之后查看
邀请回答
暂无人订阅该标签,敬请期待~~
墨值悬赏

