问题描述
两个实例异常终止,文件系统变成只读:
PCLERPDB2:[10g]:/DBMS/PCMK/admin/PCMK> sqlplus "/ as sysdba" SQL*Plus: Release 10.2.0.3.0 - Production on Thu Jan 19 09:08:05 2012 Copyright (c) 1982, 2006, Oracle. All Rights Reserved. ERROR: ORA-09925: Unable to create audit trail file Linux Error: 30: Read-only file system Additional information: 9925 ORA-01075: you are currently logged on
专家解答
检查系统日志,发现早晨出现SCSI IO错误:
Jan 19 07:56:00 PCLERPDB2 kernel: SCSI error : <0 0 0 1> return code = 0x10000 Jan 19 07:56:00 PCLERPDB2 kernel: end_request: I/O error, dev sda, sector 26480696 Jan 19 07:56:00 PCLERPDB2 kernel: Buffer I/O error on device sda1, logical block 3310083 Jan 19 07:56:00 PCLERPDB2 kernel: lost page write due to I/O error on sda1 Jan 19 07:56:00 PCLERPDB2 kernel: SCSI error : <0 0 0 9> return code = 0x10000 Jan 19 07:56:00 PCLERPDB2 kernel: end_request: I/O error, dev sdh, sector 60052680 Jan 19 07:56:00 PCLERPDB2 kernel: SCSI error : <0 0 0 4> return code = 0x10000 Jan 19 07:56:00 PCLERPDB2 kernel: end_request: I/O error, dev sdc, sector 20042688 Jan 19 07:56:00 PCLERPDB2 kernel: SCSI error : <0 0 0 9> return code = 0x10000 Jan 19 07:56:00 PCLERPDB2 kernel: end_request: I/O error, dev sdh, sector 26747408 Jan 19 07:56:00 PCLERPDB2 kernel: Buffer I/O error on device sdh2, logical block 843074 Jan 19 07:56:00 PCLERPDB2 kernel: lost page write due to I/O error on sdh2 Jan 19 07:56:00 PCLERPDB2 kernel: SCSI error : <0 0 0 1> return code = 0x10000 Jan 19 07:56:00 PCLERPDB2 kernel: end_request: I/O error, dev sda, sector 32606944 Jan 19 07:56:00 PCLERPDB2 kernel: Buffer I/O error on device sda1, logical block 4075864
然后数据库崩溃.
安排用户重启数据库主机,检查是否硬件软故障。
很幸运,重启后数据库能够正常启动:
Thu Jan 19 09:55:09 2012 Completed redo application Thu Jan 19 09:55:09 2012 Completed crash recovery at Thread 1: logseq 18735, block 5214, scn 5965501404211 59 data blocks read, 59 data blocks written, 609 redo blocks read Thu Jan 19 09:55:09 2012 LGWR: STARTING ARCH PROCESSES ARC0 started with pid=23, OS id=14599 Thu Jan 19 09:55:09 2012 ARC0: Archival started ARC1: Archival started LGWR: STARTING ARCH PROCESSES COMPLETE ARC1 started with pid=24, OS id=14601 Thu Jan 19 09:55:09 2012 Thread 1 advanced to log sequence 18736 Thread 1 opened at log sequence 18736 Current log# 3 seq# 18736 mem# 0: /DBMS/DCERP/dcerpdata/log03a.dbf Current log# 3 seq# 18736 mem# 1: /DBMS/DCERP/dcerpdata/log03b.dbf Successful open of redo thread 1 Thu Jan 19 09:55:09 2012 MTTR advisory is disabled because FAST_START_MTTR_TARGET is not set Thu Jan 19 09:55:09 2012 ARC0: Becoming the 'no FAL' ARCH ARC0: Becoming the 'no SRL' ARCH Thu Jan 19 09:55:09 2012 ARC1: Becoming the heartbeat ARCH Thu Jan 19 09:55:09 2012 SMON: enabling cache recovery Thu Jan 19 09:55:11 2012 Successfully onlined Undo Tablespace 368. Thu Jan 19 09:55:11 2012 SMON: enabling tx recovery Thu Jan 19 09:55:11 2012 Database Characterset is UTF8 Thu Jan 19 09:55:11 2012 Incremental checkpoint up to RBA [0x4930.3.0], current log tail at RBA [0x4930.43.0] Thu Jan 19 09:55:11 2012 replication_dependency_tracking turned off (no async multimaster replication found) Starting background process QMNC QMNC started with pid=25, OS id=14626 Thu Jan 19 09:55:25 2012 Completed: ALTER DATABASE OPEN Thu Jan 19 10:15:13 2012 Incremental checkpoint up to RBA [0x4930.100d.0], current log tail at RBA [0x4930.107d.0] Thu Jan 19 10:35:16 2012 Incremental checkpoint up to RBA [0x4930.150f.0], current log tail at RBA [0x4930.155b.0] Thu Jan 19 10:55:17 2012 Incremental checkpoint up to RBA [0x4930.1724.0], current log tail at RBA [0x4930.175a.0] Thu Jan 19 11:15:18 2012 Incremental checkpoint up to RBA [0x4930.1edf.0], current log tail at RBA [0x4930.1f13.0]
估计硬件的生命周期达到,需要更新了。
「喜欢这篇文章,您的关注和赞赏是给作者最好的鼓励」
关注作者
【版权声明】本文为墨天轮用户原创内容,转载时必须标注文章的来源(墨天轮),文章链接,文章作者等基本信息,否则作者和墨天轮有权追究责任。如果您发现墨天轮中有涉嫌抄袭或者侵权的内容,欢迎发送邮件至:contact@modb.pro进行举报,并提供相关证据,一经查实,墨天轮将立刻删除相关内容。