Sun Microsystems, Inc.  Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition
   Home | Current Systems | Former STK Products | EOL Systems | Components | General Info | Search | Feedback

Asset ID: 1-72-1498408.1
Update Date:2013-06-06
Keywords:

Solution Type  Problem Resolution Sure

Solution  1498408.1 :   Exadata: Failed Bundle patch/patchset apply leaves clusterware unable to start  


Related Items
  • Exadata Database Machine X2-2 Full Rack
  •  
Related Categories
  • PLA-Support>Eng Systems>Exadata/ODA/SSC>Oracle Exadata>DB: Exadata_EST
  •  




In this Document
Symptoms
Changes
Cause
Solution
References


Created from <SR 3-6309346521>

Applies to:

Exadata Database Machine X2-2 Full Rack - Version All Versions to All Versions [Release All Releases]
Information in this document applies to any platform.

Symptoms

Gird home versions 11.2.X

+ A failed bundle patch apply leaves clusterware unable to start.

+ Reviewing the opatch logs for the failed apply, we see that the automatic rollback of the patch also failed

+ Tried a manual rollback of the bundle patch(using the BP readme) and still the clusterware fails to start

+ Checking the status of clusterware  with "crsctl stat res -t -init" shows the following resource states:

ora.asm      OFFLINE
ora.crsd     OFFLINE
ora.diskmon  OFFLINE

+ Reviewing the ASM alert log shows the following errors during startup- while trying to mount the diskgroup.

-----------------------------
SQL> ALTER DISKGROUP ALL MOUNT /* asm agent call crs *//* {0:0:2} */
NOTE: Diskgroups listed in ASM_DISKGROUPS are
      DATA_PT01
      RECO_PT01
NOTE: Diskgroup used for Voting files is:
      DBFS_DG
Diskgroup with spfile:DBFS_DG
Diskgroup used for OCR is:DBFS_DG
NOTE: cache registered group DATA_PT01 number=1 incarn=0x1e8993b7

.

.
DSKM process appears to be hung. Initiating system state dump.
Thu Oct 11 23:13:37 2012
System state dump requested by (instance=1, osid=20817 (GEN0)), summary=[system state dump request (ksz_check_ds)].

.
Errors in file /u01/app/grid/diag/asm/+asm/+ASM1/trace/+ASM1_dskm_20823.trc:
ORA-56867: Cannot connect to Master Diskmon on pipe "default pipe"
ORA-27300: OS system dependent operation:connect failed with status: 111
ORA-27301: OS failure message: Connection refused
ORA-27302: failure occurred at: skgznpcon6
DSKM (ospid: 20823): terminating the instance due to error 56867

----------------------

 

+ Additional symptoms observed from other reported issues ( which may or may not be present)

 o  Core files generated in the "$GRID_HOME/log/<hostname>/diskmon" due to diskmon crash

 o "crsctl start res ora.diskmon -init" might bring the resource online, but sets it back to offline as soon as ASM tried to start.

 o  The size or checksum  reported by   "cksum $GRID_HOME/bin/diskmon.bin" might be different across the nodes.

 o Connectivity issues reported by DISKMON log, aborting the clusterware startup

eg: 2013-05-17 19:32:02.860: [ DISKMON][20792:1105443136] dskm_new_ossb10: oss_open for device o/10.217.206.41 (inc 0, ossbp 0x2aaaac013e60) failed with error 2

 

Changes

Issue started after a failed Bundle patch apply on the grid home

-OR-

Issue started after a failed upgrade/patchset application

Cause

 The failed upgrade/patch apply/rollback  leaves the problematic GRID_HOME with only a partial list of patches applied- when compared to a working node.

Especially ,Make sure all 3 patches part of the BP (Database Patch/CRS Patch/Diskmon Patch)  are  reported as applied on the problematic home.

There are several bugs logged on this issue which are closed as "Not a bug" 

Solution

 Re-apply the missing patches on the problematic GRID_HOME and bring it on par with the GRID homes on working nodes

If any of the patches part of the Bundle Patch are observed missing, Re-applying the Bundle Patch should apply the missing patch.

References

<BUG:13786466> - CORE FILE CREATED IN /U01/APP/11.2.0/GRID/LOG/EX11DB01/DISKMON
<BUG:14107412> - UPGRADE OF GI TO 11.2.0.3 FROM 11.2.0.2 IS FAILING DUE TO DISKMON NOT COMING UP
<BUG:14082848> - ORA-27302: FAILURE OCCURRED AT: SKGZNPCON6

Attachments
This solution has no attachment
  Copyright © 2018 Oracle, Inc.  All rights reserved.
 Feedback