Sun Microsystems, Inc.  Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition
   Home | Current Systems | Former STK Products | EOL Systems | Components | General Info | Search | Feedback

Asset ID: 1-72-1916321.1
Update Date:2017-10-11
Keywords:

Solution Type  Problem Resolution Sure

Solution  1916321.1 :   SunFire[TM] V1280, E2900 and Netra[TM] 1280, 1290 System Hang Occur And OS Disk Failed  


Related Items
  • Sun Fire V1280 Server
  •  
  • Sun Netra 1290 Server
  •  
  • Sun Fire E2900 Server
  •  
  • Sun Netra 1280 Server
  •  
Related Categories
  • PLA-Support>Sun Systems>SPARC>Enterprise>SN-SPARC: SF-x8x0/Ex900
  •  
  • Tools>Type>Information Center
  •  
  • Tools>Primary Use>Availability
  •  
  • Tools>Type>Diagnostic - Analyzer Script
  •  




In this Document
Symptoms
Changes
Cause
Solution
References


Created from <SR 3-9462901271>

Applies to:

Sun Fire E2900 Server - Version All Versions to All Versions [Release All Releases]
Sun Netra 1280 Server - Version All Versions to All Versions [Release All Releases]
Sun Fire V1280 Server - Version All Versions to All Versions [Release All Releases]
Sun Netra 1290 Server - Version All Versions to All Versions [Release All Releases]
Information in this document applies to any platform.

Symptoms

 System Has hung, and OS disk has fatal errors

Changes

 

Cause

 Disk has failed.

Solution

These messages show the disk path where the physical disk has FAILED.  The cluster software has failed the disk, and it was offlined by the OS

Aug 11 13:30:49 suntestd2 genunix: [ID 408114 kern.info] /ssm@0,0/pci@18,600000/pci@2/scsi@2/sd@0,0 (sd8) offline
Aug 11 13:36:56 suntestd2 Cluster.scdpmd: [ID 977412 daemon.notice] The state of the path to device: /dev/did/rdsk/d12s0 has changed to FAILED


 

The OS/Kernel indicates the disk location, in this case it is controled by the glm HBA if this was internal then it would be an mpt device and control by LSI/mpt device.

Aug 11 16:42:43 suntestd2 scsi: [ID 583861 kern.info] sd8 at glm0: target 0 lun 0
Aug 11 16:42:43 suntestd2 genunix: [ID 936769 kern.info] sd8 is /ssm@0,0/pci@18,600000/pci@2/scsi@2/sd@0,0
Aug 11 16:42:44 suntestd2 genunix: [ID 408114 kern.info] /ssm@0,0/pci@18,600000/pci@2/scsi@2/sd@0,0 (sd8) online


 

Looking at the list of scsi sense codes (Doc ID 1523186.1) we see that this is a media error and the bad sector allocation table of the specific disk was full hence the auto reallocation failed

Aug 11 16:46:09 suntestd2 scsi: [ID 107833 kern.warning] WARNING: /ssm@0,0/pci@18,600000/pci@2/scsi@2/sd@0,0 (sd8):
Aug 11 16:46:09 suntestd2 Error for Command: read(10) Error Level: Retryable
Aug 11 16:46:09 suntestd2 scsi: [ID 107833 kern.notice] Requested Block: 65552896 Error Block: 65553054
Aug 11 16:46:09 suntestd2 scsi: [ID 107833 kern.notice] Vendor: SEAGATE Serial Number: 0633458G4D
Aug 11 16:46:09 suntestd2 scsi: [ID 107833 kern.notice] Sense Key: Media_Error
Aug 11 16:46:09 suntestd2 scsi: [ID 107833 kern.notice] ASC: 0x11 (unrecovered read error - auto reallocate failed), ASCQ: 0x4, FRU: 0x1


 

The suspected disk in this case should be replaced immediately.

 

References

<NOTE:1523186.1> - List of scsi sense codes

Attachments
This solution has no attachment
  Copyright © 2018 Oracle, Inc.  All rights reserved.
 Feedback