Sun Microsystems, Inc.  Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition
   Home | Current Systems | Former STK Products | EOL Systems | Components | General Info | Search | Feedback

Asset ID: 1-71-1936834.1
Update Date:2017-10-11
Keywords:

Solution Type  Technical Instruction Sure

Solution  1936834.1 :   M8000 Has Bad C0t1d0 300g Drive after replacement the format/cfgadm unable to attach the disk back to OS  


Related Items
  • Sun SPARC Enterprise M8000 Server
  •  
Related Categories
  • PLA-Support>Sun Systems>SPARC>Enterprise>SN-SPARC: Mx000
  •  




In this Document
Goal
Solution
References


Created from <SR 3-9751258601>

Applies to:

Sun SPARC Enterprise M8000 Server - Version All Versions to All Versions [Release All Releases]
SPARC

Goal

This document assist customer, sparc platform engineer and/or onsite field engineer when encountering failed disk and after disk replacement the OS cannot recognized the disk 

Solution

We should see this message event from the messages log file

Oct 14 03:10:01 <hostname> scsi: [ID 243001 kern.warning] WARNING: /pci@0,600000/pci@0/scsi@1 (mpt0):
Oct 14 03:10:01 <hostname> Disconnected command timeout for Target 1 SAS=5000c500335d85c1

Oct 15 13:46:34 <hostname> scsi: [ID 107833 kern.warning] WARNING: /pci@0,600000/pci@0/scsi@1 (mpt0):
Oct 15 13:46:34 <hostname> mpt_handle_event_sync : SAS target 1 added.

Oct 15 13:52:26 <hostname> scsi: [ID 107833 kern.warning] WARNING: /pci@0,600000/pci@0/scsi@1/sd@1,0 (sd0):
Oct 15 13:52:26 <hostname> Error for Command: read_capacity           Error Level: Retryable
Oct 15 13:52:26 <hostname> scsi: [ID 107833 kern.notice] Requested Block: 0                         Error Block: 0
Oct 15 13:52:26 <hostname> scsi: [ID 107833 kern.notice] Vendor: SEAGATE                            Serial Number: 110672S9A4  
Oct 15 13:52:26 <hostname> scsi: [ID 107833 kern.notice] Sense Key: Unit_Attention
Oct 15 13:52:26 <hostname> scsi: [ID 107833 kern.notice] ASC: 0x29 (power on occurred), ASCQ: 0x1, FRU: 0x1

as well as the following in the cfgadm -alv output, and notice that the disk does not have a relevant dsk/c# t# d#.

c0                             connected    configured   unknown
unavailable  scsi-sas     n        /devices/pci@0,600000/pci@0/scsi@1:scsi
c0::dsk/c0t0d0                 connected    configured   unknown    SEAGATE ST930003SSUN300G
unavailable  disk         n        /devices/pci@0,600000/pci@0/scsi@1:scsi::dsk/c0t0d0
c0::sd0                        connected    configured   unknown    SEAGATE ST930003SSUN300G
unavailable  disk         n        /devices/pci@0,600000/pci@0/scsi@1:scsi::sd0   

The cfgadm and format command will not work with above disk device. The platform log indicate that there is a hot removal and insertion of disk replacement.

The devfsadm command should be used with the options "-c disk -Cv" to clean up and device path. Upon successful relinking of the device path customer and/or field engineer should be able to cfgadm and bring the disk back to OS control.

 

If the devfsadm option was not successful you will need to reboot the platform or engage OS/disk driver expert engineer.


Pls check New IOCStatus / IOCLogInfo message reported by the mpt driver (Doc ID 1408673.1) for BUG 15441000

/pci@0/pci@0/pci@2/scsi@0 (mpt0): mpt_get_sas_device_page0 config: IOCStatus=0x8022, IOCLogInfo=0x30030501

with "label checksum failed" and disk was detected by OS format command pls replace the disk and see below
Bug 15074229: SUNBT4487312 DATA CORRUPTED AFTER: "CORRUPT LABEL - LABEL CHECKSUM FAILED" ON BA

Oct 15 20:06:39 <hostname> scsi: [ID 107833 kern.warning] WARNING: /pci@0,600000/pci@0/scsi@1/sd@1,0 (sd0):
Oct 15 20:06:39 <hostname> Corrupt label - label checksum failed

 

 

References

<NOTE:1408673.1> - New IOCStatus / IOCLogInfo message reported by the mpt driver

Attachments
This solution has no attachment
  Copyright © 2018 Oracle, Inc.  All rights reserved.
 Feedback