Asset ID: 1-72-1673319.1
Update Date: 2017-09-13
Solution Type: Problem Resolution Sure
Solution 1673319.1: T4-4 internal solid-state SATA drive (SSD) attached to the REM faults and shows only 6 MB of space.
Related Categories: PLA-Support > Sun Systems > SPARC > CMT > SN-SPARC: T4
Created from <SR 3-8689430241>
Applies to:
SPARC T4-4 - Version All Versions to All Versions [Release All Releases]
Information in this document applies to any platform.
Symptoms
1. format reports a configured capacity of 6.00MB, and only for the SSDs.
Example:
# format c0t500151795966683Bd0
c0t500151795966683Bd0: configured with capacity of 6.00MB
selecting c0t500151795966683Bd0
[disk formatted]
# format c0t5001517959654C5Fd0
c0t5001517959654C5Fd0: configured with capacity of 6.00MB
selecting c0t5001517959654C5Fd0
[disk formatted]
or
2. If the SSD is a boot drive, the system fails to boot with "Can't open disk label package" and/or "Can't open boot device".
probe-scsi-all still shows the drives, but with a very small block count and size in KB.
Example:
{0} ok probe-scsi-all
/pci@700/pci@1/pci@0/pci@0/LSI,sas@0
FCode Version 1.00.54, MPT Version 2.00, Firmware Version 5.00.17.00
Target 9
Unit 0 Disk ATA INTEL SSDSA2BZ30 0362 16384 Blocks, 8388 KB
SATAWorldWideName 500151795966683b PhyNum 0
/pci@400/pci@1/pci@0/pci@0/LSI,sas@0
FCode Version 1.00.54, MPT Version 2.00, Firmware Version 5.00.17.00
Target 9
Unit 0 Disk ATA INTEL SSDSA2BZ30 0362 16384 Blocks, 8388 KB
SATAWorldWideName 5001517959654c5f PhyNum 0
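The block count in the probe-scsi-all output can be checked with simple arithmetic. The following is a minimal sketch, assuming 512-byte logical blocks (the output does not state the block size); the function name is illustrative only. It shows that 16384 blocks is about 8 MB, while the /var/adm/messages excerpt in this document references block numbers above 88 million, far beyond what the drive now reports.

```python
import re

# Sample unit line copied from the probe-scsi-all output in this document.
LINE = "Unit 0 Disk ATA INTEL SSDSA2BZ30 0362 16384 Blocks, 8388 KB"

BLOCK_SIZE = 512  # assumption: 512-byte logical blocks


def reported_capacity(line):
    """Return (blocks, reported_kb) parsed from a probe-scsi-all unit line."""
    match = re.search(r"(\d+) Blocks, (\d+) KB", line)
    if match is None:
        raise ValueError("no block count found in line")
    return int(match.group(1)), int(match.group(2))


blocks, reported_kb = reported_capacity(LINE)
size_bytes = blocks * BLOCK_SIZE

print(size_bytes)          # bytes implied by the block count
print(size_bytes // 1000)  # decimal KB, for comparison with reported_kb
print(size_bytes / 2**20)  # the same size expressed in units of 2**20 bytes
```

Under the 512-byte assumption, the computed decimal KB value matches the KB figure that OpenBoot prints, which suggests the drive itself is advertising the tiny capacity.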
/var/adm/messages may report errors such as the following:
Mar 4 17:01:40 ucp02a Error for Command: read(10) Error Level: Retryable
Mar 4 17:01:40 ucp02a scsi: Requested Block: 88571646 Error Block: 88571646
Mar 4 17:01:40 ucp02a scsi: Vendor: ATA Serial Number: CVLV133400A8
Mar 4 17:01:40 ucp02a scsi: Sense Key: Unit_Attention
Mar 4 17:01:40 ucp02a scsi: ASC: 0x29 (power on, reset, or bus reset occurred), ASCQ: 0x0, FRU: 0x0
Mar 4 17:01:40 ucp02a scsi: WARNING: /scsi_vhci/disk@g5001517959654c5f (sd6):
Mar 4 17:01:40 ucp02a Error for Command: read(10) Error Level: Retryable
Mar 4 17:01:40 ucp02a scsi: Requested Block: 88626174 Error Block: 88626174
Mar 4 17:01:40 ucp02a scsi: Vendor: ATA Serial Number: CVLV133400A8
Mar 4 17:01:40 ucp02a scsi: Sense Key: Unit_Attention
Mar 4 17:01:40 ucp02a scsi: ASC: 0x29 (power on, reset, or bus reset occurred), ASCQ: 0x0, FRU: 0x0
Mar 4 17:05:56 ucp02a scsi: WARNING: /scsi_vhci/disk@g500151795966683b (sd5):
Mar 4 17:05:56 ucp02a Error for Command: write(10) Error Level: Retryable
Mar 4 17:05:56 ucp02a scsi: Requested Block: 93873751 Error Block: 93873751
Mar 4 17:05:56 ucp02a scsi: Vendor: ATA Serial Number: CVLV134300ER
Mar 4 17:05:56 ucp02a scsi: Sense Key: Aborted_Command
Mar 4 17:05:56 ucp02a scsi: ASC: 0x0 (no additional sense info), ASCQ: 0x0, FRU: 0x0
Mar 4 17:05:56 ucp02a scsi: WARNING: /scsi_vhci/disk@g5001517959654c5f (sd6):
Mar 4 17:05:56 ucp02a Error for Command: write(10) Error Level: Retryable
Mar 4 17:05:56 ucp02a scsi: Requested Block: 93874263 Error Block: 93874263
Mar 4 17:05:56 ucp02a scsi: Vendor: ATA Serial Number: CVLV133400A8
Mar 4 17:05:56 ucp02a scsi: Sense Key: Aborted_Command
Mar 4 17:05:56 ucp02a scsi: ASC: 0x0 (no additional sense info), ASCQ: 0x0, FRU: 0x0
Mar 4 17:05:56 ucp02a scsi: WARNING: /scsi_vhci/disk@g500151795966683b (sd5):
Mar 4 17:05:56 ucp02a incomplete write- retrying
Mar 4 17:07:46 ucp02a scsi: WARNING: /pci@700/pci@1/pci@0/pci@0/LSI,sas@0 (mpt_sas1):
Mar 4 17:07:46 ucp02a Disconnected command timeout for Target 9
Mar 4 17:07:46 ucp02a scsi: WARNING: /scsi_vhci/disk@g500151795966683b (sd5):
Mar 4 17:07:46 ucp02a SCSI transport failed: reason 'timeout': retrying command
Mar 4 17:07:46 ucp02a scsi: WARNING: /pci@400/pci@1/pci@0/pci@0/LSI,sas@0 (mpt_sas0):
Mar 4 17:07:46 ucp02a Disconnected command timeout for Target 9
In effect, the SSD is bricked, and the only recovery option is to replace it.
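When checking other systems for the same pattern, it can help to tally the errors in /var/adm/messages per device. The following is a rough sketch, not an Oracle tool: the regular expressions are assumptions based only on the message layout shown in the excerpt above, and SAMPLE is an abridged copy of lines from that excerpt.

```python
import re
from collections import Counter

# Abridged lines from the /var/adm/messages excerpt in this document.
SAMPLE = """\
Mar 4 17:05:56 ucp02a scsi: WARNING: /scsi_vhci/disk@g500151795966683b (sd5):
Mar 4 17:05:56 ucp02a Error for Command: write(10) Error Level: Retryable
Mar 4 17:05:56 ucp02a scsi: Sense Key: Aborted_Command
Mar 4 17:05:56 ucp02a scsi: WARNING: /scsi_vhci/disk@g5001517959654c5f (sd6):
Mar 4 17:05:56 ucp02a Error for Command: write(10) Error Level: Retryable
Mar 4 17:05:56 ucp02a scsi: Sense Key: Aborted_Command
Mar 4 17:07:46 ucp02a scsi: WARNING: /pci@700/pci@1/pci@0/pci@0/LSI,sas@0 (mpt_sas1):
Mar 4 17:07:46 ucp02a Disconnected command timeout for Target 9
"""

# Patterns assumed from the layout of the excerpt, not from a format spec.
WARNING_RE = re.compile(r"WARNING: (\S+) \((\w+)\):")
COMMAND_RE = re.compile(r"Error for Command: (\S+)")
TIMEOUT_RE = re.compile(r"Disconnected command timeout for Target (\d+)")


def summarize(text):
    """Count command errors and timeouts per driver instance (sd5, mpt_sas1, ...)."""
    counts = Counter()
    device = None
    for line in text.splitlines():
        match = WARNING_RE.search(line)
        if match:
            # Remember which instance the following detail lines belong to.
            device = match.group(2)
            continue
        if device is None:
            continue
        match = COMMAND_RE.search(line)
        if match:
            counts[(device, match.group(1))] += 1
            continue
        match = TIMEOUT_RE.search(line)
        if match:
            counts[(device, "timeout target " + match.group(1))] += 1
    return counts


for key, count in sorted(summarize(SAMPLE).items()):
    print(key, count)
```

To scan a live system, the same function could be given the contents of /var/adm/messages instead of SAMPLE.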
Cause
This is a known LSI mpt-sas issue ("Zero-length read/write commands to SATA drives cause unexpected SCSI_TASK_TERMINATED responses"). It is fixed in the phase 11 firmware for the REMs.
The Oracle bug is referenced in CR 15812121 - SUNBT7193893 LSI Erie RAID Expansion Module (REM) in T4-4 sends duplicate tags.
In short, the firmware tried to clean up an I/O when it did not need to, because the hardware was in automated mode.
Firmware version 11.05.03.00 fixes this by no longer accessing the SATA table in this automated scenario.
Solution
Upgrade the REMs to firmware version 11.05.03.00.
There is no equivalent Oracle patch available. Per Engineering, this version went through a complete CFTT test cycle last year, including on the T4-4.
The firmware can be downloaded from http://www.lsi.com/sep/Pages/oracle/sg_x_sas6-rem-z.aspx
Select 11.05.03 (firmware-110503-00.zip) and, if it is not already available on the system, the Solaris sas2flash utility (solaris-utils-14.05.00.00-11.00.00.03.zip).
Follow the firmware update steps in README.txt.
The following sample is from a T4-4 running firmware version 05.00.17.00:
# ./sas2flash -listall
LSI Corporation SAS2 Flash Utility
Version 14.05.00.00 (2012.10.25)
Copyright (c) 2008-2012 LSI Corporation. All rights reserved
Adapter Selected is a LSI SAS: SAS2008(B2)
Num Ctlr FW Ver NVDATA x86-BIOS PCI Addr
----------------------------------------------------------------------------
0 SAS2008(B2) 05.00.17.00 05.02.00.14 07.05.05.00 00:03:00:00
1 SAS2008(B2) 05.00.17.00 05.02.00.14 07.05.05.00 00:03:00:00
Finished Processing Commands Successfully.
Exiting SAS2Flash.
# ./sas2flash -c 1 -f ./fw-rem-11050300.bin
LSI Corporation SAS2 Flash Utility
Version 14.05.00.00 (2012.10.25)
Copyright (c) 2008-2012 LSI Corporation. All rights reserved
Adapter Selected is a LSI SAS: SAS2008(B2)
Executing Operation: Flash Firmware Image
Firmware Image has a Valid Checksum.
Firmware Version 11.05.03.00
Firmware Image compatible with Controller.
Valid NVDATA Image found.
NVDATA Version 0a.03.00.00
Checking for a compatible NVData image...
NVDATA Device ID and Chip Revision match verified.
NVDATA Versions Compatible.
Valid Initialization Image verified.
Valid BootLoader Image verified.
Beginning Firmware Download...
Firmware Download Successful.
Verifying Download...
Firmware Flash Successful.
Resetting Adapter...
Adapter Successfully Reset.
Finished Processing Commands Successfully.
Exiting SAS2Flash.
# ./sas2flash -listall
LSI Corporation SAS2 Flash Utility
Version 14.05.00.00 (2012.10.25)
Copyright (c) 2008-2012 LSI Corporation. All rights reserved
Adapter Selected is a LSI SAS: SAS2008(B2)
Num Ctlr FW Ver NVDATA x86-BIOS PCI Addr
----------------------------------------------------------------------------
0 SAS2008(B2) 05.00.17.00 05.02.00.14 07.05.05.00 00:03:00:00
1 SAS2008(B2) 11.05.03.00 0a.03.00.1c 07.05.05.00 00:03:00:00
# ./sas2flash -c 0 -f ./fw-rem-11050300.bin
LSI Corporation SAS2 Flash Utility
Version 14.05.00.00 (2012.10.25)
Copyright (c) 2008-2012 LSI Corporation. All rights reserved
Adapter Selected is a LSI SAS: SAS2008(B2)
Executing Operation: Flash Firmware Image
Firmware Image has a Valid Checksum.
Firmware Version 11.05.03.00
Firmware Image compatible with Controller.
Valid NVDATA Image found.
NVDATA Version 0a.03.00.00
Checking for a compatible NVData image...
NVDATA Device ID and Chip Revision match verified.
NVDATA Versions Compatible.
Valid Initialization Image verified.
Valid BootLoader Image verified.
Beginning Firmware Download...
Firmware Download Successful.
Verifying Download...
Firmware Flash Successful.
Resetting Adapter...
Adapter Successfully Reset.
Finished Processing Commands Successfully.
Exiting SAS2Flash.
# ./sas2flash -listall
LSI Corporation SAS2 Flash Utility
Version 14.05.00.00 (2012.10.25)
Copyright (c) 2008-2012 LSI Corporation. All rights reserved
Adapter Selected is a LSI SAS: SAS2008(B2)
Num Ctlr FW Ver NVDATA x86-BIOS PCI Addr
----------------------------------------------------------------------------
0 SAS2008(B2) 11.05.03.00 0a.03.00.1c 07.05.05.00 00:03:00:00
1 SAS2008(B2) 11.05.03.00 0a.03.00.1c 07.05.05.00 00:03:00:00
Finished Processing Commands Successfully.
Exiting SAS2Flash.
# reboot
May 16 15:20:42 t4-4-bur09-a reboot: rebooted by root
May 16 15:20:42 t4-4-bur09-a syslogd: going down on signal 15
syncing file systems... done
rebooting...
Resetting...
SPARC T4-4, No Keyboard
Copyright (c) 1998, 2013, Oracle and/or its affiliates. All rights reserved.
OpenBoot 4.34.3, 8192 MB memory available, Serial #97968252.
Ethernet address 0:21:28:d6:e0:7c, Host ID: 85d6e07c.
Boot device: /pci@700/pci@1/pci@0/pci@0/LSI,sas@0/disk@w5000cca0128b9b09,0:a File and args:
SunOS Release 5.10 Version Generic_147147-25 64-bit
Copyright (c) 1983, 2012, Oracle and/or its affiliates. All rights reserved.
Hostname: t4-4-bur09-a
t4-4-bur09-a console login: root
Password:
Last login: Mon May 12 14:37:56 on console
May 16 15:34:07 t4-4-bur09-a login: ROOT LOGIN /dev/console
Oracle Corporation SunOS 5.10 Generic Patch January 2005
#
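On systems with several REMs it is easy to miss a controller, as the intermediate listing in the transcript above shows (controller 1 updated, controller 0 not yet). The following is a minimal sketch that parses `sas2flash -listall` output and reports controllers not yet at the target version. The function names are illustrative, and the column positions are assumed from the listings in this document.

```python
TARGET = "11.05.03.00"

# Intermediate listing copied from the transcript above.
LISTING = """\
Num Ctlr FW Ver NVDATA x86-BIOS PCI Addr
----------------------------------------------------------------------------
0 SAS2008(B2) 05.00.17.00 05.02.00.14 07.05.05.00 00:03:00:00
1 SAS2008(B2) 11.05.03.00 0a.03.00.1c 07.05.05.00 00:03:00:00
"""


def firmware_by_controller(listing):
    """Map controller number to firmware version from -listall output."""
    versions = {}
    for line in listing.splitlines():
        fields = line.split()
        # Controller rows start with a number; assumed layout is
        # number, chip name, firmware version, then the remaining columns.
        if len(fields) >= 3 and fields[0].isdigit():
            versions[int(fields[0])] = fields[2]
    return versions


def needs_update(listing, target=TARGET):
    """Return the controller numbers whose firmware differs from target."""
    versions = firmware_by_controller(listing)
    return sorted(num for num, ver in versions.items() if ver != target)


print(needs_update(LISTING))
```

An empty list means every listed controller reports the target version, and each number returned is a value to pass to `sas2flash -c`.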