Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition
Solution Type: Problem Resolution Sure Solution
2179850.1 : Solaris Scsi Error - Sense Key: Aborted_Command - ASC: 0x44 (internal target failure), ASCQ: 0x0, FRU: 0x0
Created from <SR 3-13287745091>

Applies to:
Sun SPARC Enterprise T5220 Server - Version All Versions and later
Solaris Operating System - Version 8.0 and later
Information in this document applies to any platform.

Symptoms
This is a Solaris 10 T4-2 server with two Oracle FC HBAs connected to the SAN to access an EMC disk storage array. There are no errors on the FC HBAs, and the EMC LUNs are under MPxIO multipathing software.
From time to time, single SCSI write errors are logged against different storage LUNs:
Aug 26 00:57:57 server01 scsi: [ID 107833 kern.warning] WARNING: /scsi_vhci/ssd@g60000970000xxxxxxxxxxxx030324441 (ssd1041):
Aug 29 22:12:38 server01 scsi: [ID 107833 kern.warning] WARNING: /scsi_vhci/ssd@g60000970000xxxxxxxxxxxx030313944 (ssd1218):
Sep 2 04:24:07 server01 scsi: [ID 107833 kern.warning] WARNING: /scsi_vhci/ssd@g60000970000xxxxxxxxxxxx030324441 (ssd1041):
Sep 4 10:27:03 server01 scsi: [ID 107833 kern.warning] WARNING: /scsi_vhci/ssd@g60000970000xxxxxxxxxxxx030314138 (ssd1207):
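To see how the warnings are spread across LUNs, the WARNING header lines can be tallied per ssd instance. The following is a minimal Python sketch and is not part of the original solution: the function name `count_warnings`, the regular expression, and the inline `SAMPLE` list are assumptions based only on the message format shown above. In practice the lines would be read from the system log (typically /var/adm/messages on Solaris) instead of an inline sample.

```python
import re
from collections import Counter

# Matches the WARNING header line that precedes each ssd error report.
# The pattern is an assumption derived from the sample messages only.
WARNING_RE = re.compile(
    r"^(?P<stamp>\w{3}\s+\d{1,2} \d{2}:\d{2}:\d{2}) \S+ scsi: "
    r"\[ID \d+ kern\.warning\] WARNING: (?P<path>\S+) \((?P<inst>ssd\d+)\):"
)

def count_warnings(lines):
    """Return a Counter mapping ssd instance name to number of WARNING headers."""
    counts = Counter()
    for line in lines:
        match = WARNING_RE.match(line.strip())
        if match:
            counts[match.group("inst")] += 1
    return counts

# The four header lines quoted in the Symptoms section.
SAMPLE = [
    "Aug 26 00:57:57 server01 scsi: [ID 107833 kern.warning] WARNING: "
    "/scsi_vhci/ssd@g60000970000xxxxxxxxxxxx030324441 (ssd1041):",
    "Aug 29 22:12:38 server01 scsi: [ID 107833 kern.warning] WARNING: "
    "/scsi_vhci/ssd@g60000970000xxxxxxxxxxxx030313944 (ssd1218):",
    "Sep 2 04:24:07 server01 scsi: [ID 107833 kern.warning] WARNING: "
    "/scsi_vhci/ssd@g60000970000xxxxxxxxxxxx030324441 (ssd1041):",
    "Sep 4 10:27:03 server01 scsi: [ID 107833 kern.warning] WARNING: "
    "/scsi_vhci/ssd@g60000970000xxxxxxxxxxxx030314138 (ssd1207):",
]

if __name__ == "__main__":
    for inst, n in count_warnings(SAMPLE).most_common():
        print(inst, n)
```

A low count per instance spread over several days, as in this sample, is consistent with isolated events and not with a persistently failing path or LUN.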
All the errors are like this one:
Aug 26 00:57:57 server01 scsi: [ID 107833 kern.warning] WARNING: /scsi_vhci/ssd@g60000970000xxxxxxxxxxxx030324441 (ssd1041):
Aug 26 00:57:57 server01 Error for Command: write(10) Error Level: Retryable
Aug 26 00:57:57 server01 scsi: [ID 107833 kern.notice] Requested Block: 12160 Error Block: 12160
Aug 26 00:57:57 server01 scsi: [ID 107833 kern.notice] Vendor: EMC Serial Number: 41XXXXXXX
Aug 26 00:57:57 server01 scsi: [ID 107833 kern.notice] Sense Key: Aborted_Command
Aug 26 00:57:57 server01 scsi: [ID 107833 kern.notice] ASC: 0x44 (internal target failure), ASCQ: 0x0, FRU: 0x0

Cause
The EMC disk storage array is reporting "ASC: 0x44 (internal target failure)" for these particular LUNs at those particular moments in time.
What does this single error mean?
Applications can generate hundreds of I/Os per second to the storage disks. At "Aug 26 00:57:57" the EMC storage failed one of them: a write I/O to disk "/scsi_vhci/ssd@g60000970000xxxxxxxxxxxx030324441 (ssd1041)" (ssd instance number 1041).
Note that this was a single error; no other errors were reported at that time. That means all other I/Os (reads and writes) completed successfully, except this single I/O, which had to be retried by the ssd driver.

Solution
This is a problem external to the Solaris server.
The disk storage vendor (EMC in this case) is responsible for explaining why the storage is reporting this error and which situations can cause the storage to report it.
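When raising the question with the storage vendor, it may help to hand over the key fields of each error report in a compact form. The sketch below is an illustration only, not an Oracle or EMC tool: the function name `summarize_error`, the field names, and the patterns are assumptions derived from the message block quoted in this document.

```python
import re

# One pattern per field of interest; all are assumptions based on the
# sample ssd error report and may need adjusting for other drivers.
PATTERNS = {
    "instance": r"\((ssd\d+)\):",
    "command": r"Error for Command: (\S+)",
    "level": r"Error Level: (\S+)",
    "error_block": r"Error Block: (\d+)",
    "vendor": r"Vendor: (\S+)",
    "sense_key": r"Sense Key: (\S+)",
    "asc": r"ASC: (0x[0-9a-fA-F]+)",
    "ascq": r"ASCQ: (0x[0-9a-fA-F]+)",
}

def summarize_error(report):
    """Extract the fields above from one multi-line error report.

    Fields that are not found are returned as None.
    """
    summary = {}
    for name, pattern in PATTERNS.items():
        match = re.search(pattern, report)
        summary[name] = match.group(1) if match else None
    return summary

# The error report quoted in this document.
REPORT = """\
Aug 26 00:57:57 server01 scsi: [ID 107833 kern.warning] WARNING: /scsi_vhci/ssd@g60000970000xxxxxxxxxxxx030324441 (ssd1041):
Aug 26 00:57:57 server01 Error for Command: write(10) Error Level: Retryable
Aug 26 00:57:57 server01 scsi: [ID 107833 kern.notice] Requested Block: 12160 Error Block: 12160
Aug 26 00:57:57 server01 scsi: [ID 107833 kern.notice] Vendor: EMC Serial Number: 41XXXXXXX
Aug 26 00:57:57 server01 scsi: [ID 107833 kern.notice] Sense Key: Aborted_Command
Aug 26 00:57:57 server01 scsi: [ID 107833 kern.notice] ASC: 0x44 (internal target failure), ASCQ: 0x0, FRU: 0x0
"""

if __name__ == "__main__":
    for field, value in summarize_error(REPORT).items():
        print(f"{field}: {value}")
```

The sense key together with the ASC and ASCQ values is what identifies the condition reported by the target, so those fields, plus the timestamp and LUN, are the ones the vendor is most likely to ask for.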
RCA Example
Note: Be aware that different scenarios on the disk storage arrays may lead to this type of error. The following is an example of an RCA and solution provided by EMC:
References
<NOTE:1285485.1> - GUDS - A Script for Gathering Solaris Performance Data
<NOTE:1010680.1> - Troubleshooting Disk Performance

Attachments
This solution has no attachment