Sun Microsystems, Inc.  Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition
   Home | Current Systems | Former STK Products | EOL Systems | Components | General Info | Search | Feedback

Asset ID: 1-72-2019567.1
Update Date:2016-12-01
Keywords:

Solution Type  Problem Resolution Sure

Solution  2019567.1 :   Sun StorageTek RAID Manager : How to Handle Bad Stripe Entries On a Logical Device  


Related Items
  • Sun Storage J4200 Array
  •  
  • Sun Storage J4400 Array
  •  
  • Sun Storage 3Gb SAS RAID HBA, External
  •  
  • Sun Storage J4500 Array
  •  
Related Categories
  • PLA-Support>Sun Systems>DISK>HBA>SN-DK: SCSI SAS HBA
  •  




In this Document
Symptoms
Changes
Cause
Solution
References


Created from <SR 3-10751529721>

Applies to:

Sun Storage J4500 Array - Version All Versions to All Versions [Release All Releases]
Sun Storage J4400 Array - Version All Versions to All Versions [Release All Releases]
Sun Storage J4200 Array - Version All Versions to All Versions [Release All Releases]
Sun Storage 3Gb SAS RAID HBA, External - Version Not Applicable to Not Applicable [Release N/A]
Information in this document applies to any platform.

Symptoms

Sun StorageTek RAID Manager will report errors similar to the following due to media errors on multiple drives.

From the RAID event logs :

May 15, 2015 2:57:47 PM CEST WRN 215:A01C-S--L-- <...> One or more logical devices contain a bad stripe: controller 1.

From arcconf getconfig output :

Logical device number 2
 Logical device name                      : Device 2
 RAID level                               : 5EE
 Status of logical device                 : Optimal
 Status of RAID 5EE                       : Compacted
 Size                                     : 9533430 MB
 Stripe-unit size                         : 256 KB
 Read-cache mode                          : Enabled
 Write-cache mode                         : Enabled (write-back)
 Write-cache setting                      : Enabled (write-back) when protected by battery
 Partitioned                              : Yes
 Protected by Hot-Spare                   : No
 Bootable                                 : No
 Failed stripes                           : Yes
   <----------------

The above errors indicate bad stripes on logical device 2.

The Sun StorageTek RAID Manager GUI will display a yellow exclamation point on the logical device. Right-click on the icon and select Properties to find the error.

Changes

The Bad Stripe Table tracks stripes across a logical drive that contains invalid or incomplete data.The table is stored in an area that is reserved for configuration information on each physical disk grouped into an array that hosts one or more logical drives. There is a separate table for each logical drive.

A bad stripe is essentially a stripe-level RAID failure, although instead of taking the entire logical drive off-line, only the data within the stripe becomes unavailable.A bad stripe entry is then made to the Bad Stripe Table and becomes a part of the array and logical drive configuration.

Cause

A bad stripe entry can occur primarily due to one of the following scenarios:

  •     Encountering Medium Error on one of the source drives during a REBUILD operation.
  •     Encountering Medium Error on same LBA of multiple array members during Verify with fix operation.
  •     If an improper shutdown occurs on a CRITICAL array (one or more drives missing) while there are open writes.

In the instances when a bad stripe has occurred, the data contained within that stripe is incomplete, invalid, or inconsistent between the data and parity and a Bad Stripe Table entry is created to block that stripe to prevent hidden data corruption.

Solution

 

Replace the underlying faulty disks. Verify the data integrity and restore data if required.

This action will not clear the Bad Stripe Table entries. There is no procedure or tool for clearing or repairing bad stripes while maintaining the existing array.

In order to clear the bad stripe entries there are only two methods (both are destructive).

 

The customer is 100% responsible for backing up and restoring the data within the array as this procedure will erase the existing data.

 
Method one:

Remove or delete the existing array configuration from the physical disks associated to the array with the affected logical drive, and then create an identical new configuration, which will overwrite any previous existing configuration data.

The Bad Stripe table will be re-written and will start with zero entries.

Method two:

This is identical to method one, but has an additional step. After the existing configuration is removed from the physical disks, do a low-level format on each physical disk, and then create an identical new configuration.

This provides an additional benefit of verifying that the drives are error-free.

Once the selected operation is completed, check the Array/Logical Device health to ensure "Bad Stripes"  are no longer present,
by running the following arcconf command from the OS:

 

# /Opt/StorMan/arcconf getconfig 1

Logical device number 2
 Logical device name                      : Device 2
 RAID level                               : 5EE
 Status of logical device                 : Optimal    
 Status of RAID 5EE                       : Compacted
 Size                                     : 9533430 MB
 Stripe-unit size                         : 256 KB
 Read-cache mode                          : Enabled
 Write-cache mode                         : Enabled (write-back)
 Write-cache setting                      : Enabled (write-back) when protected by battery
 Partitioned                              : Yes
 Protected by Hot-Spare                   : No
 Bootable                                 : No
 Failed stripes                           : No 
         <---------------


 

 

 

References

<NOTE:1312847.1> - Oracle Explorer Data Collector Resource Center
<NOTE:1296274.1> - How to Download Common Array Manager (CAM) Software and Patches

Attachments
This solution has no attachment
  Copyright © 2018 Oracle, Inc.  All rights reserved.
 Feedback