Sun Microsystems, Inc.  Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition
   Home | Current Systems | Former STK Products | EOL Systems | Components | General Info | Search | Feedback

Asset ID: 1-71-1922204.1
Update Date:2018-01-08
Keywords:

Solution Type  Technical Instruction Sure

Solution  1922204.1 :   Pillar Axiom: How to recover/clean Bad Blocks reported by An Axiom Event "Volume Bad Blocks"  


Related Items
  • Pillar Axiom 600 Storage System
  •  
Related Categories
  • PLA-Support>Sun Systems>DISK>Axiom>SN-DK: Ax600
  •  




Created from <SR 3-9479434471>

Applies to:

Pillar Axiom 600 Storage System - Version All Versions to All Versions [Release All Releases]
Information in this document applies to any platform.

Goal

 This document explains meaning of the "Volume Bad Blocks" error and recovery options.

Solution

 Axiom can report an alert  "Volume Bad Blocks" about a volume if it detects and bad block on the volume during a migration to different storage domain via Axiom tools, "Clone LUN", or "Copy LUN" operation.

An example of the Alert here and there is no proposed action to clear or recover this bad block: 

Alert: Volume Bad Blocks
Time Occurred: Tue Aug 26 14:57:41 EEST 2014
Affected Object: copy_oz_test_with_bbl

A LUN, filesystem, or clone has bad blocks.

 

The source of the error should be a bad block recorded at brick (RAID) level during or before migration.  Once RAID has a bad block for the storage under a LUN, if you copy or migrate the data in a LUN using the Axiom, the Axiom must make sure that no host would be given the false impression that there is nothing wrong with the data on the LUN.   In order to do this, as you copy/migrate, the Axiom takes the RAID bad block entry and converts it to an Array Manager Bad Block entry at higher level for that exact same Logical Block Array (LBA).  Host would still see a Media Error on a read, but on a write, the Array Manager bad block entry would be cleared. Both bad block records and their details are not available for end user and can readable only by Oracle TSC. The only way to get rid of those bad blocks is to write that LBA from a host, with correct good data.


To recovery:
1.  Turn off all new snapshots for the LUN reporting bad blocks via setting the respository space to 0 at  "Data Protection" screen on GUI.

2.   Do not use any Axiom based utility to move the data from the source to target LUN/bricks.

3.  Create new LUNs with same size and QoS settings and use ONLY host based utilities to take data from the source and put it on the target LUN/bricks.

4.  Use host based data integrity utilities to check and recover the data on the LUNs on the target LUN or bricks to the best of their ability.  Reported Bad blocks may not be in the user area and host based tools may not report any error against them. This way the bad block will remain in source LUN/bricks and will not be propagated to target LUN/bricks.

5.  For any bad block on the newly created LUN/bricks where a Media Error is returned, only a host based write of data will clear that condition.  That means host based utilities did not help and only restore from a clean/good backup will clear bad blocks.


References

<BUG:19445541> - VOLUME BAD BLOCKS

Attachments
This solution has no attachment
  Copyright © 2018 Oracle, Inc.  All rights reserved.
 Feedback