Sun Microsystems, Inc.  Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition
   Home | Current Systems | Former STK Products | EOL Systems | Components | General Info | Search | Feedback

Asset ID: 1-72-2159460.1
Update Date:2017-09-26
Keywords:

Solution Type  Problem Resolution Sure

Solution  2159460.1 :   SL8500 - FRS 8.51 Upgrade Causes Partition Corruption and Loss Of ACSLS Database Tape and Panel Information  


Related Items
  • Sun StorageTek Auto Cartridge Sys Lib SW (ACSLS)
  •  
  • Sun StorageTek SL8500 Modular Library System
  •  
Related Categories
  • PLA-Support>Sun Systems>TAPE>Tape Hardware>SN-TP: SL3000-8500 Library
  •  




In this Document
Symptoms
Changes
Cause
Solution
References


Created from <SR 3-12979243981>

Applies to:

Sun StorageTek SL8500 Modular Library System - Version All Versions to All Versions [Release All Releases]
Sun StorageTek Auto Cartridge Sys Lib SW (ACSLS) - Version 8.3 and later
Information in this document applies to any platform.

SL8500 FRS 8.51
ACSLS 8.3

Symptoms

Library will not come online after firmware upgrade to 8.51. When attempt to bring online was made the library type was seen to change from 8500 to 8509. When a config was run, most of the panels and all of the tapes disappeared.

The customer attempted to start Acsls and received the following message :-

Lh error: Cm_4400::ProcessAck: ACS 0 configured to ACSLS with partition ID =
0, but the library is NOT partitioned.

The customer tried to vary the LSMs online and got the messages :-
2016-07-07 13:06:58 VARY[361]:
2563 E va_lsm_stat.c 1 2212
va_lsm_state:st_lsm_check: LSM 0, 3 type changed from 8500 to 8509; LSM
remains offline.

2016-07-07 13:06:58 VARY[361]:
2564 I va_lsm_stat.c 1 2215
va_lsm_state:st_lsm_check: Please update your ACSLS library configuration
using Dynamic config or acsss_config.

2016-07-07 13:06:58 ACSSA[361]:
1433 N sa_demux.c 1 296
LSM 0, 3: Offline.

A display of the LSM's shows :-
Library error, LC/LMU failure

ACSSA> display lsm *
2016-07-07 09:05:24 Display LSM
Acs Lsm Status State Free_cells
0 0 normal offline 1082
0 1 normal offline 1860
0 2 normal offline 1379
0 3 normal offline 1020

Under advice from Oracle the customer ran 'config acs 0' which resulted in ACSLS corruption and lost data in ACSLS DB and a reduction in the number of tapes to zero. Note, number of panels reduced to 3 (from 53)...

bash-3.2$ config acs 0
2016-07-07 19:47:35

Obtaining Configuration details, please wait
2016-07-07 19:50:50

Configuration Details for ACS # 0

Partition ID Not partitioned
Number of LSMs 4 (old) 4 (new)
Number of Ports 1
Port Name bkprbt01

LSM Details
OLD NEW Change
--- --- ------
Lsm # 0,0 :
---------
Lsm Type 8500 8509 Y
Number of CAPS 0 0 N
Number of PTPS 3 3 N
Number of Panels 51 2 Y
Number of Drives 16 16 N
Number of Available cells 2522 0 Y

Lsm # 0,1 :
---------
Lsm Type 8500 8509 Y
Number of CAPS 2 2 N
Number of PTPS 3 3 N
Number of Panels 51 2 Y
Number of Drives 16 16 N
Number of Available cells 2522 0 Y

Lsm # 0,2 :
---------
Lsm Type 8500 8509 Y
Number of CAPS 0 0 N
Number of PTPS 3 3 N
Number of Panels 51 2 Y
Number of Drives 16 16 N
Number of Available cells 2522 0 Y

Lsm # 0,3 :
---------
Lsm Type 8500 8509 Y
Number of CAPS 0 0 N
Number of PTPS 3 3 N
Number of Panels 51 2 Y
Number of Drives 16 16 N
Number of Available cells 2522 0 Y

ACSSA> display cap * -f status state desired_state condition priority mode size availability

2015-09-09 09:53:15 Display CAP
Acs Lsm Cap Status State Desired_state Condition Priority Mode Size Availability
0 1 0 available online online operative 0 automatic 39 lib_not_partn
0 1 1 available offline online inoperative 0 manual 39 not_installed

In addition, it was not possible to load a tape to a cell or enter any tapes as the library was seen as 'full' (there was one tape on a drive that was attempted to be dismounted in this example).

Audit: Audit of ACS 0, Valid.
ACSSA> dismount A96254 0,3,1,10 force
Dismount: Dismount failed, ACS 0 full.

 

Changes

 Upgrade to 8.51 firmware

Cause

Prior to version 5 , previous versions of the SL8500 Release Notes for the FRS 8.51 code omitted to add the following step to the upgrade process....

After upgrade to FRS_8.51:
Code upgrade issues have been identified with FRS_8.51, non-partitioned libraries only, requiring extra steps after the upgrade is complete. Once the following procedure(s) have been performed, the library will be ready for customer use.
Standalone Library
1) Using SLC go to the Select Active Cells screen.
2) Press the "Apply Default Capacity" button.
3) Verify:
 SLC -> Library -> General Properties Active Cell count reflects the expected capacity of the library.
 SLC -> Library -> General Status Access State is online.
If not, "Bring Online" via the "-select-transition" to bring the library online.
Complex
1) Login to Cli on one library in the complex with the service id and password.
2) Execute the following Cli command:
partition setNonPartitionedHli
NOTE: This command will propagate the configuration change to all libraries in the complex.
3) Verify:
 SLC -> Library -> General Properties Active Cell count reflects the expected capacity of the library.
 SLC -> Library -> General Status Access State is online.
If not, "Bring Online" via the "-select-transition" to bring the library online.

Since this step was not run the partition information in the library was corrupted. The following message was seen in the library snapshot ....

2016-07-07T08:55:28.667, 1.0.0.0.0, root, default,getPartition, 16801, error, 1202, "Invalid parameter value", parameter= Currently partition configuration is in invalid state, cannot change partition state when the whole partition configuration state is in invalid state

The cause was failure to run the additional step mentioned in the FRS 8.51 v5 Release Notes due to using older release notes.  

Solution

The customer was advised to run the 'After upgrade to FRS_8.51' step from the v5 Release Notes. 

The library was rebooted and a full physical hardware audit run.

The customer re-ran the 'config acs 0' again which changed the library type back to '8500' and re-recognized the full library complex

The customer had to run a new Acsls Audit which found all the tapes.

References

<NOTE:1487764.1> - SL8500 - Library Not Online After ACSLS Restart
<NOTE:1360720.1> - ACSLS - re_vary_lsm: Vary failed, message STATUS_NOT READY

Attachments
This solution has no attachment
  Copyright © 2018 Oracle, Inc.  All rights reserved.
 Feedback