Asset ID: |
1-75-1392919.1 |
Update Date: | 2017-07-17 |
Keywords: | |
Solution Type
Troubleshooting Sure
Solution
1392919.1
:
Troubleshooting Sun Storage[TM] Array non-SMART Battery Faults
Related Items |
- Sun Storage Flexline 240 Array
- Sun Storage Flexline 380 Array
- Sun Storage Flexline 280 Array
- Sun Storage 6540 Array
- Sun Storage 6130 Array
- Sun Storage Flexline 210 Array
- Sun Storage 6140 Array
|
Related Categories |
- PLA-Support>Sun Systems>DISK>Arrays>SN-DK: 6130
|
In this Document
Applies to:
Sun Storage Flexline 210 Array - Version Not Applicable to Not Applicable [Release N/A]
Sun Storage Flexline 240 Array - Version Not Applicable to Not Applicable [Release N/A]
Sun Storage Flexline 280 Array - Version Not Applicable to Not Applicable [Release N/A]
Sun Storage 6540 Array - Version Not Applicable to Not Applicable [Release N/A]
Sun Storage 6130 Array - Version Not Applicable to Not Applicable [Release N/A]
Information in this document applies to any platform.
Purpose
The purpose of this document is to help users identify common problems with batteries in the Sun Storage Flexline 210, 240, 280, 380, StorageTek 6130, StorageTek 6140 or StorageTek 6540 arrays. These batteries provide power to the controller's data cache in the event of a power outage. If you have a StorageTek 2500, Sun Storage 2500-M2, 6x80 array, please refer to <Document 1021054.1> Troubleshooting Sun Storage[TM] Array SMART Battery Faults.
Symptoms Include
The following table contains the most common faults seen by Common Array Manager (CAM) or SANtricity:
Grid ID | CAM Critical Fault | SANtricity Critical Fault |
xx.66.1005 |
Battery Near Expiration |
BATTERY_NEAR_EXPIRATION |
xx.66.1006 |
Critical Fault for a Cache Backup Battery has Failed |
FAILED_BATTERY |
xx.66.1039 |
Controller Cache Battery Near Expiration |
NON_FRU_BATTERY_NEAR_EXPIRATION INTEGRATED_BATTERY_NEAR_EXPIRATION |
xx.66.1040 |
A Controller Cache Backup Battery has Failed |
NON_FRU_FAILED_BATTERY or FAILED_INTEGRATED_BATTERY |
xx.66.1091 |
Battery Tray.xx.Battery.xx has transitioned to an Unknown State |
BATTERY_UNKNOWN_STATE |
xx.66.1101 |
There has been a failure in the ICC Battery pack |
FAILED_BATTERY_SYSTEM |
xx.66.1254 |
Critical Fault for Battery has expired |
EXPIRED_BATTERY |
xx.66.1255 |
Critical Fault for Battery has expired |
EXPIRED_INTEGRATED_BATTERY |
Other possible conditions include:
- You just replaced the battery, but it still shows failed.
- Amber LED lit on battery.
- Amber LED lit on array.
Batteries are monitored by two methods, an Expiration Timer and SMART (Self-Monitoring, Analysis and Reporting Technology) battery technology. Within the scope of this document only the Expiration Timer is used.
The Expiration Timer is a simple counter whereas the newer SMART battery technology internally tests the ability of the batteries to hold a charge. Both are used to determine battery replacement but the Expiration Timer is also susceptible to outside conditions which can lead to reports of premature failures.
Troubleshooting Steps
- Verify the Array Type, Firmware Version and Fault.
Since the steps to resolve battery issues will differ based on the Hardware and Firmware involved, it is necessary to gather this information in order to determine the proper troubleshooting steps.
- To determine the Array type, see <Document 1021066.1> Verify Sun Storage [TM] Array Array type via the User Interface.
- To determine the Firmware version, see <Document 1021067.1> Verify Storage[TM] Array Firmware via the user Interface.
- To determine the faults, see <Document 1021057.1> Sun Storage Common Array Manager (CAM): How to Verify Critical Faults for Sun Storage 2500, 2500-M2, 6000 and J4000 Arrays.
If there are no faults, go to step 5.
The following table lists the most common faults associated with batteries. If you have an array with redundant batteries and both batteries have a fault, each fault should be evaluated on it's own. Sometimes a single remedy will fix multiple faults. If you have a single battery with multiple faults, go to step 6, contact Oracle support.
Critical Fault | Array Type | Firmware Version | Remedy |
Battery Near Expiration |
6130/FLXxx0/6x40 |
Any |
Go to step 2. |
Battery Expired |
6130/FLXxx0/6x40 |
Any |
Go to step 2. |
Over Temperature |
6130/FLXxx0/6x40 |
Any |
Go to step 6. |
Replacement Required |
6130/FLXxx0/6x40 |
Any |
Go to step 6. |
Battery Failed |
6130/FLXxx0/6x40 |
Any |
Go to step 2. |
Unknown |
6130/FLXxx0/6x40 |
Any |
Go to step 6. |
- Confirm Array System Time is correct.
Batteries that have the Expiration Timer active are subject to premature failures if the array system time gets improperly set. Typically this is the result of a rogue NTP server. Use <Document 1021108.1> Verifying and Setting Sun Storage[TM] Array System Time, to verify the array system time. If the system time is incorrect, search the majorEventLog.txt file (from supportdata bundle) to see if a rogue NTP server is the cause.
Example:
# grep NTP majorEventLog.txt
Description: Controller clocks set via NTP or SNTP
Description: Controller clocks set via NTP or SNTP
#
If you find any instances of the above, you can reset the array system time but the problem is likely to return unless the rogue NTP server is addressed. Isolating the array(s) and management host on their own private network is the best way to avoid a rogue NTP server. Reset the array system time and wait 5 minutes.
- If the critical fault clears, no further action is needed.
- If the critical fault remains and you recently replaced the battery, go to step 3.
- If the critical fault remains, and the array system time is correct, and the battery has not been recently replaced, it is likely time to replace the battery. Go to step 6.
- Reset the Battery Age.
Go to <Document 1021695.1> How to Reset the Cache Backup Battery Age for Sun Storage 2500, 2500-M2, and 6000 Arrays. If after resetting the battery age the problem is still unresolved, it may be necessary to reseat the battery to clear the failed state. Proceed to step 4.
- Reseating the Battery.
The 6x40 and FLXx80 arrays have batteries that are externally accessible and this can be done without any impact to data access. Flexline 210, 240 and 6130 arrays will require a controller reboot which means a temporary loss of a data path. Users may want to schedule an outage for this but it is not a requirement.
To reseat the battery follow the service steps for battery replacement to remove and reinsert the battery.
- For CAM users see Service Advisor
- For SANtricity users see Recovery Guru
If you have to reboot the controller
- For CAM users select the array from the Storage System Summary page, then expand the Physical Devices and select Controllers in the Navigation Tree. Scroll down to the proper controller and select Reset Controller. Confirm to execute the reboot.
- For SANtricity users select the Logical/Physical View tab in the Array Management Window and highlight/select the controller in the Physical window. From the menu bar, select Advanced then Recovery, then Reset and finally Controller. Confirm to execute the reboot.
If the problem still is not resolved, go to step 6.
- Battery in an Unknown State.
Go to <Document 1283914.1> Sun Storage Common Array Manager (CAM) Reports "Battery has transitioned to an unknown state" for further troubleshooting.
- Contact Oracle Support.
Collect a supportdata:
- For CAM users, reference <Document 1002514.1> Collection Support Data for Arrays Using Sun StorageTek[TM] Common Array Manager.
- For SANtricity users, reference <Document 1014074.1> Collecting Support Data for Arrays Using Sun StorageTek[TM] SANtricity Storage Manager.
Log a Service Request
Do you still have questions? You can use My Oracle Support Communities. Communities put you in touch with industry professionals like yourself. They are monitored by Oracle support engineers, so you can expect reliable and correct answers. Ask questions and see what others are asking about in the
Disk Storage 2000, 3000, 6000 RAID Arrays & JBODs Community.
References
<NOTE:1021054.1> - Troubleshooting Sun Storage Array SMART Battery Faults
<NOTE:1021057.1> - Sun Storage Common Array Manager (CAM): How to Verify Critical Faults for Sun Storage 2500, 2500-M2, 6000 and J4000 Arrays
<NOTE:1021066.1> - Verify Sun Storage[TM] Array Array Type via the User Interface
<NOTE:1021067.1> - How to Verify Sun Storage[TM] Array Firmware Using Sun Storage Common Array Manager or SANtricity
<NOTE:1021108.1> - Verifying and Setting Sun Storage Array System Time
<NOTE:1021695.1> - How to Reset the Cache Backup Battery Age for Sun Storage 2500, 2500-M2, and 6000 Arrays
<NOTE:1283914.1> - Sun Storage Common Array Manager (CAM) Reports "Battery has transitioned to an unknown state"
<NOTE:1002514.1> - Collecting Sun Storage Common Array Manager Support Data for Arrays
Attachments
This solution has no attachment