Sun Microsystems, Inc.  Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition
   Home | Current Systems | Former STK Products | EOL Systems | Components | General Info | Search | Feedback

Asset ID: 1-71-1452873.1
Update Date:2017-05-22
Keywords:

Solution Type  Technical Instruction Sure

Solution  1452873.1 :   How to Recognize and Diagnose Service Processor Battery Failure on Sun Fire/Netra T1000/T2000 & Sun Blade T6300/T6320/T6340  


Related Items
  • Sun Fire T2000 Server
  •  
  • Sun Blade T6300 Server Module
  •  
  • Sun Netra T6340 Server Module
  •  
  • Sun Netra T2000 Server
  •  
  • Sun Blade T6320 Server Module
  •  
  • Sun SPARC Enterprise T2000 Server
  •  
  • Sun Blade T6340 Server Module
  •  
Related Categories
  • PLA-Support>Sun Systems>SPARC>CMT>SN-SPARC: Tx000
  •  


The following document provides instructions on recognizing and properly diagnosing a Sun Fire T2000 or Sun Blade T63XX Service Processor Battery Failure

Applies to:

Sun Blade T6320 Server Module - Version Not Applicable to Not Applicable [Release N/A]
Sun SPARC Enterprise T2000 Server - Version Not Applicable to Not Applicable [Release N/A]
Sun Blade T6340 Server Module - Version Not Applicable to Not Applicable [Release N/A]
Sun Netra T6340 Server Module - Version Not Applicable to Not Applicable [Release N/A]
Sun Fire T2000 Server - Version Not Applicable to Not Applicable [Release N/A]
Information in this document applies to any platform.

Goal

 The following document provides instructions on recognizing and properly diagnosing a Sun Fire T2000 Service Processor Battery Failure

Solution

Troubleshooting Details:

Symptoms

The following error messages may be seen on the System Controller (SP),  or Advanced Lights Out Manager (ALOM) console logs:

/var/adm/messages or showlogs -v:

APR 02 13:34:22: 00040068: "BATTERY at SC/BAT/V_BAT has exceeded low warning threshold."



Also, you may show faults reported on the SP via ALOM command "showfaults" and will see the faulted reported similar to:

sc> showfaults -v
Last POST run: TUE MAY 22 14:26:07 2007
POST status: Passed all devices

 ID Time              FRU               Fault
2627 FEB 03 21:48:56   SC/BAT            BATTERY at SC/BAT/V_BAT has exceeded low warning threshold.



When connected to ALOM on the Tx000 machine, running the "showenvironment" command at the sc prompt will provide you with greater detail of the environmental statuses of the basic system components installed in the machine such as:
System Temperatures, System Indicator Status, Fans Status, Voltage sensors (in Volts), etc., and will appear similar to the following:

sc> showenvironment


=============== Environmental Status ===============


--------------------------------------------------------------------------------
System Temperatures (Temperatures in Celsius):
--------------------------------------------------------------------------------
Sensor           Status  Temp LowHard LowSoft LowWarn HighWarn HighSoft HighHard
--------------------------------------------------------------------------------
PDB/T_AMB        OK        25    -10      -5       0      45       50       55
MB/T_AMB         OK        24    -10      -5       0      50       55       60
MB/CMP0/T_TCORE  OK        42    -10      -5       0      85       90       95
MB/CMP0/T_BCORE  OK        41    -10      -5       0      85       90       95
IOBD/IOB/TCORE   OK        39    -10      -5       0      95      100      105
IOBD/T_AMB       OK        29    -10      -5       0      52       57       62

--------------------------------------------------------
System Indicator Status:
--------------------------------------------------------
SYS/LOCATE           SYS/SERVICE          SYS/ACT             
OFF                  ON                   ON                  
--------------------------------------------------------
SYS/REAR_FAULT       SYS/TEMP_FAULT       SYS/TOP_FAN_FAULT   
ON                   OFF                  OFF                 

~snip~


--------------------------------------------------------------------------------
Voltage sensors (in Volts):
--------------------------------------------------------------------------------
Sensor          Status      Voltage LowSoft LowWarn HighWarn HighSoft
--------------------------------------------------------------------------------
MB/V_+1V5       OK            1.48    1.36    1.39    1.60     1.63
MB/V_VMEML      OK            1.79    1.63    1.67    1.92     1.98
MB/V_VMEMR      OK            1.79    1.63    1.67    1.92     1.98
MB/V_VTTL       OK            0.89    0.81    0.83    0.96     0.99
MB/V_VTTR       OK            0.87    0.81    0.83    0.96     0.99
MB/V_+3V3STBY   OK            3.34    3.13    3.16    3.53     3.59
MB/V_VCORE      OK            1.31    1.20    1.24    1.36     1.39
IOBD/V_+1V5     OK            1.48    1.36    1.39    1.60     1.63
IOBD/V_+1V8     OK            1.79    1.63    1.67    1.92     1.96
IOBD/V_+3V3MAIN OK            3.34    3.06    3.10    3.49     3.53
IOBD/V_+3V3STBY OK            3.36    3.13    3.16    3.53     3.59
IOBD/V_+1V      OK            1.18    1.09    1.11    1.28     1.30
IOBD/V_+1V2     OK            1.16    1.09    1.11    1.28     1.30
IOBD/V_+5V      OK            5.12    4.55    4.75    5.35     5.45
IOBD/V_-12V     OK          -12.11  -13.08  -12.84  -11.16   -10.92
IOBD/V_+12V     OK           12.00   10.92   11.16   12.84    13.08
SC/BAT/V_BAT    WARNING       0.57      --    2.25      --       --        <--- This is the voltage sensor reporting the current voltage status of the Service Processor Battery



As you will notice above under "Voltage sensors (in Volts):" section, for the service processor battery, we see the following key indicators of a battery failure:

Sensor          Status      Voltage LowSoft LowWarn HighWarn HighSoft
SC/BAT/V_BAT    WARNING       0.57      --    2.25      --       --

1- The current voltage of the SP battery is currently low (at only .57 Volts),
2- the "LowWarn" warning voltage is set to 2.25 (Volts), and when the voltage of the battery falls below the 2.25V threshold, it will generate a warning message which will be logged in the console logs and appear similar to the following:

APR 02 13:34:22: 00040068: "BATTERY at SC/BAT/V_BAT has exceeded low warning threshold."



Also, we will find the fault reported on the SP via ALOM command "showfaults" and will see the fault reported similar to:

sc> showfaults -v
Last POST run: TUE MAY 22 14:26:07 2007
POST status: Passed all devices

 ID Time              FRU               Fault
2627 FEB 03 21:48:56   SC/BAT            BATTERY at SC/BAT/V_BAT has exceeded low warning threshold.


Also, to verify the system firmware level:

ALOM:

sc> showhost
Sun-Fire-T2000 System Firmware 6.7.12  2011/07/06 20:03

Host flash versions:
   OBP 4.30.4.d 2011/07/06 14:29
   Hypervisor 1.7.3.c 2010/07/09 15:14
   POST 4.30.4.b 2010/07/09 14:24

ILOM: ( T6320 )

-> version
SP firmware 3.0.12.4.u
SP firmware build number: 69647
SP firmware date: Sat Nov 19 07:22:35 PST 2011
SP filesystem version: 0.1.22

 

 


Cause


The root cause of this issue is dropping voltage of the battery over time, per normal usage. The battery is located on the System Controller (SC), or System Processor (SP)

NOTE (1): A known issue has been identified where good SC Batteries on the T2000 machines were being marked "failed" prematurely. This issue is attributed to voltage thresholds used by the ALOM software for determining battery status. The issue was seen on systems with firmware releases prior to 6.2.4. Before ultimately replacing the SC battery on the T2000, please ensure proper FW is applied, in order to avoid an unnecessary outage for a needless battery replacement. Please use the following ALOM command to check the currently installed firmware on your machine: sc> showhost (more details are also available in the following knowledge article: How to Determine the System Firmware Version on T1000/T2000 (Doc ID 1012949.1))

NOTE (2): The latest and most recommended Firmware Release available is 6.7.13. The patch containing this firmware is labeled as 139434-10 FIRMWARE: Sun Fire T2000 - Sun System Firmware Update 6.7.13
and can be found in the System Handbook under the Patch Readmes and Downloads link. See the following for the available ways of "Accessing the Oracle System Handbook on My Oracle Support (Doc ID 1227213.1)."

Additional References
Frequently Asked Questions About Sun System Firmware (Doc ID 1021792.1)
How to Download Firmware for Oracle Systems and Storage (Doc ID 1342226.1)
Selecting the firmware upgrade method on T1000/T2000 (Doc ID 1011226.1)
How to Update the Firmware from the OS on T1000/T2000 (Doc ID 1501504.1)



Solution


Replace the failing Service Processor Battery as soon as available.

Please Log an Oracle Service Request*** for this issue to request a new battery to be shipped for the replacement of the failing SP Battery of the server. When opening the SR, you should have ready, or be prepared, to provide (at a minimum) the following ALOM commands output, in order to quickly have this matter resolved:

For ALOM based units:

  sc>  showhost
  sc>  showenvironment
  sc>  showlogs -v
  sc>  showfaults -v

For T63x0 units not using ALOM compatibility mode, these are the equivalent ILOM commands:(Note: ILOM data appears in a different format )

-> show /HOST
-> show -o table -level all /SYS
-> show /SP/logs/event/list
-> show faulty
-> version

*** Note: for greatest ease and fastest turn-around for this issue, when creating a Service Request(SR) via My Oracle Support Portal, please have a new/fresh Explorer file available for upload during SR creation. Please be sure to select "PSU, Fan, Battery failure" and then proceed to upload the ready Explorer file. ***

Upon receipt of the new SP Battery, please see the following document for detailed instructions on replacing the SP Battery:

How to Remove and Replace a Sun Fire T2000 Service Processor Battery:ATR:1180:1 [VIDEO] (Doc ID 1308278.1)


Attachments
This solution has no attachment
  Copyright © 2018 Oracle, Inc.  All rights reserved.
 Feedback