Sun Microsystems, Inc.  Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition

Asset ID: 1-77-1019824.1
Update Date: 2015-10-13
Keywords:

Solution Type  Sun Alert Sure

Solution  1019824.1 :   Sun Fire V215 and V245 Servers may Experience an Erroneous "Overtemp" Alarm Causing the System to Power Off  


Related Items
  • Sun Fire V245 Server
  • Sun Fire V215 Server
Related Categories
  • PLA-Support>Sun Systems>Sun_Other>Sun Collections>SN-OTH: Sun Alert
  • _Old GCS Categories>Sun Microsystems>Sun Alert>Release Phase>Resolved

PreviouslyPublishedAs
246946


Bug Id
<BUG:15457562>

Product
Sun Fire V215 Server
Sun Fire V245 Server

Date of Resolved Release
05-Dec-2008

***Checked for relevance 02-Jan-2014***

1. Impact

Sun Fire V215 and V245 servers may experience an erroneous, transient "overtemp" alarm that abruptly powers off the system without a graceful shutdown of Solaris. When this occurs, the "OVERTEMP LED" is lit (amber) even though no genuine overtemperature condition exists.

2. Contributing Factors

This issue can occur on the following platforms:
  • Sun Fire V215 Server without patch 139735-01
  • Sun Fire V245 Server without patch 139735-01
Note: This issue can occur on the systems listed above irrespective of the OS/ALOM versions in use.

3. Symptoms

If the described issue occurs, messages similar to the following will be seen in the ALOM event log ("showlogs" command):
    Feb 02 10:46:16 : 00040029: "Host system has shut down."
    Feb 02 10:46:56 : 00070002: "Indicator PS0.DC_OK is now OFF"
    Feb 02 10:46:56 : 00070002: "Indicator PS1.DC_OK is now OFF"
In addition, the "OVERTEMP LED" on the front of the system chassis will be lit (amber). This LED status can also be confirmed with the ALOM "showenvironment" command. Meanwhile, all the sensors have an "OK" status.
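
The event-log signature above can be checked mechanically. The following is a minimal Python sketch, assuming the "showlogs" output has been saved as text and follows the line layout of the sample shown. The function names parse_showlogs and looks_like_abrupt_power_off are illustrative only and are not part of ALOM. A match indicates the shutdown pattern, not its cause, so the "SYS.OVERTEMP" indicator and sensor readings still need to be checked.

```python
import re

# Line layout taken from the sample event log in this document, e.g.
#   Feb 02 10:46:16 : 00040029: "Host system has shut down."
# The closing quote is optional so that truncated lines still parse.
LOG_LINE = re.compile(
    r'^(?P<stamp>\w{3} \d{2} \d{2}:\d{2}:\d{2}) : '
    r'(?P<code>[0-9a-fA-F]{8}): "(?P<msg>[^"]*)"?\s*$'
)


def parse_showlogs(text):
    """Return (timestamp, code, message) tuples for lines that match."""
    events = []
    for raw in text.splitlines():
        m = LOG_LINE.match(raw.strip())
        if m:
            events.append((m["stamp"], m["code"], m["msg"]))
    return events


def looks_like_abrupt_power_off(events):
    """True if a host shutdown is followed by PS0 and PS1 DC_OK going OFF."""
    shut_down_seen = False
    psus_off = set()
    for _stamp, _code, msg in events:
        if msg == "Host system has shut down.":
            shut_down_seen = True
            psus_off.clear()  # only count PSU events after the latest shutdown
        elif shut_down_seen:
            m = re.fullmatch(r"Indicator (PS\d)\.DC_OK is now OFF", msg)
            if m:
                psus_off.add(m.group(1))
    return shut_down_seen and {"PS0", "PS1"} <= psus_off
```

Feeding the three sample lines to parse_showlogs and passing the result to looks_like_abrupt_power_off reports the pattern as present.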

Output similar to the following will be displayed, with the "SYS.OVERTEMP" indicator reported as ON:
    sc> showenvironment
    =============== Environmental Status ===============
    --------------------------------------------------------------------------------
    System Temperatures (Temperatures in Celsius):
    --------------------------------------------------------------------------------
    Sensor         Status    Temp LowHard LowSoft LowWarn HighWarn HighSoft HighHard
    --------------------------------------------------------------------------------
    MB.P0.T_CORE    OK         67    -15     -10       0     100      105      110
    MB.T_REMOTE     OK         28     --      --      --      --       --       --
    MB.T_1064       OK         60    -15     -10       0     105      110      115
    MB.T_FIRE       OK         31    -15     -10       0      95      105      108
    MB.T_AMB        OK         33    -15     -10       0      65       75       85
    FIOB.T_AMB      OK         19    -15     -10       0      45       47       50
    PDB.T_DISK      OK         30    -15     -10       0      55       65       70
    PDB.T_PS0       OK         25    -15     -10       0      48       50       53
    PDB.T_PS1       OK         27    -15     -10       0      48       50       53
    --------------------------------------
    Keyswitch:
    --------------------------------------
    Keyswitch position: NORMAL
    --------------------------------------------------------
    System Indicator Status:
    --------------------------------------------------------
    SYS.LOCATE           SYS.SERVICE          SYS.ACT
    --------------------------------------------------------
    OFF                  OFF                  ON
    --------------------------------------------------------
    SYS.PSFAIL           SYS.OVERTEMP         SYS.FANFAIL
    --------------------------------------------------------
    OFF                  ON                   OFF
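
The distinguishing symptom is the combination of "SYS.OVERTEMP" being ON while every temperature sensor reports "OK" below its HighWarn threshold. The following is a minimal Python sketch of that check, assuming "showenvironment" output captured as text in the column layout shown above. The function names are illustrative, and the sketch assumes indicator states are printed only as ON or OFF.

```python
def parse_temperatures(text):
    """Map sensor name -> (status, temp, high_warn); high_warn is None for '--'."""
    sensors = {}
    for line in text.splitlines():
        parts = line.split()
        # Sensor rows have 9 columns: name, status, temp, then six thresholds
        # (LowHard, LowSoft, LowWarn, HighWarn, HighSoft, HighHard).
        if len(parts) == 9 and "." in parts[0] and parts[2].lstrip("-").isdigit():
            high_warn = None if parts[6] == "--" else int(parts[6])
            sensors[parts[0]] = (parts[1], int(parts[2]), high_warn)
    return sensors


def parse_indicators(text):
    """Map indicator name -> state from each name row and its state row."""
    indicators = {}
    names = None
    for line in text.splitlines():
        parts = line.split()
        if parts and all(p.startswith("SYS.") for p in parts):
            names = parts  # remember the names until the state row arrives
        elif (names and len(parts) == len(names)
              and all(p in ("ON", "OFF") for p in parts)):
            indicators.update(zip(names, parts))
            names = None
    return indicators


def erroneous_overtemp(text):
    """True if SYS.OVERTEMP is ON while all sensors are OK and below HighWarn."""
    sensors = parse_temperatures(text)
    indicators = parse_indicators(text)
    all_ok = bool(sensors) and all(
        status == "OK" and (high_warn is None or temp < high_warn)
        for status, temp, high_warn in sensors.values()
    )
    return indicators.get("SYS.OVERTEMP") == "ON" and all_ok
```

Applied to the sample output above, erroneous_overtemp returns True, because "SYS.OVERTEMP" is ON while all nine sensors are within limits.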

4. Workaround

There is no workaround for this issue.  Please see the Resolution section below.

5. Resolution

This issue is addressed on the following platforms:
  • Sun Fire V215 Server with patch 139735-01 or later
  • Sun Fire V245 Server with patch 139735-01 or later
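
The "139735-01 or later" condition can be expressed as a revision comparison. The following is a minimal Python sketch, assuming patch IDs use the "base-revision" form shown above. The function names are illustrative, and the sketch does not cover how the installed revision is obtained or patches that supersede 139735 under a different base number.

```python
def parse_patch_id(patch_id):
    """Split an ID such as '139735-01' into (base, revision)."""
    base, _, rev = patch_id.strip().partition("-")
    if not (base.isdigit() and rev.isdigit()):
        raise ValueError("unrecognized patch ID: %r" % patch_id)
    return base, int(rev)


def meets_minimum(installed, required="139735-01"):
    """True if 'installed' has the same base as 'required' and an equal or later revision."""
    installed_base, installed_rev = parse_patch_id(installed)
    required_base, required_rev = parse_patch_id(required)
    return installed_base == required_base and installed_rev >= required_rev
```

For example, meets_minimum("139735-02") is True, while an ID with a different base number is rejected.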
Note: It is highly recommended to install ALOM 1.6.9 before installing the above patch. Also, make sure that "epicupdate" completes with the following message:
    EPIC update completed successfully.
If "epicupdate" fails with the following error:
    EPIC image verification failed. Bad record 0x0
    EPIC update failed.
    EPIC update failed. Please retry
retry until "epicupdate" completes successfully.
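
The retry procedure can be sketched as a loop that repeats until the success message appears. The following minimal Python sketch makes two assumptions: run_update is a hypothetical caller-supplied function that runs "epicupdate" by whatever means is available and returns its output text, and max_attempts is an illustrative safety cap that this document does not specify.

```python
SUCCESS_MESSAGE = "EPIC update completed successfully."


def retry_epic_update(run_update, max_attempts=5):
    """Call run_update() until its output contains the success message.

    run_update is a hypothetical callable supplied by the caller; it is
    expected to perform one "epicupdate" run and return the output as text.
    Returns the number of attempts that were needed.
    """
    last_output = ""
    for attempt in range(1, max_attempts + 1):
        last_output = run_update()
        if SUCCESS_MESSAGE in last_output:
            return attempt
    raise RuntimeError(
        "epicupdate did not succeed after %d attempts; last output:\n%s"
        % (max_attempts, last_output)
    )
```

Testing for the success message, rather than for the absence of the error text, follows the guidance above that only the "completed successfully" line confirms the update.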


Modification History
12-Mar-2009: Updated the Resolution section.
02-Jan-2014: checked for currency/relevance/formatting; no change in content

Questions regarding this document should be addressed to
sunalertpublication_us_grp@oracle.com
Internal Contributor/submitter: Kenji.Suzuki@oracle.com
Internal Eng Responsible Engineer: Roddy.Reilly@oracle.com
Internal Services Knowledge Engineer: jeff.folla@sun.com

Attachments
This solution has no attachment
  Copyright © 2018 Oracle, Inc.  All rights reserved.