Sun Microsystems, Inc.  Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition
   Home | Current Systems | Former STK Products | EOL Systems | Components | General Info | Search | Feedback

Asset ID: 1-72-1523766.1
Update Date:2017-10-18
Keywords:

Solution Type  Problem Resolution Sure

Solution  1523766.1 :   Sun SPARC Enterprise Server E3x00/E4x00/E5x00/E6x00 ( SunFire Classic ): "Overtemp detected on board " error during the POST.  


Related Items
  • Sun Enterprise 6500 Server
  •  
  • Sun Enterprise 3500 Server
  •  
  • Sun Enterprise 5000 Server
  •  
  • Sun Enterprise 4000 Server
  •  
  • Sun Enterprise 3000 Server
  •  
  • Sun Enterprise 4500 Server
  •  
  • Sun Enterprise 5500 Server
  •  
  • Sun Enterprise 6000 Server
  •  
Related Categories
  • PLA-Support>Sun Systems>SPARC>Enterprise>SN-SPARC: Exx00
  •  




In this Document
Symptoms
Cause
Solution


Created from <SR 3-6605776950>

Applies to:

Sun Enterprise 3000 Server - Version All Versions and later
Sun Enterprise 3500 Server - Version All Versions and later
Sun Enterprise 4000 Server - Version All Versions and later
Sun Enterprise 4500 Server - Version All Versions and later
Sun Enterprise 5000 Server - Version All Versions and later
Information in this document applies to any platform.

Symptoms

During the POST, the server displays the error "Overtemp detected on board X".

The following system messages indicate environmental conditions:

PROM NOTICE: Overtemp detected on board <n>.
PROM NOTICE: System has cooled down.
PROM WARNING: Board <n> is too hot.
PROM NOTICE: Insufficient power detected.
PROM NOTICE: Power supply restored.
PROM NOTICE: Board insert detected.
PROM NOTICE: Reset Initiated...


If a board temperature is above a predetermined temperature threshold for that
board type, the OpenBoot PROM (OBP) initiates a reset. This results in POST
disabling the faulty board.

If Insufficient power detected is not fixed in 30 seconds, then the OBP
initiates a reset to enable POST to deconfigure the necessary boards.

 

The following example is taken from an E3000 server with CPU/MEM Board 3 and CPU/MEM Board 5 present in the configuration

Running the POST with only one of this boards (3 or 5), the same error is displayed.

Only Board 3:

3,0>Board 3 Environmental Probe Test
3,0>    Environmental Probe
3,0>ERROR: TEST=Environmental Probe,SUBTEST=Environmental Probe ID=1f.1
3,0>Component under test: Board 3 System Interrupt
3,0>Overtemp detected on board 16

Only Board 5:

5,0>Board 5 Environmental Probe Test
5,0>    Environmental Probe
5,0>ERROR: TEST=Environmental Probe,SUBTEST=Environmental Probe ID=1f.1
5,0>Component under test: Board 5 System Interrupt
5,0>Overtemp detected on board 16

During the environment tests Board 16 fail because of overheating.

slot 16 is the Clock Board:

0,0>Board 16 Clock Board Test

Cause

This error can be related to

  • defective Power Cooling Module (PCM) which is responsible for cooling the board reporting overtemp
  • temperature sensors failure of the board itself.


Solution

The first thing to do is check the state of the PCMs of the server in order to identify a possible issue with the cooling.

  • If possible swap the PCM resposable for the cooling the reporting board with another and check if the issue persists against the same board or move to another one.
    • If so, the Board is the most suspect FRU and needs to be replaced.
    • If not the Power Cooling Module (PCM) is the first suspect

 

In the previous example the most suspect FRU is the Clock Board (Board 16)

 

Note:

1. each PCM is responsable for the cooling of 2 Boards

2. on E3x00 systems each PCM drives the boards below it

3. on E4x00/E5x00/E6x00 systems each PCM drives the boards aside it


Open a Service Request using MOS Portal and request the diagnosis to confirm the fault and replace the defective component.

 

To discuss this information further with Oracle experts and industry peers, we encourage you to review, join or start a discussion in an appropriate
My Oracle Support Community - Oracle Sun Technologies Community.

 


Attachments
This solution has no attachment
  Copyright © 2018 Oracle, Inc.  All rights reserved.
 Feedback