Sun Microsystems, Inc.  Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition
   Home | Current Systems | Former STK Products | EOL Systems | Components | General Info | Search | Feedback

Asset ID: 1-79-2218777.1
Update Date:2017-10-16
Keywords:

Solution Type  Predictive Self-Healing Sure

Solution  2218777.1 :   M12-env.temp.over-warn - An overtemperature warning condition is detected by a temperature sensor  


Related Items
  • Fujitsu SPARC M12-1
  •  
  • Fujitsu M10 PCI Expansion Unit
  •  
  • Fujitsu SPARC M12-2
  •  
  • Fujitsu SPARC M12-2S
  •  
Related Categories
  • PLA-Support>Sun Systems>Sun_Other>Sun Collections>SN-OTH: Sun PSH
  •  




In this Document
Purpose
Details
References


Applies to:

Fujitsu M10 PCI Expansion Unit
Fujitsu SPARC M12-2
Fujitsu SPARC M12-2S
Fujitsu SPARC M12-1
SPARC

Purpose

Provide additional information for message ID: M12-env.temp.over-warn

Fujitsu fault codes:

01910811, 01910911, 01910A11, 01910B11, 01910C11, 01910D11,
01910E10, 01912D11, 01912F11, 11000031, 11000038

Details

Type

Hardware Fault
   env.temp.over-warn

Severity

Major

Description

Fault due to an overtemperature warning condition detected by a temperature sensor. The warning indicates a higher temperature than the overtemperature condition event described in:

M12-env.temp.over - An overtemperature condition is detected by a temperature sensor (Doc ID 2218787.1)

When this fault is detected at the temperature sensor at inlet, no immediate action will be taken other than log.

When this fault is detected on a XB chip, PPARs using this XB chip will be requested to shut down.

When the fault is detected on a PSU, the PSU will be requested to power-off.  The chassis will keep running with using remaining PSU, unless the other
PSU is already faulted.

For other cases, a PPAR, which is using a FRU where this fault is detected, will be requested to shut down due to this overtemperature condition.

The overtemperature warning was detected by:

- An inlet temperature sensor

- An exhaust temperature sensor

- A temperature sensor located near a:

  - CPU

  - DIMM

  - XB chip

  - PCI express switch chip

  - SAS chip

  - PSU

  - DDC

The sensor could also be located on an XB-Box or PCI expansion box.

Automated Response

The fan speed of all the fans on the platform is raised to high (the fans are probably already running at high speed) and the platform administrator should investigate the cause of the overtemperature warning condition.

If the temperature sensor is an inlet temperature sensor, then no further action is taken.

When this fault is detected at XBU, then shutdown messages are sent to all PPARs using this XBU.

If shutdown messages have been sent to all domains on the platform, then no further action is taken. Otherwise, shutdown messages are sent to a PPAR, which uses FRU where this fault is detected, by the XSCF driver.

Impact

If it’s detected by inlet temperature sensor, Nothing is deconfigured.
In other cases, a FRU where the temperature abnormality is detected is deconfigured.

Indicted Hardware

The suspect is the external environment when detected on the inlet temperature sensor.

In other cases, Fujitsu M12 hardware/firmware cannot distinguish the failure is caused by either the external environment (ambient temperature problem or an issue decreasing air flow for the system) or a failure of the hardware.  A field engineer should check the system.  The field engineer should consider a possibility that the cause was gone before their arrival.  If the field engineer cannot find the root cause, the error state should be cleared, and try to start the system to see if the problem can be reproduced.  The error state of this failure can be cleared by disconnecting all the power cables to the chassis where the FRU is located. (For example, if it is a PCI expansion box, disconnect all the power cables connected to the PCI expansion box)

If an immediate fix is required, the reported FRU should be replaced understanding that the root cause can be the external environment.

Suggested Action for System Administrator

Refer to the following document for the latest procedures for displaying event content in preparation for submitting a service request and applying any post-repair actions that may be required.

PSH Procedural Article for Fujitsu M10 Diagnosis (Doc ID 1525156.1)

 


Attachments
This solution has no attachment
  Copyright © 2018 Oracle, Inc.  All rights reserved.
 Feedback