Sun Microsystems, Inc.  Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition

Asset ID: 1-72-1938097.1
Update Date: 2017-05-02
Keywords:

Solution Type  Problem Resolution Sure

Solution  1938097.1 :   NETRA 1290 : FAILED FAN  


Related Items
  • Sun Netra 1290 Server
Related Categories
  • PLA-Support>Sun Systems>SPARC>Enterprise>SN-SPARC: SF-x8x0/Ex900




In this Document
Symptoms
Cause
Solution
References


Created from <SR 3-9294617051>

Applies to:

Sun Netra 1290 Server - Version All Versions and later
Information in this document applies to any platform.

Symptoms

The showenvironment and showlogs commands report low fan speed and mark the fan tray (FT) and fan as faulted, but prtdiag does not report a corresponding alarm. Replacing the faulted fans does not solve the problem.

Cause

NOTE: The most common cause of a fan being reported as failed is a genuinely failed fan. This document applies only to systems where replacing the fan does not clear the fault.

After several fans have been replaced, new showboards output may show the SIB (Indicator Board) with a status of "Degraded":

lom>showboards

Slot      Pwr  Component Type                 State       Status
----      ---  --------------                 -----       ------
SSC1      On   System Controller V2           Main        Passed
/N0/SCC   -    System Config Card             Assigned    OK
/N0/BP    -    Baseplane                      Assigned    Passed
/N0/SIB   -    Indicator Board                Assigned    Degraded
/N0/SPDB  -    System Power Distribution Bd.  Assigned    Passed
/N0/PS0   On   D142 Power Supply              -           OK
/N0/PS1   On   D142 Power Supply              -           OK
/N0/PS2   On   D142 Power Supply              -           OK
/N0/PS3   On   D142 Power Supply              -           OK
/N0/FT0   On   Fan Tray                       Auto Speed  Passed
/N0/RP0   On   Repeater Board                 Assigned    OK
/N0/RP2   On   Repeater Board                 Assigned    OK
/N0/SB0   On   CPU Board V3                   Active      Passed
/N0/IB6   On   PCI+ I/O Board                 Active      Passed
/N0/MB    -    Media Bay                      Assigned    Passed
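
When a saved copy of the showboards output is reviewed offline, a degraded board is easy to miss among the healthy rows. The following is a minimal Python sketch, not a Sun or Oracle tool, that flags any row whose last column is something other than Passed or OK. The function name, the HEALTHY set, and the idea that the status is always the last whitespace-separated token are assumptions made for illustration, based only on the sample shown here.

```python
# Abbreviated sample copied from the showboards output in this document.
SHOWBOARDS = """\
Slot Pwr Component Type State Status
---- --- -------------- ----- ------
SSC1 On System Controller V2 Main Passed
/N0/BP - Baseplane Assigned Passed
/N0/SIB - Indicator Board Assigned Degraded
/N0/FT0 On Fan Tray Auto Speed Passed
"""

# Assumption: any status other than these two deserves a closer look.
HEALTHY = {"Passed", "OK"}


def unhealthy_boards(text):
    """Return (slot, status) pairs whose final column is not in HEALTHY."""
    found = []
    for line in text.splitlines():
        fields = line.split()
        # Skip blank lines, the prompt line, the header and the dashed rule.
        if len(fields) < 2 or fields[0] in ("Slot", "----"):
            continue
        slot, status = fields[0], fields[-1]
        if status not in HEALTHY:
            found.append((slot, status))
    return found


print(unhealthy_boards(SHOWBOARDS))  # [('/N0/SIB', 'Degraded')]
```

Reading only the first and last tokens keeps the sketch independent of how the multi-word Component Type and State columns are spaced.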


prtdiag shows only the chassis as alarmed:

  - fault ON amber
  - power ON green
  - top_access ON amber



showenvironment may show one or more fans as failed for a number of days.

In this example, the first error was reported against Fan 0, but after that fan was replaced the failed component changed to Fan 3. The fan numbers will vary from case to case.

/N0/FT0 Fan 3 Cooling 0 ???? 4 days failed
/N0/FT0 Fan 0 Cooling 0 Auto 7 sec OK
/N0/FT0 Fan 1 Cooling 0 Auto 7 sec OK
/N0/FT0 Fan 2 Cooling 0 Auto 7 sec OK
/N0/FT0 Fan 4 Cooling 0 Auto 7 sec OK
/N0/FT0 Fan 5 Cooling 0 Auto 7 sec OK
/N0/FT0 Fan 6 Cooling 0 Auto 7 sec OK
/N0/FT0 Fan 7 Cooling 0 Auto 7 sec OK
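
Because the failed fan number can move after each replacement, it can help to compare successive showenvironment captures. The sketch below is a hypothetical helper, written in Python for illustration only, that lists the fans marked failed in one capture. It assumes each fan row has the shape shown above (slot, the word "Fan", the fan number, then further columns ending in an age and a status); the column meanings are not documented here, so only those positions are used.

```python
# Abbreviated sample copied from the showenvironment lines in this document.
SHOWENV = """\
/N0/FT0 Fan 3 Cooling 0 ???? 4 days failed
/N0/FT0 Fan 0 Cooling 0 Auto 7 sec OK
/N0/FT0 Fan 1 Cooling 0 Auto 7 sec OK
"""


def failed_fans(text):
    """Return one dict per fan row whose last column reads 'failed'."""
    failed = []
    for line in text.splitlines():
        fields = line.split()
        # Assumed row shape: <slot> Fan <n> ... <age> <unit> <status>
        if len(fields) < 6 or fields[1] != "Fan":
            continue
        if fields[-1].lower() == "failed":
            failed.append({
                "tray": fields[0],
                "fan": int(fields[2]),
                "age": " ".join(fields[-3:-1]),
            })
    return failed


print(failed_fans(SHOWENV))
# [{'tray': '/N0/FT0', 'fan': 3, 'age': '4 days'}]
```

If the reported fan number differs between captures taken before and after a replacement, that matches the pattern described in this document and suggests that the fan itself may not be the root cause.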


The following event appears in showlogs:

Thu Aug 28 21:40:08 wip1sc lom: [ID 201746 local0.notice] Fan Tray Slot 0 Device poll caused: sun.serengeti.FailedHwException: Lw8FanCsr.getRotorSpeed: I2cComm.busyWait: I2c error waiting for: RRDY, slave did not ACK, status=0x6023c089, bus=26(/N0/BP) ring=06 addr=23


This kind of message indicates a problem on the I2C bus, which gathers status information from the various components in the system. In this case the I2C path is System Controller / Baseplane / Indicator Board / Fan Tray. In very rare conditions, a problem on one component of the I2C bus can cause other FRUs to respond with misleading status readings.
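
To spot such I2C errors in a large showlogs capture, the message can be split into its fields. The Python sketch below is an illustration only: the regular expression and the group names are assumptions derived from the single sample line in this document, and other firmware versions may word the message differently. It extracts the values without interpreting what ring or addr mean.

```python
import re

# The showlogs line quoted in this document, split only for readability.
LOG_LINE = (
    "Thu Aug 28 21:40:08 wip1sc lom: [ID 201746 local0.notice] Fan Tray Slot 0 "
    "Device poll caused: sun.serengeti.FailedHwException: Lw8FanCsr.getRotorSpeed: "
    "I2cComm.busyWait: I2c error waiting for: RRDY, slave did not ACK, "
    "status=0x6023c089, bus=26(/N0/BP) ring=06 addr=23"
)

# Assumed message layout, taken from the one sample above.
I2C_ERROR = re.compile(
    r"I2c error waiting for: (?P<waiting_for>\w+), (?P<reason>[^,]+), "
    r"status=(?P<status>0x[0-9a-fA-F]+), "
    r"bus=(?P<bus>\d+)\((?P<bus_fru>[^)]+)\) "
    r"ring=(?P<ring>\w+) addr=(?P<addr>\w+)"
)


def parse_i2c_error(line):
    """Return the I2C error fields as a dict, or None if the line differs."""
    match = I2C_ERROR.search(line)
    return match.groupdict() if match else None


info = parse_i2c_error(LOG_LINE)
print(info["bus_fru"], info["ring"], info["addr"])  # /N0/BP 06 23
```

Here the bus field names /N0/BP, a FRU on the I2C path rather than the fan itself, which is consistent with the fault lying elsewhere on the bus.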


Solution

In this particular case, replacing the SIB solved the problem. The SIB resides on the Front Bezel Field Replaceable Unit (FRU) 540-6565.

References

<NOTE:1008393.1> - Sun Fire [TM] SF3800/SF4800/SF4810/SF6800 - E4900/E6900 - V1280/E2900 - Netra 1280/1290 : Troubleshooting Cooling Fan and Power Supply Fan Failures
<NOTE:1019066.1> - Sun Fire[TM] v1280, 3800, 4800, 4810, 6800, E2900, E4900, E6900 and Netra[TM] 1280, 2900 servers: How to collect scextended or 1280extended Explorer

  Copyright © 2018 Oracle, Inc.  All rights reserved.