Sun Microsystems, Inc.  Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition
   Home | Current Systems | Former STK Products | EOL Systems | Components | General Info | Search | Feedback

Asset ID: 1-79-1452064.1
Update Date:2017-07-26
Keywords:

Solution Type  Predictive Self-Healing Sure

Solution  1452064.1 :   SUN4V-8002-KQ faults reported on T4-4, T4-2 and Netra T4-2 servers  


Related Items
  • SPARC SuperCluster T4-4
  •  
  • SPARC T4-2
  •  
  • Netra SPARC T4-2 Server
  •  
  • SPARC T4-4
  •  
Related Categories
  • PLA-Support>Sun Systems>SPARC>CMT>SN-SPARC: T4
  •  


A limited number of Sparc T4-4, T4-2 and Netra Sparc T4-2 servers may experience a SUN4V-8002-KQ FMA fault.

In this Document
Purpose
Details
References


Oracle Confidential PARTNER - Available to partners (SUN).
Reason: Internal support procedures

Applies to:

Netra SPARC T4-2 Server
SPARC SuperCluster T4-4
SPARC T4-4
SPARC T4-2
SPARC
If the threshold level for ereport.cpu.generic-sparc.c2c-link@/host events is exceeded a FMA SUN4V-8002-KQ fault will be triggered by FMA.

Example:

root@ssccn4-m1:~# fmadm faulty
--------------- ------------------------------------ -------------- ---------
TIME EVENT-ID MSG-ID SEVERITY
--------------- ------------------------------------ -------------- ---------
Mar 06 16:17:01 12f78095-7788-e159-dfa9-8da2a22fb197 SUN4V-8002-KQ Major

Host : ssccn4-m1
Platform : ORCL,SPARC-T4-4 Chassis_id :
Product_sn :

Fault class : fault.cpu.generic-sparc.c2c
Affects : hc://:product-id=ORCL,SPARC-T4-4:product-sn=1151BDY190:server-id=ssccn4-m1:chassis-id=1151BDY190/chassis=0/cpuboard=1/chip=2
hc://:product-id=ORCL,SPARC-T4-4:product-sn=1151BDY190:server-id=ssccn4-m1:chassis-id=1151BDY190/chassis=0/cpuboard=0/chip=0
faulted but still in service
FRU : "/SYS/PM1" (hc://:product-id=ORCL,SPARC-T4-4:product-sn=1151BDY190:server-id=ssccn4-m1:chassis-id=1151BDY190:serial=465769T+1149L9010F:part=7019789:revision=05/chassis=0/cpuboard=1) 50%
"/SYS/PM0" (hc://:product-id=ORCL,SPARC-T4-4:product-sn=1151BDY190:server-id=ssccn4-m1:chassis-id=1151BDY190:serial=465769T+1124L9000G:part=7019789:revision=03/chassis=0/cpuboard=0) 50%
faulty

Description : The number of chip-to-chip recoverable errors has exceeded acceptable levels.

Purpose

 This document outlines current support procedures for any of the T4-4, T4-2 and Netra T4-2 servers which have reported a SUN4V-8002-KQ fault.

Details

T4-4 servers only:

NOTE:6/5/2012 - FAB 1463634.1 is now released. If a  SUN4V-8002-KQ  occurs on a T4-4 server follow the procedures in the FAB 

Download the Sun_SPARC_T4-4_PM_E0010556.pkg:  Sun_SPARC_T4-4_PM_E0010556.zip 

The FAB shows the installation using the CLI interface. Use of the WEB BUI is also supported.


T4-2  and Netra T4-2 servers:

When a Service Request is opened for a SUN4V-8002-KQ fault follow this process.

NOTE:This article does NOT apply to SUN4V-8002-MJ faults. See article 1180029.1 .

Review the previous service history for the system serial number checking for any previous reports of  SUN4V-8002-KQ faults.

Another option is to review fma output explorer data. As long as prior fma data has not been purged any previous faults should be listed in the fmdump.out file in the fma directory.

Below is an example of a system where previous repairs were made but new SUN4V-8002-KQ faults occurred.

more fmdump.out
TIME                 UUID                                 SUNW-MSG-ID
Mar 04 10:42:44.4448 1868ee5f-e1ef-ea88-9866-f6be13edcc93 FMD-8000-4M Repaired
Mar 04 10:42:44.7305 1868ee5f-e1ef-ea88-9866-f6be13edcc93 FMD-8000-6U Resolved
Mar 04 10:52:16.3468 52e54ae0-3b27-eaba-99a4-b4ca595e23a2 SUN4V-8002-KQ
Mar 04 10:55:42.3190 52e54ae0-3b27-eaba-99a4-b4ca595e23a2 FMD-8000-4M Repaired
Mar 04 10:55:42.5931 52e54ae0-3b27-eaba-99a4-b4ca595e23a2 FMD-8000-6U Resolved
Mar 04 11:05:02.4880 001f670e-5522-cbd9-8ece-cb2ae15ae499 SUN4V-8002-KQ
Mar 04 11:29:33.5235 001f670e-5522-cbd9-8ece-cb2ae15ae499 FMD-8000-4M Repaired
Mar 04 11:29:33.8412 001f670e-5522-cbd9-8ece-cb2ae15ae499 FMD-8000-6U Resolved
Mar 04 11:39:41.3722 e56ed3d4-5f1f-ef31-db30-9093d7a82685 SUN4V-8002-KQ
Mar 04 13:38:06.2154 e56ed3d4-5f1f-ef31-db30-9093d7a82685 FMD-8000-4M Repaired
Mar 04 13:38:06.5064 e56ed3d4-5f1f-ef31-db30-9093d7a82685 FMD-8000-6U Resolved

 

Process:

If this is the first time a SUN4V-8002-KQ fault has been reported the recommendation is to clear the FMA faults and power cycle the server.

From Solaris:
Use fmadm faulty to determine which device(s) have reported a SUN4V-8002-KQ fault.

For T4-2 the FRU listed will be /SYS/MB


#fmadm repaired /SYS/MB

#fmadm faulty

From ILOM

-> start /SP/faultmgmt/shell
Are you sure you want to start /SP/faultmgmt/shell (y/n)? y

faultmgmtsp> fmadm faulty -r
/SYS/MB

faultmgmtsp> fmadm repair /SYS/MB

faultmgmtsp> fmadm faulty -r

If this is a second fault and a single FRU on the T4-2 (System Board) is deemed faulted replace that FRU.


Attachments
This solution has no attachment
  Copyright © 2018 Oracle, Inc.  All rights reserved.
 Feedback