Sun Microsystems, Inc.  Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition
   Home | Current Systems | Former STK Products | EOL Systems | Components | General Info | Search | Feedback

Asset ID: 1-79-1021919.1
Update Date:2017-05-09
Keywords:

Solution Type  Predictive Self-Healing Sure

Solution  1021919.1 :   SCF-8005-ED - Correctable errors on the interface between a CPU chip and a SC chip have exceeded an acceptable threshold.  


Related Items
  • Sun SPARC Enterprise M4000 Server
  •  
  • Sun SPARC Enterprise M9000-32 Server
  •  
  • Sun SPARC Enterprise M5000 Server
  •  
  • Sun SPARC Enterprise M9000-64 Server
  •  
  • Sun SPARC Enterprise M8000 Server
  •  
  • Sun SPARC Enterprise M3000 Server
  •  
Related Categories
  • PLA-Support>Sun Systems>Sun_Other>Sun Collections>SN-OTH: Sun PSH
  •  

PreviouslyPublishedAs
SCF-8005-ED


Applies to:

Sun SPARC Enterprise M3000 Server - Version All Versions to All Versions [Release All Releases]
Sun SPARC Enterprise M4000 Server - Version All Versions to All Versions [Release All Releases]
Sun SPARC Enterprise M5000 Server - Version All Versions to All Versions [Release All Releases]
Sun SPARC Enterprise M8000 Server - Version All Versions to All Versions [Release All Releases]
Sun SPARC Enterprise M9000-32 Server - Version All Versions to All Versions [Release All Releases]
All Platforms

Purpose

Provide additional information for message ID: SCF-8005-ED

Scope

 

Details

Predictive Self-Healing Article
Correctable errors on the interface between a CPU chip and a SC chip have exceeded an acceptable threshold.

Type

Fault
  fault.chassis.SPARC-Enterprise.if.ce-cpu-sc

Severity

Minor

Description

The number of correctable errors on the interface between a CPU chip and a SC chip has exceeded an acceptable threshold.

Automated Response

The CPU chip is deconfigured after the platform is power cycled or after the domain reboots or after a Dynamic Reconfiguration operation is performed.

Impact

No immediate impact. The CPU chip is deconfigured after the platform is power cycled or after the domain reboots or after a Dynamic Reconfiguration operation is performed.

Suggested Action for System Administrator

Schedule a repair action to replace the affected Field Replaceable Unit (FRU), the identity of which can be determined using fmdump -v -u EVENT_ID. Please consult the detail section of the knowledge article for additional information.

Details

The number of correctable errors on the interface between a CPU chip and a SC chip
has exceeded an acceptable threshold.

The CPU chip is deconfigured after the platform is power cycled or after the domain reboots or
after a Dynamic Reconfiguration operation is performed.


The recommended service action for this event is to schedule the replacement of the affected FRU's.

Step 1. Collect the fault message (use one of the following methods):


   Single-line fault message displayed on the XSCF console:

   Mar 20 21:44:36 san-ff2-21-0 fmd: SOURCE: sde, REV: 1.12, CSN: 7860000772 
   EVENT-ID: 3e6c22e7-1a63-4876-b9d9-3cc2da436a1b
   Refer to http://www.sun.com/msg/SCF-8005-ED for detailed information.

   Complete fault message using 'fmdump -m' on the XSCF console:

   MSG-ID: SCF-8005-ED, TYPE: Fault, VER: 1, SEVERITY: Minor
   EVENT-TIME: Tue Mar 20 21:44:36 UTC 2007
   PLATFORM: SPARC-Enterprise, CSN: 7860000772, HOSTNAME: san-ff2-21-0
   SOURCE: sde, REV: 1.12
   EVENT-ID: 3e6c22e7-1a63-4876-b9d9-3cc2da436a1b
   DESC: The number of correctable errors on the interface between a CPU chip and a SC chip
   has exceeded an acceptable threshold.
   Refer to http://www.sun.com/msg/SCF-8005-ED for more information.
   AUTO-RESPONSE: The CPU chip is deconfigured after the platform is power cycled or
   after the domain reboots or after a Dynamic Reconfiguration operation is performed.
   IMPACT: No immediate impact. The CPU chip is deconfigured after the platform is power cycled or
   after the domain reboots or after a Dynamic Reconfiguration operation is performed.
   REC-ACTION: Schedule a repair action to replace the affected Field Replaceable Unit (FRU),
   the identity of which can be determined using fmdump -v -u EVENT_ID.
   Please consult the detail section of the knowledge article for additional information.


Step 2. Collect the output from the 'fmdump -v -u EVENT_ID' command


        xscf> fmdump -v -u 3e6c22e7-1a63-4876-b9d9-3cc2da436a1b

        TIME                 UUID                                 MSG-ID
        Mar 20 21:44:36.2075 3e6c22e7-1a63-4876-b9d9-3cc2da436a1b SCF-8005-ED
          66%  fault.chassis.SPARC-Enterprise.if.ce-cpu-sc

               Problem in: hc:///chassis=0/cmu=0/cpu=0
                  Affects: hc:///chassis=0/cmu=0/cpu=0
                      FRU: hc://:product-id=SPARC-Enterprise:chassis-id=7860000772:
                           server-id=san-ff2-21-0:serial=PP06121471:
                           part=CA06761-D102 A0:revision=0101/component=/MBU_B/CPUM#0

          33%  fault.chassis.SPARC-Enterprise.if.ce-cpu-sc

               Problem in: hc:///chassis=0/cmu=0
                  Affects: hc:///chassis=0/cmu=0/cpu=0
                      FRU: hc://:product-id=SPARC-Enterprise:chassis-id=7860000772:
                           server-id=san-ff2-21-0:
                           part=CA20393-B50X 001AA:revision=0101/component=/MBU_B


Step 3. Contact your Authorized Service Provider.

If you require additional information, please refer to Document 1002526.1.

Attachments
This solution has no attachment
  Copyright © 2018 Oracle, Inc.  All rights reserved.
 Feedback