Sun Microsystems, Inc.  Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition

Asset ID: 1-72-1994655.1
Update Date: 2018-05-09

Solution Type  Problem Resolution Sure

Solution 1994655.1: T4-2 - ILOM shows an SPT-8000-HR message and a failed MCU


Related Items
  • SPARC T4-2
Related Categories
  • PLA-Support>Sun Systems>SPARC>CMT>SN-SPARC: T4

In this Document
Symptoms
Cause
Solution
References


Created from <SR 3-10437756541>

Applies to:

SPARC T4-2 - Version All Versions to All Versions [Release All Releases]
Information in this document applies to any platform.

Symptoms

The system reports errors during POST (power-on self-test), which may prevent it from booting.

POST logs SMI link errors and reports a memory controller unit (MCU) that fails to initialize:

2015-03-17 18:54:16  0:0:0> NOTICE:  SMI Channel 0, SB Mapping 0 -- ERRCNT:0x0LNERR:0x0
2015-03-17 18:54:16  1:0:0> NOTICE:  SMI Channel 0, SB Mapping 0 -- ERRCNT:0x1f8LNERR:0x80
2015-03-17 18:54:16  0:0:0> NOTICE:  SMI Channel 0, SB Mapping 1 -- ERRCNT:0x0LNERR:0x0
2015-03-17 18:54:16  1:0:0> NOTICE:  SMI Channel 0, SB Mapping 1 -- ERRCNT:0x1f8LNERR:0x80
2015-03-17 18:54:16  0:0:0> NOTICE:  SMI Channel 1, SB Mapping 0 -- ERRCNT:0x0LNERR:0x0
2015-03-17 18:54:16  1:0:0> NOTICE:  SMI Channel 1, SB Mapping 0 -- ERRCNT:0x0LNERR:0x0
2015-03-17 18:54:16  0:0:0> NOTICE:  SMI Channel 1, SB Mapping 1 -- ERRCNT:0x0LNERR:0x0
2015-03-17 18:54:16  1:0:0> NOTICE:  SMI Channel 1, SB Mapping 1 -- ERRCNT:0x0LNERR:0x0
2015-03-17 18:54:16  1:0:0> ERROR:   N1.MCU0: SMI link failed memory link test.
2015-03-17 18:54:16  1:0:0> ERROR:   /SYS/MB/CMP1/MCU0 failed to initialize
2015-03-17 18:54:53  0:0:0> NOTICE:  SMI Channel 0, SB Mapping 0 -- ERRCNT:0x0LNERR:0x0
2015-03-17 18:54:53  0:0:0> NOTICE:  SMI Channel 0, SB Mapping 1 -- ERRCNT:0x0LNERR:0x0
2015-03-17 18:54:53  0:0:0> NOTICE:  SMI Channel 1, SB Mapping 0 -- ERRCNT:0x0LNERR:0x0
2015-03-17 18:54:54  0:0:0> NOTICE:  SMI Channel 1, SB Mapping 1 -- ERRCNT:0x0LNERR:0x0
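When a POST log is long, it can help to filter out the SMI lines whose counters are nonzero. The following is a minimal Python sketch, not an Oracle tool. It assumes the line layout shown in the excerpt above, treats the whitespace between the ERRCNT and LNERR fields as optional, and assumes that the first number of the `1:0:0>` prefix identifies the CPU node. The function name `smi_errors` is hypothetical.

```python
import re

# Pattern for the SMI NOTICE lines in the excerpt above. Whitespace between
# the ERRCNT and LNERR fields is optional because the sample shows none.
# Assumption: the first number of the "1:0:0>" prefix is the CPU node.
SMI_RE = re.compile(
    r"(?P<node>\d+):\d+:\d+>\s+NOTICE:\s+SMI Channel (?P<channel>\d+), "
    r"SB Mapping (?P<mapping>\d+) -- "
    r"ERRCNT:(?P<errcnt>0x[0-9a-fA-F]+)\s*LNERR:(?P<lnerr>0x[0-9a-fA-F]+)"
)


def smi_errors(lines):
    """Return (node, channel, mapping, errcnt, lnerr) for nonzero counters."""
    hits = []
    for line in lines:
        match = SMI_RE.search(line)
        if match is None:
            continue  # not an SMI NOTICE line
        errcnt = int(match["errcnt"], 16)
        lnerr = int(match["lnerr"], 16)
        if errcnt or lnerr:
            hits.append(
                (int(match["node"]), int(match["channel"]),
                 int(match["mapping"]), errcnt, lnerr)
            )
    return hits
```

Applied to the excerpt above, this would flag only the two Channel 0 lines logged with the `1:0:0>` prefix, which is consistent with the `N1.MCU0` link test failure that follows them.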

 

FMA reports an SPT-8000-HR message, and the ereports in the ILOM snapshot show that an MCU was disabled:

2015-03-17/18:44:21  ereport.hc.dev_faulted.mcu_smi_health_fail@/SYS/MB/CMP1/MCU0

2015-03-17/18:52:03  ereport.component.disabled@/SYS/MB/CMP1/MCU0   /SYS/MB/CMP1/MCU0
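To list the disabled components from a larger set of ereport lines, a small parser along the following lines may be useful. This is a sketch in Python that assumes each line has the form `timestamp  class@component`, as in the two lines above; the function names `parse_ereport` and `disabled_components` are hypothetical and do not belong to any Oracle tool.

```python
def parse_ereport(line):
    """Split 'timestamp  class@component ...' into a 3-tuple, or return None."""
    fields = line.split()
    if len(fields) < 2 or "@" not in fields[1]:
        return None  # blank line or unexpected layout
    ereport_class, _, component = fields[1].partition("@")
    return fields[0], ereport_class, component


def disabled_components(lines):
    """Return the sorted component paths named in component.disabled ereports."""
    found = set()
    for line in lines:
        parsed = parse_ereport(line)
        if parsed and parsed[1] == "ereport.component.disabled":
            found.add(parsed[2])
    return sorted(found)
```

For the two ereports above, `disabled_components` would return the single path `/SYS/MB/CMP1/MCU0`.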

 

Once POST finishes, the system may abort the auto-boot sequence and stop at the ok prompt:

ERROR: One or more resources have been retired, please check the SP logs.
Aborting auto-boot sequence.
{0} ok

 

Cause

A faulty MCU can cause the errors noted above.

Solution

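Clear the fault, re-enable the component (if possible), and power cycle the system. In the second command, replace CMP1/MCUX with the path of the component reported in the fault (/SYS/MB/CMP1/MCU0 in the example above):
-> set /SYS/MB/ clear_fault_action=true
-> set /SYS/MB/CMP1/MCUX component_state=enabled (if possible)
-> reset /SYS
-> start /SP/console (to watch POST and check whether the MCU is disabled again)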
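The command sequence above can be generated for a specific component so that the path is substituted consistently. The following Python sketch only builds the command strings listed in this document; it does not connect to ILOM. The function name `recovery_commands` and the path check are assumptions made for illustration.

```python
import re

# Assumption: MCU paths follow the /SYS/MB/CMP<n>/MCU<n> form seen in this document.
MCU_PATH_RE = re.compile(r"^/SYS/MB/CMP\d+/MCU\d+$")


def recovery_commands(component):
    """Return the ILOM commands from this document for the given MCU path."""
    if not MCU_PATH_RE.match(component):
        raise ValueError(f"unexpected component path: {component!r}")
    return [
        "set /SYS/MB/ clear_fault_action=true",        # clear the fault
        f"set {component} component_state=enabled",    # re-enable, if possible
        "reset /SYS",                                  # rerun POST
        "start /SP/console",                           # watch POST output
    ]


for command in recovery_commands("/SYS/MB/CMP1/MCU0"):
    print("->", command)
```

The commands still have to be entered at the ILOM prompt by hand, and the re-enable step may not be accepted for every component.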

If the component faults again on the next POST run after the fault has been cleared, open an SR with Oracle Support to investigate.

An ILOM snapshot will be required to troubleshoot the issue.

 

The MCU is part of the motherboard (MB). If the fault persists, replacing the MB should resolve the problem.

References

<NOTE:1994240.1> - How to determine MCU0 or MCU1 is being flagged an error or false positive
<NOTE:1161794.1> - SPT-8000-HR - Component Disabled

  Copyright © 2018 Oracle, Inc.  All rights reserved.