Sun Microsystems, Inc.  Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition
   Home | Current Systems | Former STK Products | EOL Systems | Components | General Info | Search | Feedback

Asset ID: 1-72-1904115.1
Update Date:2017-05-01
Keywords:

Solution Type  Problem Resolution Sure

Solution  1904115.1 :   System fails to boot due to FATAL: /packages/deblocker: Last Trap: Non-Resumable Error  


Related Items
  • Sun SPARC Enterprise T5120 Server
  •  
  • Sun SPARC Enterprise T5220 Server
  •  
  • Sun SPARC Enterprise T5140 Server
  •  
  • Sun SPARC Enterprise T2000 Server
  •  
  • Sun SPARC Enterprise T5240 Server
  •  
  • Sun Fire T2000 Server
  •  
Related Categories
  • PLA-Support>Sun Systems>SPARC>CMT>SN-SPARC: Tx000
  •  




In this Document
Symptoms
Cause
Solution
References


Created from <SR 3-9255606672>

Applies to:

Sun Fire T2000 Server - Version All Versions and later
Sun SPARC Enterprise T5140 Server - Version All Versions and later
Sun SPARC Enterprise T5220 Server - Version All Versions and later
Sun SPARC Enterprise T2000 Server - Version All Versions and later
Sun SPARC Enterprise T5240 Server - Version Not Applicable and later
Information in this document applies to any platform.

Symptoms

 System won't boot at all from the internal disks (single user shown here):

{0} ok boot -sv
Boot device: /pci@7c0/pci@0/pci@1/pci@0,2/LSILogic,sas@2/disk@0,0:a  File and args: -sv
FATAL: /packages/deblocker: Last Trap: Non-Resumable Error
Total Number of Non-resumable traps = 1
Non-resumable Error service report:
EHDL:  10000000000c2a
STICK: 1d61c96880
EDESC: 2
EATTR: 2000002
RA:    -1
SIZ:   40
CPUID: 0

TL:   1

%TL:1 %TT:7f %TPC:f020ac64 %TnPC:f020ac68
%TSTATE:8820001600  %CWP:0
  %PSTATE:16 AG:0 IE:1 PRIV:1 AM:0 PEF:1 RED:0 MM:0 TLE:0 CLE:0 MG:0 IG:0
  %ASI:20  %CCR:88  XCC:Nzvc   ICC:Nzvc

%TL:2 %TT:180 %TPC:f02496d8 %TnPC:f02496dc
%TSTATE:1994f001400  %CWP:0
  %PSTATE:14 AG:0 IE:0 PRIV:1 AM:0 PEF:1 RED:0 MM:0 TLE:0 CLE:0 MG:0 IG:0
  %ASI:4f  %CCR:99  XCC:NzvC   ICC:NzvC

       Normal          GL=1
0:                 0                0
1:          f0200000                0
2:          f0200000                0
3:          fff78000                0
4:              2000        7fff22000
5:                 0        7fff22000
6:                 0        7fff22600
7:                 0         f0243cec
%PC  f020ac64 %nPC f020ac68
%TBA f0200000 %CCR 88200016 XCC:nzvC   ICC:nZVc

Consecutive boot attempts will show that the boot has been disabled (due to the non-resumable error):

{0} ok boot disk1
FATAL: system is not bootable, boot command is disabled 
{0} ok boot disk
FATAL: system is not bootable, boot command is disabled 

 

POST running on MIN shows no errors:

0:0:0> 
0:0:0>Sun Fire[TM] T2000 POST 4.30.4.b 2010/07/09 14:24 
       /export/delivery/delivery/4.30/4.30.4.b/post4.30.4-micro/Niagara/ontario/integrated  (root) 
0:0:0>Copyright (c) 2010, Oracle and/or its affiliates. All rights reserved. 
0:0:0>VBSC cmp0 arg is: ffffffff.00000201 
0:0:0>POST enabling threads: 00000000.ffffffff 
0:0:0>VBSC cntl arg is: ffffffff.00000201 
0:0:0>VBSC selecting POST MIN Testing. 
0:0:0>VBSC setting verbosity level 2 
0:0:0>Start Selftest..... 
0:0:0>Master CPU Tests Basic....Done 
0:0:0>Init MMU..... 
0:0:0>L2 Tests....Done 
0:0:0>Test Memory....Done 
0:0:0>Setup POST Mailbox ....Done 
0:0:0>Extended CPU Tests....Done 
0:0:0>Scrub Memory....Done 
0:0:0>Extended Memory Tests....Done 
0:0:0>IO-Bridge Tests....Done 
2014-06-29 11:54:29.469 0:0:0>INFO: 
2014-06-29 11:54:29.475 0:0:0>  POST Passed all devices. 
2014-06-29 11:54:29.499 0:0:0>POST:     Return to VBSC. 
2014-06-29 11:54:29.516 0:0:0>Master set ACK for vbsc runpost command and spin... 

Cause

 Issue is most likely caused by a DIMM problem.

MAX POST should flag the culprit DIMMs (taken from the output of the same system as the MIN POST above, sample errors):

2014-07-01 21:45:33.296 0:5:1> 
2014-07-01 21:45:33.304 0:5:1>ERROR: TEST = Queue Block Mem Test 
2014-07-01 21:45:33.319 0:5:1>H/W under test = MB/CMP0/CH3/R1/D0/S1 (J2401) 
2014-07-01 21:45:33.335 0:5:1>Repair Instructions: Replace items in order listed by 'H/W under test' above. 
2014-07-01 21:45:33.360 0:5:1>MSG = Pin 13 failed on MB/CMP0/CH3/R1/D0/S1 (J2401) 
2014-07-01 21:45:33.424 0:5:1>END_ERROR 

2014-07-01 21:45:33.433 0:5:1> 
2014-07-01 21:45:33.441 0:5:1>ERROR: TEST = Queue Block Mem Test 
2014-07-01 21:45:33.456 0:5:1>H/W under test = MB/CMP0/CH3/R1/D0/S1 (J2401) 
2014-07-01 21:45:33.473 0:5:1>Repair Instructions: Replace items in order listed by 'H/W under test' above. 
2014-07-01 21:45:33.496 0:5:1>MSG = Pin 143 failed on MB/CMP0/CH3/R1/D0/S1 (J2401) 
2014-07-01 21:45:33.513 0:5:1>END_ERROR 

 

2014-07-01 21:46:26.973 0:0:0>ERROR: 
2014-07-01 21:46:26.981 0:0:0>  POST toplevel status has the following failures: 
2014-07-01 21:46:27.003 0:0:0>          MB/CMP0/CH3/R1/D0/S1 (J2301) 
2014-07-01 21:46:27.136 0:0:0>          MB/CMP0/CH3/R1/D0/S1 (J2401) 
2014-07-01 21:46:27.154 0:0:0>END_ERROR 

 

To have the system run in MAX POST, please perform one of these actions (depending on the SP in the system, ALOM or ILOM):

ILOM:

-> set /SYS keyswitch_state=diag

-> stop /SYS

-> start /SYS

-> start /SP/console

 

ALOM:

sc> setkeyswitch=diag

sc> poweroff

sc> poweron

sc> console -f

 

NOTE: Please remember to set the keyswitch back to normal mode after the logs have been collected or the system will run MAX POST every time you powercycle (which takes a much longer time than regular POST).

Solution

 Open an Oracle Support SR to have the logs looked into and evaluated for possible DIMM replacement

References

<NOTE:1010565.1> - BOOT: Explanation of the "FATAL: system is not bootable, boot command is disabled" OBP message
<NOTE:1011227.1> - How to verify that the Boot devices can be seen for the Sun Fire[TM] T1000/T2000

Attachments
This solution has no attachment
  Copyright © 2018 Oracle, Inc.  All rights reserved.
 Feedback