Asset ID: |
1-72-1904115.1 |
Update Date: | 2017-05-01 |
Keywords: | |
Solution Type
Problem Resolution Sure
Solution
1904115.1
:
System fails to boot due to FATAL: /packages/deblocker: Last Trap: Non-Resumable Error
Related Items |
- Sun SPARC Enterprise T5120 Server
- Sun SPARC Enterprise T5220 Server
- Sun SPARC Enterprise T5140 Server
- Sun SPARC Enterprise T2000 Server
- Sun SPARC Enterprise T5240 Server
- Sun Fire T2000 Server
|
Related Categories |
- PLA-Support>Sun Systems>SPARC>CMT>SN-SPARC: Tx000
|
In this Document
Created from <SR 3-9255606672>
Applies to:
Sun Fire T2000 Server - Version All Versions and later
Sun SPARC Enterprise T5140 Server - Version All Versions and later
Sun SPARC Enterprise T5220 Server - Version All Versions and later
Sun SPARC Enterprise T2000 Server - Version All Versions and later
Sun SPARC Enterprise T5240 Server - Version Not Applicable and later
Information in this document applies to any platform.
Symptoms
System won't boot at all from the internal disks (single user shown here):
{0} ok boot -sv
Boot device: /pci@7c0/pci@0/pci@1/pci@0,2/LSILogic,sas@2/disk@0,0:a File and args: -sv
FATAL: /packages/deblocker: Last Trap: Non-Resumable Error
Total Number of Non-resumable traps = 1
Non-resumable Error service report:
EHDL: 10000000000c2a
STICK: 1d61c96880
EDESC: 2
EATTR: 2000002
RA: -1
SIZ: 40
CPUID: 0
TL: 1
%TL:1 %TT:7f %TPC:f020ac64 %TnPC:f020ac68
%TSTATE:8820001600 %CWP:0
%PSTATE:16 AG:0 IE:1 PRIV:1 AM:0 PEF:1 RED:0 MM:0 TLE:0 CLE:0 MG:0 IG:0
%ASI:20 %CCR:88 XCC:Nzvc ICC:Nzvc
%TL:2 %TT:180 %TPC:f02496d8 %TnPC:f02496dc
%TSTATE:1994f001400 %CWP:0
%PSTATE:14 AG:0 IE:0 PRIV:1 AM:0 PEF:1 RED:0 MM:0 TLE:0 CLE:0 MG:0 IG:0
%ASI:4f %CCR:99 XCC:NzvC ICC:NzvC
Normal GL=1
0: 0 0
1: f0200000 0
2: f0200000 0
3: fff78000 0
4: 2000 7fff22000
5: 0 7fff22000
6: 0 7fff22600
7: 0 f0243cec
%PC f020ac64 %nPC f020ac68
%TBA f0200000 %CCR 88200016 XCC:nzvC ICC:nZVc
Consecutive boot attempts will show that the boot has been disabled (due to the non-resumable error):
{0} ok boot disk1
FATAL: system is not bootable, boot command is disabled
{0} ok boot disk
FATAL: system is not bootable, boot command is disabled
POST running on MIN shows no errors:
0:0:0>
0:0:0>Sun Fire[TM] T2000 POST 4.30.4.b 2010/07/09 14:24
/export/delivery/delivery/4.30/4.30.4.b/post4.30.4-micro/Niagara/ontario/integrated (root)
0:0:0>Copyright (c) 2010, Oracle and/or its affiliates. All rights reserved.
0:0:0>VBSC cmp0 arg is: ffffffff.00000201
0:0:0>POST enabling threads: 00000000.ffffffff
0:0:0>VBSC cntl arg is: ffffffff.00000201
0:0:0>VBSC selecting POST MIN Testing.
0:0:0>VBSC setting verbosity level 2
0:0:0>Start Selftest.....
0:0:0>Master CPU Tests Basic....Done
0:0:0>Init MMU.....
0:0:0>L2 Tests....Done
0:0:0>Test Memory....Done
0:0:0>Setup POST Mailbox ....Done
0:0:0>Extended CPU Tests....Done
0:0:0>Scrub Memory....Done
0:0:0>Extended Memory Tests....Done
0:0:0>IO-Bridge Tests....Done
2014-06-29 11:54:29.469 0:0:0>INFO:
2014-06-29 11:54:29.475 0:0:0> POST Passed all devices.
2014-06-29 11:54:29.499 0:0:0>POST: Return to VBSC.
2014-06-29 11:54:29.516 0:0:0>Master set ACK for vbsc runpost command and spin...
Cause
Issue is most likely caused by a DIMM problem.
MAX POST should flag the culprit DIMMs (taken from the output of the same system as the MIN POST above, sample errors):
2014-07-01 21:45:33.296 0:5:1>
2014-07-01 21:45:33.304 0:5:1>ERROR: TEST = Queue Block Mem Test
2014-07-01 21:45:33.319 0:5:1>H/W under test = MB/CMP0/CH3/R1/D0/S1 (J2401)
2014-07-01 21:45:33.335 0:5:1>Repair Instructions: Replace items in order listed by 'H/W under test' above.
2014-07-01 21:45:33.360 0:5:1>MSG = Pin 13 failed on MB/CMP0/CH3/R1/D0/S1 (J2401)
2014-07-01 21:45:33.424 0:5:1>END_ERROR
2014-07-01 21:45:33.433 0:5:1>
2014-07-01 21:45:33.441 0:5:1>ERROR: TEST = Queue Block Mem Test
2014-07-01 21:45:33.456 0:5:1>H/W under test = MB/CMP0/CH3/R1/D0/S1 (J2401)
2014-07-01 21:45:33.473 0:5:1>Repair Instructions: Replace items in order listed by 'H/W under test' above.
2014-07-01 21:45:33.496 0:5:1>MSG = Pin 143 failed on MB/CMP0/CH3/R1/D0/S1 (J2401)
2014-07-01 21:45:33.513 0:5:1>END_ERROR
2014-07-01 21:46:26.973 0:0:0>ERROR:
2014-07-01 21:46:26.981 0:0:0> POST toplevel status has the following failures:
2014-07-01 21:46:27.003 0:0:0> MB/CMP0/CH3/R1/D0/S1 (J2301)
2014-07-01 21:46:27.136 0:0:0> MB/CMP0/CH3/R1/D0/S1 (J2401)
2014-07-01 21:46:27.154 0:0:0>END_ERROR
To have the system run in MAX POST, please perform one of these actions (depending on the SP in the system, ALOM or ILOM):
ILOM:
-> set /SYS keyswitch_state=diag
-> stop /SYS
-> start /SYS
-> start /SP/console
ALOM:
sc> setkeyswitch=diag
sc> poweroff
sc> poweron
sc> console -f
NOTE: Please remember to set the keyswitch back to normal mode after the logs have been collected or the system will run MAX POST every time you powercycle (which takes a much longer time than regular POST).
Solution
Open an Oracle Support SR to have the logs looked into and evaluated for possible DIMM replacement
References
<NOTE:1010565.1> - BOOT: Explanation of the "FATAL: system is not bootable, boot command is disabled" OBP message
<NOTE:1011227.1> - How to verify that the Boot devices can be seen for the Sun Fire[TM] T1000/T2000
Attachments
This solution has no attachment