Asset ID: |
1-72-1676140.1 |
Update Date: | 2017-07-12 |
Keywords: | |
Solution Type
Problem Resolution Sure
Solution
1676140.1
:
T5240: After Hang, Unable To Boot OS.
Related Items |
- Sun SPARC Enterprise T5240 Server
|
Related Categories |
- PLA-Support>Sun Systems>SPARC>CMT>SN-SPARC: T5xx0
|
In this Document
Created from <SR 3-9041573531>
Applies to:
Sun SPARC Enterprise T5240 Server - Version All Versions and later
Information in this document applies to any platform.
Symptoms
The system was in hang and customer had to shutdown by the power bottom. After this scenario, the system could NOT boot up from the OS, customer sent an snapshot from the ILOM to identify the issue in the system.
Cause
After the reboot followed by root, customer said that the system was in hang. He said that there was a hardware issue, however, nothing showed from the POST
ODM research:
BCM5466R consists of four complete 10/100/1000BASE-T Gigabit Ethernet transceivers integrated on a single monolithic CMOS chip.
Failed Gigabit interfaces onboard.
0:0:0>NEPTUNE Network Interface Unit Tests....Done
0:0:0>
0:0:0>ERROR: TEST = BCM5466R PHY Block Port 0 init test
0:0:0>H/W under test = MB/PCIE-SWITCH1/GBE,MB/PHY0
0:0:0>Repair Instructions: Replace items in order listed by 'H/W under test' above.
0:0:0>MSG = BCM5466 - PHY Error - Cntl Reg,
address 00000000.00000000
expected 00000000.00001140
observed 00000000.00001000
0:0:0>END_ERROR
May 8 14:37:47 netbkup Corrupt label; wrong magic number
May 8 14:37:47 netbkup scsi: WARNING: /pci@500/pci@0/pci@d/SUNW,qlc@0,1/fp@0,0/ssd@w50000972082bd155,5 (ssd865):
May 8 14:37:47 netbkup Corrupt label; wrong magic number
May 8 14:38:00 netbkup nxge: NOTICE: nxge1: xcvr addr:0x1c - link is down
May 8 14:38:02 netbkup nxge: NOTICE: nxge2: xcvr addr:0x1b - link is down
May 8 14:38:04 netbkup nxge: NOTICE: nxge1: xcvr addr:0x1c - link is down
May 8 14:38:04 netbkup nxge: NOTICE: nxge2: xcvr addr:0x1b - link is down
May 8 14:38:05 netbkup nxge: NOTICE: nxge3: xcvr addr:0x1a - link is down
May 8 14:38:07 netbkup scsi: WARNING: /pci@500/pci@0/pci@d/SUNW,qlc@0/fp@0,0/ssd@w5006048ad52fbdbc,294 (ssd6):
May 8 14:38:07 netbkup Corrupt label; wrong magic number
May 8 14:38:09 netbkup nxge: NOTICE: nxge1: xcvr addr:0x1c - link is down
May 8 14:38:11 netbkup nxge: NOTICE: nxge2: xcvr addr:0x1b - link is down
May 8 14:38:13 netbkup nxge: NOTICE: nxge1: xcvr addr:0x1c - link is down
May 8 14:38:13 netbkup nxge: NOTICE: nxge2: xcvr addr:0x1b - link is down
May 8 14:38:14 netbkup nxge: NOTICE: nxge3: xcvr addr:0x1a - link is down
May 8 14:38:17 netbkup scsi: WARNING: /pci@500/pci@0/pci@d/SUNW,qlc@0,1/fp@0,0/ssd@w50000972082bd155,4 (ssd866):
May 8 14:38:17 netbkup Corrupt label; wrong magic number
May 8 14:38:20 netbkup reboot: rebooted by root
May 8 14:38:32 netbkup avrd[2140]: Daemon has terminated due to signal (15)
May 8 14:38:32 netbkup vmd[2095]: volume daemon terminating because it received a signal (15)
May 8 14:38:32 netbkup acsd[2117]: Daemon has terminated due to signal (15)
May 8 14:38:32 netbkup syslogd: going down on signal 15
[...]
WARNING: Power-off requested, system will now shutdown.
0:0:0>
0:0:0>SPARC(R) Enterprise T5140/T5240 POST 4.29.0.a 2008/09/15 12:29
/export/delivery/delivery/4.29/4.29.0.a/post4.29.0-micro/Niagara/maramba/integrated (root)
0:0:0>Copyright 2008 Sun Microsystems, Inc. All rights reserved
0:0:0>POST enabling CMP 0 threads: ffffffff.ffffffff
0:0:0>POST enabling CMP 1 threads: ffffffff.ffffffff
0:0:0>VBSC mode is: 00000000.00000001
0:0:0>VBSC level is: 00000000.00000001
0:0:0>VBSC selecting Normal mode, MAX Testing.
0:0:0>VBSC setting verbosity level 2
1:0:0>NODE 1 present
0:0:0>Test Memory....Done
0:0:0>Setup POST Mailbox ....Done
0:0:0>Master CPU Tests Basic....Done
0:0:0>Init MMU.....
0:0:0>L2 Tests....Done
0:0:0>Extended CPU Tests....Done
0:0:0>Scrub Memory....Done
0:0:0>Functional CPU Tests....Done
0:0:0>Extended Memory Tests....Done
0:0:0>SPU CWQ Tests...Done
0:0:0>MAU Tests...Done
0:0:0>NCU Setup and PIU link train....Done
0:0:0>NEPTUNE Network Interface Unit Tests....Done
0:0:0>
0:0:0>ERROR: TEST = BCM5466R PHY Block Port 0 init test
0:0:0>H/W under test = MB/PCIE-SWITCH1/GBE,MB/PHY0
0:0:0>Repair Instructions: Replace items in order listed by 'H/W under test' above.
0:0:0>MSG = BCM5466 - PHY Error - Cntl Reg,
address 00000000.00000000
expected 00000000.00001140
observed 00000000.00001000
0:0:0>END_ERROR
0:0:0>
0:0:0>ERROR: TEST = BCM5466R PHY Block Port 0 init test
0:0:0>H/W under test = MB/PCIE-SWITCH1/GBE,MB/PHY0
0:0:0>Repair Instructions: Replace items in order listed by 'H/W under test' above.
0:0:0>MSG = BCM5466 - PHY Error - Cntl Reg,
address 00000000.00000001
expected 00000000.00007949
observed 00000000.0000794d
0:0:0>END_ERROR
================================ IO Devices ================================
Slot + Bus Name + Model
Status Type Path
----------------------------------------------------------------------------
MB/SASHBA PCIE scsi-pciex1000,58 LSI,1068E
/pci@400/pci@0/pci@8/scsi@0
PCIE3 PCIX SUNW,emlxs-pci10df,fc20 LPe11000-S
/pci@400/pci@0/pci@d/SUNW,emlxs@0
MB/NET0 PCIE network-pciex108e,abcd SUNW,pcie-neptune
/pci@500/pci@0/pci@8/network@0
MB/NET1 PCIE network-pciex108e,abcd SUNW,pcie-neptune
/pci@500/pci@0/pci@8/network@0,1
MB/NET2 PCIE network-pciex108e,abcd SUNW,pcie-neptune
/pci@500/pci@0/pci@8/network@0,2
MB/NET3 PCIE network-pciex108e,abcd SUNW,pcie-neptune
/pci@500/pci@0/pci@8/network@0,3
MB/USB0 PCIE usb-pciclass,0c0310
/pci@400/pci@0/pci@1/pci@0/usb@0
MB/USB0 PCIE usb-pciclass,0c0310
/pci@400/pci@0/pci@1/pci@0/usb@0,1
MB/USB0 PCIE usb-pciclass,0c0320
/pci@400/pci@0/pci@1/pci@0/usb@0,2
Solution
The issue was in the motherboard. TSC ordered the motherboard replacement.
References
<NOTE:1356876.1> - Warning:1540: Firmware Update Required. (A Manual Hba Reset Or Link Reset (Using Luxadm Or Fcadm) Is Required
<NOTE:1489871.1> - Solaris Volume Manager (SVM) Mirrored Root Disk Server/System/Node Can Not Boot. Troubleshooting. Resolution Path
Attachments
This solution has no attachment