Sun Microsystems, Inc.  Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition
   Home | Current Systems | Former STK Products | EOL Systems | Components | General Info | Search | Feedback

Asset ID: 1-72-1021453.1
Update Date:2014-12-16
Keywords:

Solution Type  Problem Resolution Sure

Solution  1021453.1 :   Sun Storage 3510 Array: Running probe-scsi-all May Cause Multiple Hosts to Hang  


Related Items
  • Sun Storage 3510 FC Array
  •  
Related Categories
  • PLA-Support>Sun Systems>DISK>Arrays>SN-DK: SE31xx_33xx_35xx
  •  

PreviouslyPublishedAs
271929


Applies to:

Sun Storage 3510 FC Array - Version Not Applicable and later
All Platforms

Symptoms

At the OBP prompt, the probe-scsi-all command may cause hosts to hang when there are multiple hosts connected to the same fibre channel loop.

Cause

When we configure hosts to boot from the Sun Storage 3510 FC array that is configured on the same fibre channel loop shared between those hosts, if a probe-scsi-all command is issued at the OBP prompt on one host, other hosts might not be able to boot from OBP. If other hosts are running, it might cause those hosts to hang with 'Loop OFFLINE' messages. The following is the message from a running host when issuing probe-scsi-all on the other server:

Sep 23 17:25:59 xxxxxx lw8: [ID 555425 kern.notice] Main, up 53 days
00:51:16, Memory 8,755,544
Sep 23 21:25:59 xxxxxx lw8: [ID 944324 kern.notice] Main, up 53 days
04:51:16, Memory 8,591,008
Sep 24 00:57:59 xxxxxx qlc: [ID 630585 kern.info] NOTICE: Qlogic
qlc(2): Loop OFFLINE
Sep 24 05:35:38 xxxxxx genunix: [ID 540533 kern.notice] ^MSunOS
Release 5.10 Version Generic_118833-24 64-bit
Sep 24 05:35:38 xxxxxx genunix: [ID 172907 kern.notice] Copyright
1983-2006 Sun Microsystems, Inc. All rights reserved.

This was tested in the following environment.

V480 x2 + SE3510
V480-a:PCI#0 ---- SE3510:Controller#A CH0
V480-a:PCI#1 ---- SE3510:Controller#B CH1
V480-b:PCI#0 ---- SE3510:Controller#A CH1
V480-b:PCI#1 ---- SE3510:Controller#B CH0
- HBA: P/N(375-3363), FW(1.17), Jumper(1-2)
- 3510 Array:Controller P/N(371-0532), FW(4.21F)


1. Test 1 (reproduced)

(1) v480a power on
(2) v480b power on
(3) v480a probe-scsi-all
(4) v480b probe-scsi-all
(5) v480a boot <-- boot fail
(6) v480b boot <-- boot fail

2. Test 2 (reproduced, though changed v480a v480b boot order)

(1) v480b power on
(2) v480a power on
(3) v480b probe-scsi-all
(4) v480a probe-scsi-all
(5) v480b boot <-- boot fail
(6) v480a boot <-- boot fail

3. Test 3 (not reproduced if one node booted earlier)

(1) v480a power on
(2) v480a probe-scsi-all
(3) v480a boot
(4) v480b power on
(5) v480b probe-scsi-all
(6) v480b boot

4. Test 4 (not reproduced if issue reset-all after probe-scsi)

(1) v480a power on
(2) v480b power on
(3) v480a probe-scsi-all
(4) v480a reset-all
(5) v480b probe-scsi-all
(6) v480b reset-all
(7) v480a boot
(8) v480b boot


--------------------
Test log excerpt
--------------------
- power on v480a and v480b
- doing probe-scsi-all on v480a
===================
{2} ok probe-scsi-all
/pci@9,600000/SUNW,qlc@2
LiD HA LUN --- Port WWN --- ----- Disk description -----
0 0 0 2100000c50ac5725 SEAGATE ST336605FSUN36G 0638
1 1 0 2100000c50696a0c SEAGATE ST336607FSUN36G 0307
/pci@8,600000/SUNW,qlc@2,1
QLA2342 Host Adapter fcode version 1.17 08/09/07
Firmware version 3.03.23
Link not ready - Loss of Sync
/pci@8,600000/SUNW,qlc@2
QLA2342 Host Adapter fcode version 1.17 08/09/07
Firmware version 3.03.23
*************************** Local Loop Devices ***************************
Adapter LoopId - 0
Device AL-PA a5 loopId 2a Hard-addr 2a Port WWN 226000c0ffa01f53
Lun 0 SUN StorEdge 3510 413C
Device AL-PA a3 loopId 2b Hard-addr 2b Port WWN 226000c0ffb01f53
Lun 0 SUN StorEdge 3510 413C
/pci@8,600000/SUNW,qlc@1,1
QLA2342 Host Adapter fcode version 1.17 08/09/07
Firmware version 3.03.23
Link not ready - Loss of Sync
/pci@8,600000/SUNW,qlc@1
QLA2342 Host Adapter fcode version 1.17 08/09/07
Firmware version 3.03.23
*************************** Local Loop Devices ***************************
Adapter LoopId - 0
Device AL-PA a7 loopId 28 Hard-addr 28 Port WWN 216000c0ff801f53
Lun 0 SUN StorEdge 3510 413C
Device AL-PA a6 loopId 29 Hard-addr 29 Port WWN 216000c0ff901f53
Lun 0 SUN StorEdge 3510 413C
===================
Issue probe-scsi-all on v480b.
===================
{2} ok probe-scsi-all
/pci@9,600000/SUNW,qlc@2
LiD HA LUN --- Port WWN --- ----- Disk description -----
0 0 0 21000004cf98a7bd SEAGATE ST373405FSUN72G 0638
1 1 0 21000004cf989c39 SEAGATE ST373405FSUN72G 0638
/pci@8,600000/SUNW,qlc@2,1
QLA2342 Host Adapter fcode version 1.17 08/09/07
Firmware version 3.03.23
Link not ready - Loss of Sync
/pci@8,600000/SUNW,qlc@2
QLA2342 Host Adapter fcode version 1.17 08/09/07
Firmware version 3.03.23
Link not ready - Waiting for AL_PA
/pci@8,600000/SUNW,qlc@1,1
QLA2342 Host Adapter fcode version 1.17 08/09/07
Firmware version 3.03.23
Link not ready - Loss of Sync
/pci@8,600000/SUNW,qlc@1
QLA2342 Host Adapter fcode version 1.17 08/09/07
Firmware version 3.03.23
Link not ready - Waiting for AL_PA
===================

 

Solution

The following steps are the suggested workaround.

If all systems on the same fibre channel loop are at the OBP prompt, issue a 'reset-all' after performing the 'probe-scsi-all' command.

If any systems on the shared fibre channel loop are up and running, do not issue a 'probe-scsi-all' without first shutting down those machines.

 

Do you still have questions?  You can use My Oracle Support Communities.  Communities put you in touch with industry professionals like yourself.  They are monitored by Oracle support engineers, so you can expect reliable and correct answers.  Ask questions and see what others are asking about in the Disk Storage 2000, 3000, 6000 RAID Arrays & JBODs Community.

 


Attachments
This solution has no attachment
  Copyright © 2018 Oracle, Inc.  All rights reserved.
 Feedback