Asset ID: |
1-72-2053702.1 |
Update Date: | 2017-09-01 |
Keywords: | |
Solution Type
Problem Resolution Sure
Solution
2053702.1
:
Fabric Interconnect (Formerly Xsigo) reports IO Link Is Down
Related Items |
- Oracle Fabric Interconnect F1-15
- Oracle Fabric Interconnect F1-4
|
Related Categories |
- PLA-Support>Sun Systems>SAND>Network>SN-SND: Oracle Virtual Networking
|
In this Document
Created from <SR 3-11230367361>
Applies to:
Oracle Fabric Interconnect F1-15 - Version All Versions to All Versions [Release All Releases]
Oracle Fabric Interconnect F1-4 - Version All Versions to All Versions [Release All Releases]
Information in this document applies to any platform.
Symptoms
XMS is showing IO Link is down for one of the 10G Ethernet cards. ESX hosts alarms report one of the two redundant vnics has dropped.
Cause
OpenSM dropped port for IO Card 1:
Aug 20 13:19:36 298154 [B661AB90] 0x02 -> drop_mgr_remove_port: Removed port with GUID:0x0013970301001bfc LID range [5, 5] of node:Xsigo Systems, xsigo2 slot=1 vn10gcard
OpenSM notes IO Card 1 port is back up after resetting the IO Card:
Aug 20 15:05:57 458195 [B661AB90] 0x02 -> state_mgr_report_new_ports: Discovered new port with GUID:0x0013970301001bfc LID range [5,5] of node: Xsigo Systems, xsigo2 slot=1 vn10gcard
Here is time stamp of when manually set this IO Card down - notice it correlates to the time stamp of when OpenSM reported it discovered this new port:
Aug 20 14:59:19 fpp chassisCtr[469]: [NOTICE] chassisctr fpp-1 [chassis::iocardadminstatechange] [IOCARD=1] (changeAdminState) New AdminState=adminStateDown, OperState=operStateUp, Qualifier=default
Aug 20 14:59:19 fpp chassisCtr[469]: [ALERT] chassisctr fpp-1 [chassis::iocardoff] [IOCARD=1] Powered off.
Aug 20 14:59:19 fpp chassisCtr[469]: [NOTICE] chassisctr fpp-1 [chassis::cardstatechange] (reportState) [IOCARD=1] Operational state changed. OldState=operStateUp, NewState=operStateDown, Qualifier=default
Aug 20 14:59:19 fpp chassisCtr[469]: [NOTICE] chassisctr fpp-1 [IOCARD=1] Power Down
In this case the cause of this is due to the bug noted below. The fix checked into 4.x XgOS is to *reset* the card whenever ibport logical state is !=ACTIVE and the ibport logical state is down by OpenSM. The OpenSM logs do note it removed this port on Aug 20 @ 13:19:36 which is the signature of this bug
The fix for this issue is in 4.x XgOS.
16336525 XT/XT2 Driver Reset the Card Whenever ibport logical state != ACTIVE and the ibport logical state Is Down By opensm.
OpenSM dropped port for IO Card 1:
Aug 20 13:19:36 298154 [B661AB90] 0x02 -> drop_mgr_remove_port: Removed port with GUID:0x0013970301001bfc LID range [5, 5] of node:Xsigo Systems, xsigo2 slot=1 vn10gcard
Solution
Upgrade to 4.0.4 XgOS which is current GA XgOS. Bug fix was initially released in 4.0.0 XgOS and subsequently all following 4.x XgOS releases including 4.0.4 XgOS.
16336525 XT/XT2 Driver Reset the Card Whenever ibport logical state != ACTIVE and the ibport logical state Is Down By opensm.
References
<BUG:16336525> - [21394]XT/XT2 DRIVER RESET THE CARD WHENEVER IBPORT LOGICAL STATE ! = ACTIVE AND
<NOTE:1517366.1> - How to Create/Upload Diagnostic Log File Bundle for an Oracle Fabric Interconnect
<NOTE:1578001.1> - IO Card Failed/Rebooted With “XT_EVENT_HW_HANG with EVENT_SEV_MAX” Error
Attachments
This solution has no attachment