Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition
Solution Type: Technical Instruction Sure Solution
1992981.1 : How to Replace an Exadata X5-2/X6-2 Storage Cell Infiniband Card

Oracle Confidential PARTNER - Available to partners (SUN).

Applies to:
Exadata X5-2 Hardware - Version All Versions and later
Exadata X5-2 Full Rack - Version All Versions and later
Exadata X5-2 Eighth Rack - Version All Versions and later
Exadata X5-2 Quarter Rack - Version All Versions and later
Zero Data Loss Recovery Appliance X5 Hardware - Version All Versions and later
Information in this document applies to any platform.

Goal
How to Replace an Exadata X5-2/X6-2 Storage Cell Infiniband Card

Solution
DISPATCH INSTRUCTIONS
2. For extended information on this section check MOS Note:

SQL> select dg.name,a.value from v$asm_attribute a, v$asm_diskgroup dg where a.name = 'disk_repair_time' and a.group_number = dg.group_number;
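
The query above returns `disk_repair_time` as a string with an optional unit suffix (`h` or `m`; with no suffix, ASM treats the number as hours). As a minimal sketch, the helper below converts such a value to minutes so it can be compared with the expected outage. The function name `repair_time_minutes` is hypothetical and not part of any Oracle tool.

```shell
# Hypothetical helper: convert a disk_repair_time value to whole minutes.
# $1 is the value column from the SQL query, for example 3.6h, 8.5H or 90m.
# A value with no unit suffix is treated as hours.
repair_time_minutes() {
    awk -v v="$1" 'BEGIN {
        num = v
        sub(/[HhMm]$/, "", num)                    # strip the unit suffix, if any
        unit = tolower(substr(v, length(v), 1))    # last character of the value
        mins = (unit == "m") ? num : num * 60      # hours (or no unit) -> minutes
        printf "%.0f\n", mins
    }'
}
```

For example, `repair_time_minutes 3.6h` prints `216`, which can then be compared against the planned downtime in minutes.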
# cellcli -e list griddisk attributes name,asmmodestatus,asmdeactivationoutcome
...sample ...
CATALOG_CD_09_zdlx5_tvp_a_cel3 ONLINE Yes
CATALOG_CD_10_zdlx5_tvp_a_cel3 ONLINE Yes
CATALOG_CD_11_zdlx5_tvp_a_cel3 ONLINE Yes
DELTA_CD_00_zdlx5_tvp_a_cel3 ONLINE Yes
DELTA_CD_01_zdlx5_tvp_a_cel3 ONLINE Yes
DELTA_CD_02_zdlx5_tvp_a_cel3 ONLINE Yes
...repeated for all griddisks....
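
With many grid disks, scanning the listing by eye is error-prone. The sketch below reads the three-column listing shown above on standard input and reports any grid disk whose `asmdeactivationoutcome` is not `Yes`. The function name `griddisks_safe_to_deactivate` is hypothetical, and the parsing assumes the whitespace-separated layout of the sample output.

```shell
# Hypothetical helper: read the output of
#   cellcli -e list griddisk attributes name,asmmodestatus,asmdeactivationoutcome
# on stdin. Print the name of every grid disk whose asmdeactivationoutcome
# is not "Yes". Return non-zero if any such disk exists or no rows were read.
griddisks_safe_to_deactivate() {
    awk '
        $1 ~ /^\.\.\./ { next }          # skip "...sample..." style filler lines
        NF == 3 {
            rows++
            if ($3 != "Yes") { print $1; bad++ }
        }
        END { if (rows == 0 || bad > 0) exit 1 }
    '
}
```

A possible use is `cellcli -e list griddisk attributes name,asmmodestatus,asmdeactivationoutcome | griddisks_safe_to_deactivate && echo "safe to inactivate"`, run as root on the cell before the grid disks are made inactive.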
# cellcli
CellCLI> ALTER GRIDDISK ALL INACTIVE
...sample ...
GridDisk CATALOG_CD_09_zdlx5_tvp_a_cel3 successfully altered
GridDisk CATALOG_CD_10_zdlx5_tvp_a_cel3 successfully altered
GridDisk CATALOG_CD_11_zdlx5_tvp_a_cel3 successfully altered
GridDisk DELTA_CD_00_zdlx5_tvp_a_cel3 successfully altered
GridDisk DELTA_CD_01_zdlx5_tvp_a_cel3 successfully altered
GridDisk DELTA_CD_02_zdlx5_tvp_a_cel3 successfully altered
...repeated for all griddisks...
CellCLI> list griddisk attributes name,status,asmmodestatus,asmdeactivationoutcome
CATALOG_CD_09_zdlx5_tvp_a_cel3 inactive OFFLINE Yes
CATALOG_CD_10_zdlx5_tvp_a_cel3 inactive OFFLINE Yes
CATALOG_CD_11_zdlx5_tvp_a_cel3 inactive OFFLINE Yes
DELTA_CD_00_zdlx5_tvp_a_cel3 inactive OFFLINE Yes
DELTA_CD_01_zdlx5_tvp_a_cel3 inactive OFFLINE Yes
DELTA_CD_02_zdlx5_tvp_a_cel3 inactive OFFLINE Yes
...repeated for all griddisks...

7. Once all disks are offline and inactive, the customer may shut down the Cell using the following command:
# shutdown -hP now
2. Swivel the air baffle into the upright position to allow access to the PCIe cards.
3. Remove the two InfiniBand cables from the InfiniBand card in PCIe slot 3, making note of their locations so they are plugged back into the same ports after replacement.
4. Rotate the PCIe card locking mechanism, and then lift up on the PCIe card to disengage it from the motherboard connectors.

3. Lower the air baffle to the installed position.
4. Install the top cover.

1. Once the power cords have been re-attached, slide the server back into the rack.
Step D. Server Services Startup Validation
1. After the OS is up, log in as root.
2. If the HCA is part of an InfiniBand network where IB partitions exist, follow steps 3 and 4 or step 5 of Doc ID 1985159.1; otherwise go to the next step.
3. Because the cables were disconnected, verify that the InfiniBand links are back up at 40 Gb/sec:
# /usr/sbin/ibstatus
Infiniband device 'mlx4_0' port 1 status:
    default gid: fe80:0000:0000:0000:0010:e000:0159:c61d
    base lid: 0x9
    sm lid: 0x2
    state: 4: ACTIVE
    phys state: 5: LinkUp
    rate: 40 Gb/sec (4X QDR)
    link_layer: IB
Infiniband device 'mlx4_0' port 2 status:
    default gid: fe80:0000:0000:0000:0010:e000:0159:c61e
    base lid: 0xa
    sm lid: 0x2
    state: 4: ACTIVE
    phys state: 5: LinkUp
    rate: 40 Gb/sec (4X QDR)
    link_layer: IB
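
The link check above can also be scripted. The following sketch reads `ibstatus` output on standard input and succeeds only if the expected number of ports is present and every port reports `ACTIVE`, `LinkUp`, and a rate of 40 Gb/sec. The function name `ib_links_ok` and the default of two ports are assumptions for illustration, and the parsing relies on the field layout shown in the sample output.

```shell
# Hypothetical helper: read /usr/sbin/ibstatus output on stdin.
# $1 is the expected number of ports (defaults to 2, an assumption based on
# the dual-port sample above). Returns 0 only if that many ports are listed
# and each one is ACTIVE, LinkUp and running at 40 Gb/sec.
ib_links_ok() {
    awk -v want="${1:-2}" '
        /^Infiniband device/ { ports++ }
        /^[ \t]*state:/      { if ($NF == "ACTIVE") active++ }
        /^[ \t]*phys state:/ { if ($NF == "LinkUp") linkup++ }
        /^[ \t]*rate:/       { if ($2 == "40" && $3 == "Gb/sec") fast++ }
        END {
            if (ports != want || active != ports || linkup != ports || fast != ports)
                exit 1
        }
    '
}
```

A possible invocation is `/usr/sbin/ibstatus | ib_links_ok 2 || echo "check InfiniBand cabling"`.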
4. Verify the InfiniBand topology (example of a fully operational system):
# /opt/oracle.SupportTools/ibdiagtools/verify-topology
[ DB Machine Infiniband Cabling Topology Verification Tool ]
Every node is connected to two leaf switches in a single rack.......................................................[SUCCESS]
Every inter-leaf switch link is connected correctly in a single rack................................................[SUCCESS]
Every leaf switch in an interconnected quarter rack is correctly connected to other rack in a multi-rack group......[NOT APPLICABLE]
Every leaf switch is connected to every spine switch in a multi-rack group..........................................[NOT APPLICABLE]
Every rack has balanced inter-leaf-and-spine switch links in a multi-rack group.....................................[NOT APPLICABLE]
No spine switch is connected to another spine switch in a multi-rack group..........................................[NOT APPLICABLE]
Every spine switch is connected to two external spine switches in a multi-rack group................................[NOT APPLICABLE]
No external spine switch is connected to a leaf switch in a multi-rack group........................................[NOT APPLICABLE]
No external spine switch is connected to another external spine switch in a multi-rack group........................[NOT APPLICABLE]
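
To flag anything other than the two results shown in the example, the output can be filtered. This sketch assumes each check line is padded with dots and ends in a bracketed status, as in the sample; it does not assume any particular wording for a failed check and simply prints every check whose status is neither `[SUCCESS]` nor `[NOT APPLICABLE]`. The function name `topology_failures` is hypothetical.

```shell
# Hypothetical helper: read verify-topology output on stdin.
# Print every check line whose bracketed status is neither SUCCESS nor
# NOT APPLICABLE. Return non-zero if any such line exists or no check
# lines were found at all.
topology_failures() {
    awk '
        /\.\[[^]]*\][ \t]*$/ {                       # dot-padded line ending in [STATUS]
            checks++
            if ($0 !~ /\[(SUCCESS|NOT APPLICABLE)\][ \t]*$/) { print; bad++ }
        }
        END { if (checks == 0 || bad > 0) exit 1 }
    '
}
```

For instance, `/opt/oracle.SupportTools/ibdiagtools/verify-topology | topology_failures` prints nothing and returns 0 for the output shown above.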
# cellcli
CellCLI> alter griddisk all active
GridDisk CATALOG_CD_09_zdlx5_tvp_a_cel3 successfully altered
GridDisk CATALOG_CD_10_zdlx5_tvp_a_cel3 successfully altered
GridDisk CATALOG_CD_11_zdlx5_tvp_a_cel3 successfully altered
GridDisk DELTA_CD_00_zdlx5_tvp_a_cel3 successfully altered
GridDisk DELTA_CD_01_zdlx5_tvp_a_cel3 successfully altered
GridDisk DELTA_CD_02_zdlx5_tvp_a_cel3 successfully altered
...repeated for all griddisks...
CellCLI> list griddisk
CATALOG_CD_09_zdlx5_tvp_a_cel3 active
CATALOG_CD_10_zdlx5_tvp_a_cel3 active
CATALOG_CD_11_zdlx5_tvp_a_cel3 active
DELTA_CD_00_zdlx5_tvp_a_cel3 active
DELTA_CD_01_zdlx5_tvp_a_cel3 active
DELTA_CD_02_zdlx5_tvp_a_cel3 active
...repeated for all griddisks...
CellCLI> list griddisk attributes name,status,asmmodestatus,asmdeactivationoutcome
CATALOG_CD_09_zdlx5_tvp_a_cel3 active SYNCING Yes
CATALOG_CD_10_zdlx5_tvp_a_cel3 active SYNCING Yes
CATALOG_CD_11_zdlx5_tvp_a_cel3 active SYNCING Yes
DELTA_CD_00_zdlx5_tvp_a_cel3 active SYNCING Yes
DELTA_CD_01_zdlx5_tvp_a_cel3 active SYNCING Yes
DELTA_CD_02_zdlx5_tvp_a_cel3 active SYNCING Yes
...repeated for all griddisks...
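
Grid disks that show `SYNCING` are still being resynchronized by ASM and are expected to move to `ONLINE` once that completes. As a rough sketch, the helpers below re-run the four-column listing until every grid disk is `active` and `ONLINE`. Both function names and the 60-second default interval are assumptions for illustration, not part of CellCLI.

```shell
# Hypothetical helper: read the output of
#   cellcli -e list griddisk attributes name,status,asmmodestatus,asmdeactivationoutcome
# on stdin. Print "name asmmodestatus" for every grid disk that is not yet
# active and ONLINE. Return non-zero if any remain or no rows were read.
griddisks_all_online() {
    awk '
        $1 ~ /^\.\.\./ { next }          # skip "...repeated..." style filler lines
        NF == 4 {
            rows++
            if ($2 != "active" || $3 != "ONLINE") { print $1, $3; pending++ }
        }
        END { if (rows == 0 || pending > 0) exit 1 }
    '
}

# Hypothetical wrapper: poll until every grid disk is ONLINE.
# $1 is the polling interval in seconds (defaults to 60, an arbitrary choice).
# Note that this loops indefinitely if cellcli returns no rows.
wait_for_griddisks_online() {
    interval="${1:-60}"
    until cellcli -e list griddisk attributes name,status,asmmodestatus,asmdeactivationoutcome \
            | griddisks_all_online; do
        sleep "$interval"
    done
}
```

Running `wait_for_griddisks_online 120` as root on the cell would print the disks still resynchronizing every two minutes and return once none remain.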
1985159.1 Updating IB partitions after replacing an Infiniband HCA in any nodes within IB network - steps to do after replacing HCA