![]() | Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition | ||
|
|
![]() |
||||||||||||||||
Solution Type Technical Instruction Sure Solution 1969327.1 : How to Replace an Exadata X5-2/X6-2 Compute Node Internal HBA Cable Assembly
In this Document
Oracle Confidential PARTNER - Available to partners (SUN). Applies to:Exadata X5-2 Quarter Rack - Version All Versions and laterExadata X5-2 Hardware - Version All Versions and later Exadata X5-2 Full Rack - Version All Versions and later Exadata X5-2 Half Rack - Version All Versions and later Zero Data Loss Recovery Appliance X5 Hardware - Version All Versions and later Information in this document applies to any platform. GoalHow to Replace a Faulty internal HBA cable assembly on Exadata successfully in Exadata X5-2/X6-2 Compute Node Solution DISPATCH INSTRUCTIONS
TASK COMPLEXITY: 3
The server that contains the faulty cable assembly should have its services offline and system powered off.
The instructions below assume the customer DBA is available and working with the field engineer onsite to manage the host OS and
i. As root user do the following to stop crs and disable autostart of crs on reboot: # . oraenv
ORACLE_SID = [root] ? +ASM1 The Oracle base for ORACLE_HOME=/u01/app/11.2.0/grid is /u01/app/oracle # $ORACLE_HOME/bin/crsctl disable crs # $ORACLE_HOME/bin/crsctl stop crs or # <GI_HOME>/bin/crsctl stop crs where GI_HOME environment variable is typically set to “/u01/app/11.2.0/grid” but will depend on the customer's environment.
# ps -ef | grep css
For Compute Node running OVM proceed as follows:
# xm list
Name ID Mem VCPUs State Time(s) Domain-0 0 8192 4 r----- 409812.7 dm01db01vm01 8 8192 2 -b---- 156610.6 dm01db01vm02 9 8192 2 -b---- 152169.8 connect to each domain using the command # xm console domainname
where domainname would be dm01db01vm01 or dm01db01vm02 if using the above examples. Note: Omit the following command for OVM as it is not not required.
# $ORACLE_HOME/bin/crsctl disable crs Press CTRL+] to disconnect from the console.
# xm shutdown -a -w
(c) See what user domains are running (should be only Domain-0) # chkconfig xendomains off
3. Revert all the RAID disk volumes to WriteThrough mode to ensure all data in the RAID cache memory is flushed to disk and not lost # /opt/MegaRAID/MegaCli/MegaCli64 -ldsetprop wt -lall -a0
Verify the current cache policy for all logical volumes is now WriteThrough : # /opt/MegaRAID/MegaCli/MegaCli64 -ldpdinfo -a0 | grep BBU
4. The customer can now shutdown the server operating system: # shutdown -hP now
Reference links for Service Manual: 5. The field engineer can now slide out the server for maintenance. Do not remove any cables prior to sliding the server forward, or the (a) Remove all of the server fan modules. (b) Remove the IB cables from the IB card in slot 3 above the HBA making a note of which port each cable goes into so (c) Remove the PCIe riser from PCIe slots 3 and 4. (d) Open the green-tabbed latch located on the rear of the server chassis next to (e) To release the riser from the motherboard connector, lift the green-tabbed release (f) Slide the plastic PCIe card retainer, which is mounted on the side of the chassis, (g) Grasp the riser with both hands and remove it from the server. (h) Disconnect the SAS storage drive (HDD) cables from the internal HBA card (i) Disconnect the super capacitor cable from the internal HBA card in slot 4 (j) Disconnect each SAS cables from the disk backplane, press the latch on the (k) Carefully remove the SAS cables from the server.
Install the Internal HBA SAS Cable assembly (a) Carefully guide SAS cables and the super capacitor cable along the side of the chassis. (b) Connect the SAS cables and the super capacitor cable to the internal HBA card. (c) Install the SAS cables into the disk backplane (d) Install the PCIe riser with the internal HBA and IB HCA card into PCIe slot 3 (e) Install all of the server fan modules. Take care to get the cables re-connected to the same ports they were removed from. If reversed,
this may affect disk slot mappings. Take care to also put the IB cables back into the original ports, as well, in the correct orientation. IB cables are factory labeled with the port identification where port 2 is the port nearest the PCI connector, and port 1 is the port near the top side of the card. The cables should be inserted with the latch release tab on the down side, so they fully seat and latch. If inserted upside down, they will not fully seat or latch. Server Services Startup Validation: DB Node Startup: After the OS is up, login as root and validate the physical and logical volumes are seen properly from the new RAID HBA in the OS, # /opt/MegaRAID/MegaCli/MegaCli64 -LdInfo -Lall -a0 Adapter 0 -- Virtual Drive Information: # /opt/MegaRAID/MegaCli/MegaCli64 -PdList -a0 | grep "Slot\|Firmware\|Inq" Slot Number: 0 # /opt/MegaRAID/MegaCli/MegaCli64 -AdpBbuCmd -a0 BBU status for Adapter: 0 truncated..... Set all logical drives cache policy to WriteBack cache mode: # /opt/MegaRAID/MegaCli/MegaCli64 -ldsetprop wb -lall -a0
Verify the current cache policy for all logical drives is now using WriteBack cache mode: # /opt/MegaRAID/MegaCli/MegaCli64 -ldpdinfo -a0 | grep BBU
CRS services should now be started.
Startup CRS and re-enable autostart of crs. After the OS is up, the Customer DBA should validate that CRS is running. As root execute: # . oraenv
# $ORACLE_HOME/bin/crsctl enable crs where GI_HOME environment variable is typically set to “/u01/app/11.2.0/grid” but will depend on the customer's environment. # /u01/app/11.2.0/grid/bin/crsctl check crs CRS-4638: Oracle High Availability Services is online Validate that instances are running: # ps -ef |grep pmon
It should return a record for the ASM instance and a record for each database.
If the customer requires assistance please ask them to contact EEST engineer or parent case owner. Once the compute node has booted ,re-enable user domains to autostart during Domain-0 boot. # chkconfig xendomains on
Startup all user domains that are marked for auto start # service xendomains start
See what user domains are running (compare against result from previously collected data) # xm list
if any not auto-started then Startup a single user domain # xm create -c /EXAVMIMAGES/GuestImages/DomainName/vm.cfg
Check that crs has started in user domains ,refer to previous section "DB Node Startup Verification" Verify also the InfiniBand links are up at 40Gbps as the cables were disconnected: # /usr/sbin/ibstatus Infiniband device 'mlx4_0' port 1 status: Infiniband device 'mlx4_0' port 2 status:
OBTAIN CUSTOMER ACCEPTANCE Verify that HW Components and SW Components are returned to properly functioning state with server up and database services
7094264 - Cable kit ,includes SAS cable 7076125
1093890.1 Steps To Shutdown/Startup The Exadata & RDBMS Services and Cell/Compute Nodes On An Exadata Configuration. Attachments This solution has no attachment |
||||||||||||||||
|