Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition
Solution Type: Technical Instruction Sure Solution
2117397.1 : How to Replace an Exadata X4-8, X5-8, X6-8 Compute Node Internal RAID HBA Card
Oracle Confidential PARTNER - Available to partners (SUN).

Applies to:
Exadata X5-8 Hardware - Version All Versions and later
Exadata X4-8 Hardware - Version All Versions and later
Exadata X6-8 Hardware - Version All Versions and later
Information in this document applies to any platform.

Goal
How to Replace an Exadata X4-8, X5-8, X6-8 Compute Node Internal RAID HBA Card.

Solution
DISPATCH INSTRUCTIONS
TIME ESTIMATE: 90 Minutes

FIELD ENGINEER/ADMINISTRATOR INSTRUCTIONS:
PROBLEM OVERVIEW: An Exadata X4-8, X5-8, or X6-8 Compute Node internal RAID HBA card needs replacement.
WHAT STATE SHOULD THE SYSTEM BE IN TO BE READY TO PERFORM THE RESOLUTION ACTIVITY?:
- The server that contains the faulty HBA should have its services offline and the system powered off.
The instructions below assume the customer DBA is available and working with the field engineer onsite to manage the host OS and
titled "Non-Emergency Power Procedures" section "Powering Off Oracle Exadata Rack" sub-section "Powering off Database Servers" available on the customer's cell
If running OVM, go to the section "For Compute Node running OVM". For non-OVM, proceed as follows:
# . oraenv
ORACLE_SID = [root] ? +ASM1
The Oracle base for ORACLE_HOME=/u01/app/11.2.0/grid is /u01/app/oracle
# $ORACLE_HOME/bin/crsctl disable crs
# $ORACLE_HOME/bin/crsctl stop crs
or
# <GI_HOME>/bin/crsctl stop crs
# ps -ef | grep css
For Compute Node running OVM proceed as follows:
If there are any concerns, engage an EEST engineer. The customer should perform the following:
(a) See what user domains are running (record the result).
Connect to the management domain (domain zero, or dom0). This is an example with just two user domains and the management domain Domain-0:
# xm list
Name ID Mem VCPUs State Time(s)
Domain-0 0 8192 4 r----- 409812.7
dm01db01vm01 8 8192 2 -b---- 156610.6
dm01db01vm02 9 8192 2 -b---- 152169.8
Connect to each domain using the command:
# xm console domainname
where domainname would be dm01db01vm01 or dm01db01vm02 if using the above example.
Shut down any instances of CRS in all user domains; refer to the CRS shutdown example in the previous section.
Note: Omit the following command for OVM as it is not required.
# $ORACLE_HOME/bin/crsctl disable crs
Press CTRL+] to disconnect from the console.
(b) Shut down all user domains from dom0:
# xm shutdown -a -w
(c) See what user domains are running (should be only Domain-0):
# xm list
(d) Disable user domains from auto-starting during dom0 boot:
# chkconfig xendomains off
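Recording the user domains before shutdown makes the later comparison easier. The following is a minimal sketch, not part of the Oracle procedure: `list_user_domains` is a hypothetical helper that reads `xm list` output on standard input and prints every domain name except Domain-0. It assumes the column layout shown in the example above, with the header on the first line and the domain name in the first column.

```shell
#!/bin/sh
# Hypothetical helper (an illustration only, not an Oracle-supplied tool).
# Reads `xm list` output on stdin and prints the user domain names,
# skipping the header line and the management domain Domain-0.
list_user_domains() {
    awk 'NR > 1 && $1 != "Domain-0" { print $1 }'
}

# Sample text copied from the example above. On a real dom0 the input
# would come from:  xm list | list_user_domains
sample='Name ID Mem VCPUs State Time(s)
Domain-0 0 8192 4 r----- 409812.7
dm01db01vm01 8 8192 2 -b---- 156610.6
dm01db01vm02 9 8192 2 -b---- 152169.8'

printf '%s\n' "$sample" | list_user_domains
```

Saving that list to a file before the shutdown gives a record to compare against after the server is back up. After `xm shutdown -a -w`, the same helper should print nothing.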
3. Revert all the RAID disk volumes to WriteThrough mode to ensure all data in the RAID cache memory is flushed to disk and not lost:
# /opt/MegaRAID/MegaCli/MegaCli64 -ldsetprop wt -lall -a0
Verify the current cache policy for all logical volumes is now WriteThrough:
# /opt/MegaRAID/MegaCli/MegaCli64 -ldpdinfo -a0 | grep BBU
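The verification above relies on reading the grep output by eye. As a hedged sketch, the check could be scripted as follows. `cache_policy_is` is a hypothetical helper, and the sample lines are an assumption about the typical "Default Cache Policy" and "Current Cache Policy" lines that MegaCli prints; the real output on a given system may differ.

```shell
#!/bin/sh
# Hypothetical helper (not part of MegaCli or the Oracle procedure).
# Reads MegaCli output on stdin and succeeds only if at least one
# "Current Cache Policy" line is present and every such line starts
# with the expected mode given as $1 (WriteThrough or WriteBack).
cache_policy_is() {
    awk -v want="$1" '
        /Current Cache Policy/ {
            seen++
            if ($0 !~ ("Current Cache Policy: *" want)) bad++
        }
        END { exit !(seen > 0 && bad == 0) }
    '
}

# Assumed sample of the filtered output for one logical drive. On a
# real server the input would come from:
#   /opt/MegaRAID/MegaCli/MegaCli64 -ldpdinfo -a0 | grep BBU
sample_wt='Default Cache Policy: WriteBack, ReadAheadNone, Direct, No Write Cache if Bad BBU
Current Cache Policy: WriteThrough, ReadAheadNone, Direct, No Write Cache if Bad BBU'

if printf '%s\n' "$sample_wt" | cache_policy_is WriteThrough; then
    echo "all logical drives report WriteThrough"
else
    echo "WriteThrough not confirmed" >&2
fi
```

Only the "Current Cache Policy" lines are tested, because the default policy normally remains WriteBack even while the current policy is WriteThrough. Called with `WriteBack` instead, the same helper would cover the later step where the cache mode is restored.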
4. The customer can now shut down the server operating system:
# shutdown -hP now
Step B. Physical RAID Card replacement
Reference links for Service Manual:
Sun Server X4-8 Service Manual
Oracle Server X5-8 Service Manual
The server requires one internal HBA for the externally-accessible SAS (or SATA) SMOD server storage drives. The HBA is installed in a riser slot on the underside of the SMOD motherboard and connected to the backplanes by two mini-SAS4I connector cables. The card is accessible by turning over the SMOD.
1. Prepare the server for cold service.
3. Label and disconnect all cables from the SMOD.
A. To unlock the SMOD handles, squeeze together the release latches on the end of both handles. The unlocking action is accompanied by an audible click.
B. To disengage the SMOD from the server midplane, simultaneously rotate both handles downward toward their fully open positions. This action disengages the connectors on the SMOD from the connectors on the server midplane.
Caution - Physical harm or component damage. Do not use the SMOD handles to remove the SMOD from the server.
5. To remove the SMOD, grab it by its sides and slide it out of the server.
6. Set the SMOD on a flat surface with the front side facing toward you.
Caution - Component damage. SMOD components are extremely sensitive to electrostatic discharge. Wear a wrist strap and use an anti-static mat.
7. Close the SMOD handles.
8. Rotate the SMOD 180 degrees so the back (connector) side is facing toward you.
9. Turn the SMOD upside down and locate the HBA. The HBA is located inside the SMOD and is accessible through the upward-facing opening on the right side.
Note - The connector (back) side of the SMOD should still be facing you.
10. To unlock the HBA card, rotate the green release handle out and then downward. The handle is on the right (vertical) side of the SMOD. To access it, you might need to push it outward from the inside.
11. To disconnect the HBA card from its connector on the SMOD motherboard, pull the card toward you. Two cables with mini-SAS4I connectors connect the HBA to the storage drive backplane.
Note - The two storage drive backplane cables are still attached to the card.
12. Carefully lift the card up, and pull it slightly away from the SMOD until the cable connectors are visible.
13. Disconnect the cables from the card.
Note - Do not disconnect the cables from the SMOD.
14. Remove the HBA card from the SMOD and place it on an anti-static mat.
Install the new HBA PCI Card into the SMOD
1. Position the HBA card with the component side facing upward and the edge connector pointed away from you.
2. With the SMOD positioned upside down and the back (connector) side facing you, orient the HBA card in the opening on the right.
3. Ensure that the two backplane cables are routed under the support bracket on the bottom of the SMOD.
4. Connect the cables to the connectors on the HBA card. Note that the cable on the right side should connect to the adapter connector closest to the HBA PCI slot connector. The cable on the left should connect to the connector closest to the top of the card or outside edge.
5. Lower the card and align its edge connector with the PCIe slot on the SMOD motherboard.
6. To install the HBA card, push it into the slot.
7. To lock the card, rotate the green locking lever upward.
8. Rotate the lever inward so it tucks inside the SMOD frame and does not extend outward.
Install the SMOD into the Server
A. To unlock the SMOD handles, squeeze together the release latches on the end of both handles. The unlocking action is accompanied by an audible click.
B. To open, rotate both handles downward until they are at a 90 degree angle to the SMOD (fully-open position).
Caution - Physical harm or component damage. Do not use the SMOD handles to install the SMOD into the server.
10. Orient the SMOD with the handles facing away from the server and the connectors facing toward the open slot in the server. The handles should be at the bottom front and the disks should be at the top front.
11. Align the SMOD in the slot.
12. Slide the SMOD into the slot until it stops. This leaves the SMOD protruding slightly from the back of the server. Do not attempt to push the SMOD inward beyond this point.
13. To install the SMOD, simultaneously rotate both handles upward until they lock into place. This action draws the SMOD inward, engaging the SMOD connectors with the connectors on the server midplane.
Caution - Pinch point. When operating the lever, keep your fingers clear of the back side and hinged end of the lever.
14. Connect the cables to the SMOD.
Power on:
1. Re-attach the AC power cords to the server.
Server Services Startup Validation:
DB Node Startup:
# /opt/oracle.SupportTools/CheckHWnFWProfile -U /opt/oracle.cellos/iso/cellbits
c) After the firmware updates, the server will reboot again. The disk volumes should remain intact and boot up to the OS again.
# /opt/MegaRAID/MegaCli/MegaCli64 -LdInfo -Lall -a0
Adapter 0 -- Virtual Drive Information:
BBU status for Adapter: 0
...Output truncated...
3. Set all logical drives cache policy to WriteBack cache mode:
# /opt/MegaRAID/MegaCli/MegaCli64 -ldsetprop wb -lall -a0
Verify the current cache policy for all logical drives is now using WriteBack cache mode:
# /opt/MegaRAID/MegaCli/MegaCli64 -ldpdinfo -a0 | grep BBU
4. CRS services should now be started.
"DB Node Startup Verification" - for a compute node NOT running OVM; for OVM refer to the next section.
Start up CRS and re-enable autostart of CRS. After the OS is up, the Customer DBA should validate that CRS is running. As root execute:
# . oraenv
# $ORACLE_HOME/bin/crsctl start crs
Now re-enable autostart:
# $ORACLE_HOME/bin/crsctl enable crs
or
# <GI_HOME>/bin/crsctl enable crs
where the GI_HOME environment variable is typically set to “/u01/app/11.2.0/grid” but will depend on the customer's environment.
# /u01/app/11.2.0/grid/bin/crsctl check crs
CRS-4638: Oracle High Availability Services is online
CRS-4537: Cluster Ready Services is online
CRS-4529: Cluster Synchronization Services is online
CRS-4533: Event Manager is online
2. Validate that instances are running:
# ps -ef | grep pmon
It should return a record for the ASM instance and a record for each database.
For Compute Node running OVM
If the customer requires assistance, please ask them to contact an EEST engineer or the parent case owner.
Once the compute node has booted, re-enable user domains to autostart during Domain-0 boot:
# chkconfig xendomains on
Start up all user domains that are marked for auto start:
# service xendomains start
See what user domains are running (compare against the result from the previously collected data):
# xm list
If any did not auto-start, start up a single user domain:
# xm create -c /EXAVMIMAGES/GuestImages/DomainName/vm.cfg
Check that CRS has started in the user domains; refer to the previous section "DB Node Startup Verification".
4. Also verify that the InfiniBand links are up at 40 Gb/sec, as the cables were disconnected:
# /usr/sbin/ibstatus
rate: 40 Gb/sec (4X QDR)
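The two post-boot checks (CRS status and InfiniBand link rate) can also be evaluated by a script rather than by eye. The sketch below is illustrative only: `crs_all_online` and `ib_links_at_40g` are hypothetical helpers, and the requirement of exactly four "is online" lines is an assumption taken from the `crsctl check crs` example in this document, so it may need adjusting for other Grid Infrastructure versions.

```shell
#!/bin/sh
# Hypothetical helpers for illustration; neither ships with Exadata.

# Succeeds when stdin holds exactly four CRS-nnnn lines and all of them
# end in "is online" (the count of four is assumed from the example
# output of `crsctl check crs` in this document).
crs_all_online() {
    awk '
        /^CRS-/ { total++; if ($0 ~ /is online$/) online++ }
        END { exit !(total == 4 && online == 4) }
    '
}

# Succeeds when at least one "rate:" line is present on stdin and every
# such line reports 40 Gb/sec.
ib_links_at_40g() {
    awk '
        /rate:/ { seen++; if ($0 !~ /40 Gb\/sec/) bad++ }
        END { exit !(seen > 0 && bad == 0) }
    '
}

# Sample text copied from this document. On a real node the input would
# come from `crsctl check crs` and `/usr/sbin/ibstatus` respectively.
crs_sample='CRS-4638: Oracle High Availability Services is online
CRS-4537: Cluster Ready Services is online
CRS-4529: Cluster Synchronization Services is online
CRS-4533: Event Manager is online'
ib_sample='rate: 40 Gb/sec (4X QDR)'

printf '%s\n' "$crs_sample" | crs_all_online && echo "CRS stack online"
printf '%s\n' "$ib_sample" | ib_links_at_40g && echo "InfiniBand links at 40 Gb/sec"
```

Because `ibstatus` prints one "rate:" line per port, a single port that comes up at a lower rate is enough to make `ib_links_at_40g` fail, which is the behaviour wanted after the cables have been reconnected.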
- Verify that HW Components and SW Components are returned to properly functioning state with server up and database services
1093890.1 Steps To Shutdown/Startup The Exadata & RDBMS Services and Cell/Compute Nodes On An Exadata Configuration.
Service Manuals:
Sun Server X4-8 Service Manual
Oracle Server X5-8 Service Manual
Attachments
This solution has no attachment