![]() | Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition | ||
|
|
![]() |
||||||||||||||||
Solution Type Technical Instruction Sure Solution 2230380.1 : How to Replace a Big Data Appliance (Original V1, X3-2 or X4-2) Faulty RAID HBA
In this Document
Oracle Confidential PARTNER - Available to partners (SUN). Applies to:Big Data Appliance X4-2 Hardware - Version All Versions and laterBig Data Appliance Hardware - Version All Versions and later Big Data Appliance X3-2 Hardware - Version All Versions and later Big Data Appliance X3-2 Full Rack - Version All Versions and later Big Data Appliance X3-2 Starter Rack - Version All Versions and later Information in this document applies to any platform. GoalHow to Replace a Big Data Appliance (Original V1, X3-2 or X4-2) Faulty RAID HBA SolutionDISPATCH INSTRUCTIONS: FIELD ENGINEER/ADMINISTRATOR INSTRUCTIONS: - WHAT STATE SHOULD THE SYSTEM BE IN TO BE READY TO PERFORM THE RESOLUTION ACTIVITY?: # /opt/MegaRAID/megacli/MegaCli64 -ldsetprop wt -lall -a0
Verify the current cache policy for all logical volumes is now WriteThrough : # /opt/MegaRAID/megacli/MegaCli64 -ldpdinfo -a0 | grep BBU
2. The Customer’s system administrator should shutdown the server node and BDA services following the shutdown instructions for Big Data Appliance detailed in MOS Note 2099858.1 - WHAT ACTION DOES THE FIELD ENGINEER/ADMINISTRATOR NEED TO TAKE: Physical RAID HBA replacement: Do not remove any cables prior to sliding the server forward, or the loose cable ends will jam in the cable management arms. Take care to ensure the cables and Cable Management Arm is moving properly. Refer to Note 1444683.1 for CMA handling training. BDA X3-2 and X4-2 Server Nodes: 5. Remove the RAID HBA's battery from the old RAID HBA OBTAIN CUSTOMER ACCEPTANCE 2. When the utility loads, there should only be 1 adapter. Select the "Start" button. 3. The foreign configuration screen is shown. Select "Configuration" from the drop down, and select the "Preview" button. 4. Verify the configuration looks correct on the "Virtual Drives" side and select the "Import" button if it is. The correct configuration should be 12 RAID0's, 1 per disk. 5. This will bring you back to the Logical View screen where the virtual drives should be listed out on the right side. Select the "Exit" link from the left side menu. This will bring you to the "Please Reboot" screen. Press "Ctrl-Alt-Del" to reboot the machine and boot the OS. 6. After the OS has booted, login to the OS with ‘root’ privilege. # /opt/oracle/bda/bin/bdaupdatefw
After the firmware updates, the server will reboot again. The disk volumes should remain intact and boot up to the OS again. 8. After the OS is up, login as root and validate the physical and logical volumes are seen properly from the new RAID HBA in the OS and that the battery is seen: # lsscsi | grep -i LSI
[0:0:20:0] enclosu LSILOGIC SASX28 A.1 502E - [0:2:0:0] disk LSI MR9261-8i 2.90 /dev/sda [0:2:1:0] disk LSI MR9261-8i 2.90 /dev/sdb [0:2:2:0] disk LSI MR9261-8i 2.90 /dev/sdc [0:2:3:0] disk LSI MR9261-8i 2.90 /dev/sdd [0:2:4:0] disk LSI MR9261-8i 2.90 /dev/sde [0:2:5:0] disk LSI MR9261-8i 2.90 /dev/sdf [0:2:6:0] disk LSI MR9261-8i 2.90 /dev/sdg [0:2:7:0] disk LSI MR9261-8i 2.90 /dev/sdh [0:2:8:0] disk LSI MR9261-8i 2.90 /dev/sdi [0:2:9:0] disk LSI MR9261-8i 2.90 /dev/sdj [0:2:10:0] disk LSI MR9261-8i 2.90 /dev/sdk [0:2:11:0] disk LSI MR9261-8i 2.90 /dev/sdl If the device count is not correct check also that the LSI controller has the correct Virtual Drives configured and in Optimal state, physically Online and spun up, with no Foreign configuration. There should be Virtual Drives 0 to 11, and the physical slots 0 to 11 should be allocated to 1 each (not necessarily the same 0:0 1:1 etc. mapping). # /opt/MegaRAID/MegaCli/MegaCli64 -LdPdInfo -a0 | grep "Virtual Drive\|State\|Slot\|Firmware state"
Virtual Drive: 0 (Target Id: 0) State : Optimal Slot Number: 0 Firmware state: Online, Spun Up Foreign State: None Virtual Drive: 1 (Target Id: 1) State : Optimal Slot Number: 1 Firmware state: Online, Spun Up Foreign State: None Virtual Drive: 2 (Target Id: 2) State : Optimal Slot Number: 2 Firmware state: Online, Spun Up Foreign State: None Virtual Drive: 3 (Target Id: 3) State : Optimal Slot Number: 3 Firmware state: Online, Spun Up Foreign State: None Virtual Drive: 4 (Target Id: 4) State : Optimal Slot Number: 4 Firmware state: Online, Spun Up Foreign State: None Virtual Drive: 5 (Target Id: 5) State : Optimal Slot Number: 5 Firmware state: Online, Spun Up Foreign State: None Virtual Drive: 6 (Target Id: 6) State : Optimal Slot Number: 6 Firmware state: Online, Spun Up Foreign State: None Virtual Drive: 7 (Target Id: 7) State : Optimal Slot Number: 7 Firmware state: Online, Spun Up Foreign State: None Virtual Drive: 8 (Target Id: 8) State : Optimal Slot Number: 8 Firmware state: Online, Spun Up Foreign State: None Virtual Drive: 9 (Target Id: 9) State : Optimal Slot Number: 9 Firmware state: Online, Spun Up Foreign State: None Virtual Drive: 10 (Target Id: 10) State : Optimal Slot Number: 10 Firmware state: Online, Spun Up Foreign State: None Virtual Drive: 11 (Target Id: 11) State : Optimal Slot Number: 11 Firmware state: Online, Spun Up Foreign State: None # /opt/MegaRAID/megacli/MegaCli64 -AdpBbuCmd -a0
BBU status for Adapter: 0 BatteryType: iBBU08 ...Output truncated... If this is not correct, then there is a problem with the disk volumes that may need additional assistance to correct. The server should be re-opened and the device connections and boards checked to be sure they are secure and well seated BEFORE the following commands are issued. 9. Set all logical drives cache policy to WriteBack cache mode: # /opt/MegaRAID/megamli/MegaCli64 -ldsetprop wb -lall -a0
Verify the current cache policy for all logical drives is now using WriteBack cache mode: # /opt/MegaRAID/megacli/MegaCli64 -ldpdinfo -a0 | grep BBU
10. On BDA (V1) systems based on Sun Fire X4270M2 server, verify also the InfiniBand links are up at 40Gbps as the cables were disconnected: # /usr/sbin/ibstatus
Infiniband device 'mlx4_0' port 1 status: default gid: fe80:0000:0000:0000:0021:2800:013e:70bb base lid: 0x50 sm lid: 0x1 state: 4: ACTIVE phys state: 5: LinkUp rate: 40 Gb/sec (4X QDR) Infiniband device 'mlx4_0' port 2 status: default gid: fe80:0000:0000:0000:0021:2800:013e:70bc base lid: 0x51 sm lid: 0x1 state: 4: ACTIVE phys state: 5: LinkUp rate: 40 Gb/sec (4X QDR) 11. Once the hardware is verified as up and running, the Customer's system administrator will need to verify the BDA services are up following the startup procedures for Big Data Appliance detailed in MOS Note 2099858.1 PARTS NOTE: References<NOTE:2099858.1> - Steps to Gracefully Shutdown and Power on a Single Node on Oracle Big Data Appliance Prior to MaintenanceLSI MegaRAID User's Guide - https://www.broadcom.com/support/oem/oracle/6gb/sg_x_sas6-r-int-z Attachments This solution has no attachment |
||||||||||||||||
|