Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition
Solution Type: Technical Instruction Sure Solution

1992960.1 : How to Replace an Exadata X5-2/X6-2 Storage Cell Disk Controller Super Capacitor Assembly
Oracle Confidential PARTNER - Available to partners (SUN).

Applies to:
Exadata X5-2 Eighth Rack - Version All Versions and later
Exadata X4-8 Hardware - Version All Versions and later
Oracle SuperCluster M6-32 Hardware - Version All Versions and later
Oracle SuperCluster T5-8 Hardware - Version All Versions and later
Oracle SuperCluster T5-8 Full Rack - Version All Versions and later
Information in this document applies to any platform.

Goal
How to Replace an Exadata X5-2/X6-2 Storage Cell Disk Controller Super Capacitor Assembly

Solution
DISPATCH INSTRUCTIONS
WHAT STATE SHOULD THE SYSTEM BE IN TO BE READY TO PERFORM THE RESOLUTION ACTIVITY?:

SQL> select dg.name,a.value from v$asm_attribute a, v$asm_diskgroup dg where a.name = 'disk_repair_time' and a.group_number = dg.group_number;
As long as the value is large enough to comfortably cover the time needed to replace the components, there is no need to change it.

# cellcli -e list griddisk attributes name,asmmodestatus,asmdeactivationoutcome
...sample ...
CATALOG_CD_09_zdlx5_tvp_a_cel3 ONLINE Yes
CATALOG_CD_10_zdlx5_tvp_a_cel3 ONLINE Yes
CATALOG_CD_11_zdlx5_tvp_a_cel3 ONLINE Yes
DELTA_CD_00_zdlx5_tvp_a_cel3 ONLINE Yes
DELTA_CD_01_zdlx5_tvp_a_cel3 ONLINE Yes
DELTA_CD_02_zdlx5_tvp_a_cel3 ONLINE Yes
...repeated for all griddisks....

If one or more disks return asmdeactivationoutcome='No', then wait for some time and repeat step #2. Once all disks return asmdeactivationoutcome='Yes', proceed to the next step.

# cellcli
CellCLI> ALTER GRIDDISK ALL INACTIVE
...sample ...
GridDisk CATALOG_CD_09_zdlx5_tvp_a_cel3 successfully altered
GridDisk CATALOG_CD_10_zdlx5_tvp_a_cel3 successfully altered
GridDisk CATALOG_CD_11_zdlx5_tvp_a_cel3 successfully altered
GridDisk DELTA_CD_00_zdlx5_tvp_a_cel3 successfully altered
GridDisk DELTA_CD_01_zdlx5_tvp_a_cel3 successfully altered
GridDisk DELTA_CD_02_zdlx5_tvp_a_cel3 successfully altered
...repeated for all griddisks...
CellCLI> list griddisk attributes name,status,asmmodestatus,asmdeactivationoutcome
CATALOG_CD_09_zdlx5_tvp_a_cel3 inactive OFFLINE Yes
CATALOG_CD_10_zdlx5_tvp_a_cel3 inactive OFFLINE Yes
CATALOG_CD_11_zdlx5_tvp_a_cel3 inactive OFFLINE Yes
DELTA_CD_00_zdlx5_tvp_a_cel3 inactive OFFLINE Yes
DELTA_CD_01_zdlx5_tvp_a_cel3 inactive OFFLINE Yes
DELTA_CD_02_zdlx5_tvp_a_cel3 inactive OFFLINE Yes
...repeated for all griddisks...
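
The two griddisk checks above (every asmdeactivationoutcome is 'Yes' before deactivation, and every disk reports inactive / OFFLINE afterwards) could be automated along the following lines. This is only a sketch: the function names are hypothetical, and it assumes the listing has been captured as text with one griddisk per line and whitespace-separated columns, as in the samples above.

```python
def parse_griddisks(text):
    """Split each griddisk row of a `list griddisk attributes ...` listing into fields."""
    rows = []
    for line in text.splitlines():
        fields = line.split()
        # Skip blank lines and filler such as "...sample ..."
        if not fields or fields[0].startswith("..."):
            continue
        rows.append(fields)
    return rows


def not_safe_to_deactivate(text):
    """Names of griddisks whose last column (asmdeactivationoutcome) is not 'Yes'."""
    return [row[0] for row in parse_griddisks(text) if row[-1] != "Yes"]


def not_offline(text):
    """Names of griddisks not reported as 'inactive OFFLINE' in the
    name,status,asmmodestatus,asmdeactivationoutcome listing."""
    return [row[0] for row in parse_griddisks(text)
            if row[1:3] != ["inactive", "OFFLINE"]]


if __name__ == "__main__":
    # Rows copied from the sample output in this document
    after_alter = """\
CATALOG_CD_09_zdlx5_tvp_a_cel3 inactive OFFLINE Yes
DELTA_CD_00_zdlx5_tvp_a_cel3 inactive OFFLINE Yes
"""
    print("Still not offline:", not_offline(after_alter))
```

An empty list from either function means the corresponding check passed for every row that was captured; the listing itself still has to come from the cellcli commands shown above.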
# /opt/MegaRAID/MegaCli/MegaCli64 -ldsetprop wt -lall -a0
# /opt/MegaRAID/MegaCli/MegaCli64 -ldpdinfo -a0 | grep BBU
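
The grep in the preceding command prints a "Default Cache Policy" and a "Current Cache Policy" line for each virtual drive. A minimal sketch for confirming that every virtual drive is currently in the intended write mode is given below. The function names are hypothetical, the line format is taken from the WriteBack sample later in this document, and the spelling "WriteThrough" for the mode set by `-ldsetprop wt` is an assumption made by analogy with that sample.

```python
def current_cache_modes(text):
    """Return the write mode from each 'Current Cache Policy' line, in order."""
    modes = []
    for line in text.splitlines():
        line = line.strip()
        if line.startswith("Current Cache Policy:"):
            # The first comma-separated field after the colon is the write mode
            modes.append(line.split(":", 1)[1].split(",")[0].strip())
    return modes


def all_in_mode(text, expected):
    """True only if at least one virtual drive was found and all match `expected`."""
    modes = current_cache_modes(text)
    return bool(modes) and all(mode == expected for mode in modes)


if __name__ == "__main__":
    # Assumed shape of the output after switching to write-through
    sample = """\
Default Cache Policy: WriteThrough, ReadAheadNone, Direct, No Write Cache if Bad BBU
Current Cache Policy: WriteThrough, ReadAheadNone, Direct, No Write Cache if Bad BBU
"""
    print(all_in_mode(sample, "WriteThrough"))
```

The same helper can be reused with "WriteBack" once the cache policy has been restored after the replacement.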
# shutdown -hP now
# lsscsi | grep -i LSI
[0:2:0:0] disk LSI MR9361-8i 4.23 /dev/sda
[0:2:1:0] disk LSI MR9361-8i 4.23 /dev/sdb
[0:2:2:0] disk LSI MR9361-8i 4.23 /dev/sdc
[0:2:3:0] disk LSI MR9361-8i 4.23 /dev/sdd
[0:2:4:0] disk LSI MR9361-8i 4.23 /dev/sde
[0:2:5:0] disk LSI MR9361-8i 4.23 /dev/sdf
[0:2:6:0] disk LSI MR9361-8i 4.23 /dev/sdg
[0:2:7:0] disk LSI MR9361-8i 4.23 /dev/sdh
[0:2:8:0] disk LSI MR9361-8i 4.23 /dev/sdi
[0:2:9:0] disk LSI MR9361-8i 4.23 /dev/sdj
[0:2:10:0] disk LSI MR9361-8i 4.23 /dev/sdk
[0:2:11:0] disk LSI MR9361-8i 4.23 /dev/sdl

If the device count is not correct, also check that the LSI controller has the correct Virtual Drives configured and in Optimal state, with the physical disks Online and spun up, and with no Foreign configuration. There should be Virtual Drives 0 to 11, and each of the physical slots 0 to 11 should be allocated to exactly one Virtual Drive (not necessarily with the same 0:0, 1:1, etc. mapping).

# /opt/MegaRAID/MegaCli/MegaCli64 -LdPdInfo -a0 | grep "Virtual Drive\|State\|Slot\|Firmware state"
Virtual Drive: 0 (Target Id: 0)
State : Optimal
Slot Number: 0
Firmware state: Online, Spun Up
Foreign State: None
Virtual Drive: 1 (Target Id: 1)
State : Optimal
Slot Number: 1
Firmware state: Online, Spun Up
Foreign State: None
Virtual Drive: 2 (Target Id: 2)
State : Optimal
Slot Number: 2
Firmware state: Online, Spun Up
Foreign State: None
Virtual Drive: 3 (Target Id: 3)
State : Optimal
Slot Number: 3
Firmware state: Online, Spun Up
Foreign State: None
Virtual Drive: 4 (Target Id: 4)
State : Optimal
Slot Number: 4
Firmware state: Online, Spun Up
Foreign State: None
Virtual Drive: 5 (Target Id: 5)
State : Optimal
Slot Number: 5
Firmware state: Online, Spun Up
Foreign State: None
Virtual Drive: 6 (Target Id: 6)
State : Optimal
Slot Number: 6
Firmware state: Online, Spun Up
Foreign State: None
Virtual Drive: 7 (Target Id: 7)
State : Optimal
Slot Number: 7
Firmware state: Online, Spun Up
Foreign State: None
Virtual Drive: 8 (Target Id: 8)
State : Optimal
Slot Number: 8
Firmware state: Online, Spun Up
Foreign State: None
Virtual Drive: 9 (Target Id: 9)
State : Optimal
Slot Number: 9
Firmware state: Online, Spun Up
Foreign State: None
Virtual Drive: 10 (Target Id: 10)
State : Optimal
Slot Number: 10
Firmware state: Online, Spun Up
Foreign State: None
Virtual Drive: 11 (Target Id: 11)
State : Optimal
Slot Number: 11
Firmware state: Online, Spun Up
Foreign State: None

Check the status of the SuperCap:

# /opt/MegaRAID/MegaCli/MegaCli64 -AdpBbuCmd -a0
BBU status for Adapter: 0
BatteryType: CVPM02
Voltage: 9450 mV
Current: 0 mA
Temperature: 29 C
Battery State: Optimal
BBU Firmware Status:
...Output truncated...

If this is not correct, then there is a problem with the disk volumes that may need additional assistance to correct. The server should be re-opened and the device connections and boards checked to be sure they are secure and well seated BEFORE the following CellCLI commands are issued.

# /opt/MegaRAID/MegaCli/MegaCli64 -ldsetprop wb -lall -a0
# /opt/MegaRAID/MegaCli/MegaCli64 -ldpdinfo -a0 | grep BBU
Default Cache Policy: WriteBack, ReadAheadNone, Direct, No Write Cache if Bad BBU
Current Cache Policy: WriteBack, ReadAheadNone, Direct, No Write Cache if Bad BBU
Default Cache Policy: WriteBack, ReadAheadNone, Direct, No Write Cache if Bad BBU
Current Cache Policy: WriteBack, ReadAheadNone, Direct, No Write Cache if Bad BBU
Default Cache Policy: WriteBack, ReadAheadNone, Direct, No Write Cache if Bad BBU
Current Cache Policy: WriteBack, ReadAheadNone, Direct, No Write Cache if Bad BBU
Default Cache Policy: WriteBack, ReadAheadNone, Direct, No Write Cache if Bad BBU
Current Cache Policy: WriteBack, ReadAheadNone, Direct, No Write Cache if Bad BBU
Default Cache Policy: WriteBack, ReadAheadNone, Direct, No Write Cache if Bad BBU
Current Cache Policy: WriteBack, ReadAheadNone, Direct, No Write Cache if Bad BBU
Default Cache Policy: WriteBack, ReadAheadNone, Direct, No Write Cache if Bad BBU
Current Cache Policy: WriteBack, ReadAheadNone, Direct, No Write Cache if Bad BBU
Default Cache Policy: WriteBack, ReadAheadNone, Direct, No Write Cache if Bad BBU
Current Cache Policy: WriteBack, ReadAheadNone, Direct, No Write Cache if Bad BBU
Default Cache Policy: WriteBack, ReadAheadNone, Direct, No Write Cache if Bad BBU
Current Cache Policy: WriteBack, ReadAheadNone, Direct, No Write Cache if Bad BBU
Default Cache Policy: WriteBack, ReadAheadNone, Direct, No Write Cache if Bad BBU
Current Cache Policy: WriteBack, ReadAheadNone, Direct, No Write Cache if Bad BBU
Default Cache Policy: WriteBack, ReadAheadNone, Direct, No Write Cache if Bad BBU
Current Cache Policy: WriteBack, ReadAheadNone, Direct, No Write Cache if Bad BBU
Default Cache Policy: WriteBack, ReadAheadNone, Direct, No Write Cache if Bad BBU
Current Cache Policy: WriteBack, ReadAheadNone, Direct, No Write Cache if Bad BBU
Default Cache Policy: WriteBack, ReadAheadNone, Direct, No Write Cache if Bad BBU
Current Cache Policy: WriteBack, ReadAheadNone, Direct, No Write Cache if Bad BBU

4. Verify also the InfiniBand links are up at 40Gbps if the cables were disconnected:

# /usr/sbin/ibstatus
Infiniband device 'mlx4_0' port 1 status:
        default gid: fe80:0000:0000:0000:0010:e000:0159:c61d
        base lid: 0x9
        sm lid: 0x2
        state: 4: ACTIVE
        phys state: 5: LinkUp
        rate: 40 Gb/sec (4X QDR)
        link_layer: IB
Infiniband device 'mlx4_0' port 2 status:
        default gid: fe80:0000:0000:0000:0010:e000:0159:c61e
        base lid: 0xa
        sm lid: 0x2
        state: 4: ACTIVE
        phys state: 5: LinkUp
        rate: 40 Gb/sec (4X QDR)
        link_layer: IB
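
The InfiniBand check above can also be scripted. The following is a sketch only: the function name is hypothetical, and it assumes the `ibstatus` output has been captured as text with one attribute per line, as the tool prints it. It reports every port whose state is not ACTIVE, whose physical state is not LinkUp, or whose rate does not start with the expected 40 Gb/sec.

```python
import re


def ib_port_problems(text, expected_rate="40 Gb/sec"):
    """Return a list of human-readable problems found in ibstatus output."""
    problems = []
    port = None
    for raw in text.splitlines():
        line = raw.strip()
        header = re.match(r"Infiniband device '([^']+)' port (\d+) status:", line)
        if header:
            port = f"{header.group(1)} port {header.group(2)}"
            continue
        if port is None or ":" not in line:
            continue
        # Split "key: value" on the first colon only; values may contain colons
        key, value = line.split(":", 1)
        key, value = key.strip(), value.strip()
        if key == "state" and "ACTIVE" not in value:
            problems.append(f"{port}: state is {value}")
        elif key == "phys state" and "LinkUp" not in value:
            problems.append(f"{port}: phys state is {value}")
        elif key == "rate" and not value.startswith(expected_rate):
            problems.append(f"{port}: rate is {value}")
    return problems


if __name__ == "__main__":
    # Attributes copied from the sample output in this document
    sample = """\
Infiniband device 'mlx4_0' port 1 status:
        state: 4: ACTIVE
        phys state: 5: LinkUp
        rate: 40 Gb/sec (4X QDR)
"""
    print(ib_port_problems(sample) or "all ports OK")
```

An empty list only means that no problem was seen on the ports present in the captured text; it does not confirm that both ports were listed.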
# cellcli
CellCLI> alter griddisk all active
GridDisk CATALOG_CD_09_zdlx5_tvp_a_cel3 successfully altered
GridDisk CATALOG_CD_10_zdlx5_tvp_a_cel3 successfully altered
GridDisk CATALOG_CD_11_zdlx5_tvp_a_cel3 successfully altered
GridDisk DELTA_CD_00_zdlx5_tvp_a_cel3 successfully altered
GridDisk DELTA_CD_01_zdlx5_tvp_a_cel3 successfully altered
GridDisk DELTA_CD_02_zdlx5_tvp_a_cel3 successfully altered
...repeated for all griddisks...
Issue the command below and all disks should show 'active':

CellCLI> list griddisk
CATALOG_CD_09_zdlx5_tvp_a_cel3 active
CATALOG_CD_10_zdlx5_tvp_a_cel3 active
CATALOG_CD_11_zdlx5_tvp_a_cel3 active
DELTA_CD_00_zdlx5_tvp_a_cel3 active
DELTA_CD_01_zdlx5_tvp_a_cel3 active
DELTA_CD_02_zdlx5_tvp_a_cel3 active
...repeated for all griddisks...
CellCLI> list griddisk attributes name,status,asmmodestatus,asmdeactivationoutcome
CATALOG_CD_09_zdlx5_tvp_a_cel3 active SYNCING Yes
CATALOG_CD_10_zdlx5_tvp_a_cel3 active SYNCING Yes
CATALOG_CD_11_zdlx5_tvp_a_cel3 active SYNCING Yes
DELTA_CD_00_zdlx5_tvp_a_cel3 active SYNCING Yes
DELTA_CD_01_zdlx5_tvp_a_cel3 active SYNCING Yes
DELTA_CD_02_zdlx5_tvp_a_cel3 active SYNCING Yes
...repeated for all griddisks...
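
To follow the resynchronization shown above without reading every row, the asmmodestatus column can be tallied. The sketch below uses a hypothetical function name and assumes the four-column listing has been captured as text. Treating ONLINE as the state the disks return to is also an assumption, based on the pre-maintenance listing earlier in this document.

```python
from collections import Counter


def asm_mode_counts(text):
    """Count griddisks per asmmodestatus in the
    name,status,asmmodestatus,asmdeactivationoutcome listing."""
    counts = Counter()
    for line in text.splitlines():
        fields = line.split()
        # Only rows whose second column is a griddisk status are counted,
        # which skips filler such as "...repeated for all griddisks..."
        if len(fields) >= 4 and fields[1] in ("active", "inactive"):
            counts[fields[2]] += 1
    return counts


if __name__ == "__main__":
    # Rows copied from the sample output in this document
    sample = """\
CATALOG_CD_09_zdlx5_tvp_a_cel3 active SYNCING Yes
DELTA_CD_00_zdlx5_tvp_a_cel3 active SYNCING Yes
...repeated for all griddisks...
"""
    for mode, count in asm_mode_counts(sample).items():
        print(mode, count)
```

Re-running the listing and the tally at intervals shows how many griddisks are still SYNCING.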