Sun Microsystems, Inc.  Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition
   Home | Current Systems | Former STK Products | EOL Systems | Components | General Info | Search | Feedback

Asset ID: 1-71-2229059.1
Update Date:2017-02-06
Keywords:

Solution Type  Technical Instruction Sure

Solution  2229059.1 :   How to Replace a Big Data Appliance X5-2 or X6-2 Faulty SAS Cable Assembly  


Related Items
  • Big Data Appliance X5-2 Starter Rack
  •  
  • Big Data Appliance X5-2 Full Rack
  •  
  • Big Data Appliance X5-2 Hardware
  •  
  • Big Data Appliance X6-2 Hardware
  •  
Related Categories
  • PLA-Support>Sun Systems>Sun_Other>Sun Collections>SN-OTH: x64-CAP VCAP
  •  




In this Document
Goal
Solution
References


Oracle Confidential PARTNER - Available to partners (SUN).
Reason: FRU CAP

Applies to:

Big Data Appliance X5-2 Full Rack - Version All Versions and later
Big Data Appliance X5-2 Starter Rack - Version All Versions and later
Big Data Appliance X5-2 Hardware - Version All Versions and later
Big Data Appliance X6-2 Hardware - Version All Versions and later
Information in this document applies to any platform.

Goal

 How to Replace a Faulty SAS Cable Assembly on BDA X5-2 or X6-2

Solution

DISPATCH INSTRUCTIONS:
- WHAT SKILLS DOES THE FIELD ENGINEER/ADMINISTRATOR NEED: BDA Trained
- TIME ESTIMATE: 60 Minutes
- TASK COMPLEXITY: 3

FIELD ENGINEER/ADMINISTRATOR INSTRUCTIONS:

PROBLEM OVERVIEW: A faulty internal HBA SAS cable assembly in BDA X5-2 or X6-2 Server Node has been diagnosed as needing replacement

WHAT STATE SHOULD THE SYSTEM BE IN TO BE READY TO PERFORM THE RESOLUTION ACTIVITY?:

The instructions below assume the Customer system administrator is available and working with the field engineer onsite to manage the host OS and BDA services.
They are provided here to allow the FE to have all the available steps needed when onsite, and can be done by the FE if the customer system administrator wants or allows or needs help with these steps.
The server that contains the faulty HBA SAS cable should have its services offline and system powered off.
The Customer’s system administrator should shutdown the server node and BDA services following the shutdown instructions for Big Data Appliance detailed in MOS Note 2099858.1

WHAT ACTION DOES THE FIELD ENGINEER/ADMINISTRATOR NEED TO TAKE?:

Physical SAS Cable assembly replacement procedure:

Reference links for SAS Cables in the Service Manual:
X5-2L : ( https://docs.oracle.com/cd/E41033_01/html/E48325/cnpsm.baecfjbc.html#scrolltoc )
X6-2L : ( https://docs.oracle.com/cd/E62172_01/html/E62184/baecfjbc.html )

1. Slide out the server for maintenance.
    Do not remove any cables prior to sliding the server forward, or the loose cable ends will jam in the cable management arms.
    Take care to ensure the cables and Cable Management Arm is moving properly. Refer to Note 1444683.1 for CMA handling training.

2. Disconnect the AC power cords.

3. Unlatch and slide off the top cover of the server.

4. Open the server fan assembly door and remove all the fan modules

5. Swivel the air baffle into the upright position to allow access to the SAS storage drive cables

6. Remove the server's front fan assembly door cover. Use a No.2 Phillips screwdriver to remove the two screws on each side of the chassis and the three screws on top of the chassis.

7. Disconnect the SAS cables between the SAS RAID HBA card in PCIe slot 6 and the front storage drive backplane.
    Press each latch, and then pull out to disengage the cable from each SAS connector on each end of the cables.

NOTE: Be careful not to disconnect the super capacitor cable, as RAID HBA Cache data may be lost if this is disconnected temporarily.

8. Remove the SAS cables from the server.

NOTE: Carefully remove the SAS cable bundles from the chassis mid-wall. Be careful not to snag the cables on the server components.

9. Install the replacement SAS cables between the front storage drive backplane and the SAS RAID HBA card in PCIe slot 6.
    Route the SAS cable bundle through the chassis mid-wall and along the left side of the chassis.
    To ensure that the SAS cable bundle does not interfere with the air baffle, install the SAS cable bundle under the super capacitor cable along the left side of the chassis.

NOTE:  Be careful not to disconnect the super capacitor cable, as RAID HBA Cache data may be lost if this is disconnected temporarily.

10. Reconnect the SAS cables between the front storage drive backplane and the SAS RAID HBA card in PCIe slot 6. Plug each cable into its SAS connector until you hear an audible click.
      On the front storage drive backplane, the port designators are J302 (Upper), J301 (Lower).

11. Install the server's front fan assembly door cover.

12. Lower the air baffle to the installed position

13. Install the fan modules and close the fan assembly door

14. Install the top cover

15. Install the AC power cords

16. Slide the server back into the rack.

OBTAIN CUSTOMER ACCEPTANCE

- WHAT ACTION DOES THE FIELD ENGINEER/ADMINISTRATOR NEED TO TAKE TO RETURN THE SYSTEM TO AN OPERATIONAL STATE:

Verify that HW Components and SW Components are returned to properly functioning state with server up and all disks online on the servers.

1. Once the ILOM has booted you will see a slow blink on the green LED for the server. Press the power button on the front of the server to power on the unit.

2. After the OS is up, login as root and verify all the expected devices are present:
    The following command should show 12 disks:

# lsscsi | grep -i LSI
[0:2:0:0] disk LSI MR9361-8i 4.23 /dev/sda
[0:2:1:0] disk LSI MR9361-8i 4.23 /dev/sdb
[0:2:2:0] disk LSI MR9361-8i 4.23 /dev/sdc
[0:2:3:0] disk LSI MR9361-8i 4.23 /dev/sdd
[0:2:4:0] disk LSI MR9361-8i 4.23 /dev/sde
[0:2:5:0] disk LSI MR9361-8i 4.23 /dev/sdf
[0:2:6:0] disk LSI MR9361-8i 4.23 /dev/sdg
[0:2:7:0] disk LSI MR9361-8i 4.23 /dev/sdh
[0:2:8:0] disk LSI MR9361-8i 4.23 /dev/sdi
[0:2:9:0] disk LSI MR9361-8i 4.23 /dev/sdj
[0:2:10:0] disk LSI MR9361-8i 4.23 /dev/sdk
[0:2:11:0] disk LSI MR9361-8i 4.23 /dev/sdl

If the device count is not correct check also that the LSI controller has the correct Virtual Drives configured and in Optimal state, physically Online and spun up, with no Foreign configuration. There should be Virtual Drives 0 to 11, and the physical slots 0 to 11 should be allocated to 1 each (not necessarily the same 0:0 1:1 etc. mapping).

# /opt/MegaRAID/storcli/storcli64 -LdPdInfo -a0 | grep "Virtual Drive\|State\|Slot\|Firmware state"
Virtual Drive: 0 (Target Id: 0)
State : Optimal
Slot Number: 0
Firmware state: Online, Spun Up
Foreign State: None
Virtual Drive: 1 (Target Id: 1)
State : Optimal
Slot Number: 1
Firmware state: Online, Spun Up
Foreign State: None
Virtual Drive: 2 (Target Id: 2)
State : Optimal
Slot Number: 2
Firmware state: Online, Spun Up
Foreign State: None
Virtual Drive: 3 (Target Id: 3)
State : Optimal
Slot Number: 3
Firmware state: Online, Spun Up
Foreign State: None
Virtual Drive: 4 (Target Id: 4)
State : Optimal
Slot Number: 4
Firmware state: Online, Spun Up
Foreign State: None
Virtual Drive: 5 (Target Id: 5)
State : Optimal
Slot Number: 5
Firmware state: Online, Spun Up
Foreign State: None
Virtual Drive: 6 (Target Id: 6)
State : Optimal
Slot Number: 6
Firmware state: Online, Spun Up
Foreign State: None
Virtual Drive: 7 (Target Id: 7)
State : Optimal
Slot Number: 7
Firmware state: Online, Spun Up
Foreign State: None
Virtual Drive: 8 (Target Id: 8)
State : Optimal
Slot Number: 8
Firmware state: Online, Spun Up
Foreign State: None
Virtual Drive: 9 (Target Id: 9)
State : Optimal
Slot Number: 9
Firmware state: Online, Spun Up
Foreign State: None
Virtual Drive: 10 (Target Id: 10)
State : Optimal
Slot Number: 10
Firmware state: Online, Spun Up
Foreign State: None
Virtual Drive: 11 (Target Id: 11)
State : Optimal
Slot Number: 11
Firmware state: Online, Spun Up
Foreign State: None

Check the status of the Super Capacitor:

# /opt/MegaRAID/storcli/storcli64 -AdpBbuCmd -a0
BBU status for Adapter: 0

BatteryType: CVPM02
Voltage: 9450 mV
Current: 0 mA
Temperature: 29 C
Battery State: Optimal
BBU Firmware Status:

...Output truncated...

If this is not correct, then there is a problem with the disk volumes that may need additional assistance to correct.
The server should be re-opened and the device connections and boards checked to be sure they are secure and well seated BEFORE the following commands are issued.

3. Once the hardware is verified as up and running, the Customer's system administrator will need to verify the BDA services are up following the startup procedures for Big Data Appliance detailed in MOS Note 2099858.1

PARTS NOTE:
7094266 12-Slot Backplane Cable Kit includes
7091185 SFF-8643 to SFF-8087 to SFF-8087 Cable, 520mm/520mm

References

<NOTE:2099858.1> - Steps to Gracefully Shutdown and Power on a Single Node on Oracle Big Data Appliance Prior to Maintenance

Attachments
This solution has no attachment
  Copyright © 2018 Oracle, Inc.  All rights reserved.
 Feedback