Sun Microsystems, Inc.  Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition
   Home | Current Systems | Former STK Products | EOL Systems | Components | General Info | Search | Feedback

Asset ID: 1-71-1961138.1
Update Date:2018-05-14
Keywords:

Solution Type  Technical Instruction Sure

Solution  1961138.1 :   How to Replace an Oracle Server X5-2L, X6-2L Rear Disk [VCAP]  


Related Items
  • Oracle Server X6-2L
  •  
  • Zero Data Loss Recovery Appliance X5 Hardware
  •  
  • Oracle Server X5-2L
  •  
Related Categories
  • PLA-Support>Sun Systems>Sun_Other>Sun Collections>SN-OTH: x64-CAP VCAP
  •  




Applies to:

Zero Data Loss Recovery Appliance X5 Hardware - Version All Versions to All Versions [Release All Releases]
Oracle Server X5-2L - Version All Versions to All Versions [Release All Releases]
Oracle Server X6-2L - Version All Versions to All Versions [Release All Releases]
x86_64

Goal

How to Replace an Oracle Server X5-2L, X6-2L Rear Disk.

Solution

DISPATCH INSTRUCTIONS

WHAT SKILLS DOES THE FIELD ENGINEER/ADMINISTRATOR NEED:
No special skills required, Customer Replaceable Unit (CRU) procedure

TIME ESTIMATE: 30 minutes

TASK COMPLEXITY: 0

FIELD ENGINEER/ADMINISTRATOR INSTRUCTIONS:

PROBLEM OVERVIEW: An Oracle Server X5-2L, X6-2L Rear Disk needs replacement

WHAT STATE SHOULD THE SYSTEM BE IN TO BE READY TO PERFORM THE RESOLUTION ACTIVITY? :

The server supports hot-plugging of the storage drives so a complete power down of the server is not required if your storage/OS configuration allows for a hot-plug operation. If the storage configuration does not allow for hot-plugging of the storage drive the customer should perform an orderly and graceful shutdown of applications and OS.

WHAT ACTION DOES THE ENGINEER NEED TO TAKE:

Reference Doc:
Oracle Server X5-2L Remove a Rear Storage drive:
http://docs.oracle.com/cd/E41033_01/html/E48325/cnpsm.bbggjgea.html#scrolltoc

Oracle Server X6-2L Remove a Rear Storage drive:
http://docs.oracle.com/cd/E62172_01/html/E62184/bbggjgea.html#scrolltoc

 

1. Prepare the server for service.

  1. Take appropriate anti-static measures before working on the server.
  2. Replacing a drive does not require removing the server from a rack.
  3. The solid-state drives (SSDs) or hard-disk drives (HHDs) in the server are hot-pluggable, but this capability depends on how the drives are configured. To hot-plug a drive you must be able to take the drive offline before you can remove it. When you take the drive offline, you prevent any application from accessing the drive and remove the logical software links to the drive.
  4. If the Storage Drive provides the operating system and the operating system is not mirrored on another drive or if the drive cannot be logically isolated from the online operations of the server then the server will need to be shut down before drive replacement.

2. Confirm the Storage Drive failure and it's location.

  1. Confirm which Storage Drive is to be replaced.
  2. Systems that are configured with twelve and twenty-four front panel storage drives contain two rear-mounted storage drives. The drives are located at the left rear above the power supplies. When viewed from the rear of the system rear storage drive 0 is to the left and rear storage drive 1 is to the right

3. Remove the rear storage drive.

  1. Type the operating system commands required to stop the server from using the drive. Exact commands required depend on the configuration of your drives. Unmount file systems or perform RAID commands as needed. (The blue OK to Remove LED on the storage drive might not light, as support for this varies depending on the operating system in use)
  2. On the drive you plan to remove, push the latch release button to open the latch. (Caution - The latch is not an ejector. Do not bend the latch too far to the right. Doing so can damage the latch.)
  3. Grasp the latch and pull the drive out of the drive slot.
  4. place the drive on an antistatic mat.

4. Install the replacement rear storage drive

  1. Remove the replacement drive from its packaging and place the drive on an antistatic mat.
  2. Align the replacement drive to the drive slot. The drive is physically addressed according to the slot in which it is installed. It is important to install a replacement drive in the same slot as the drive that was removed.
  3. Slide the drive into the bay until the drive is fully seated.
  4. Close the drive latch to lock the drive in place.

5.Return the Server to operation

  1. If the server was shut down to perform the drive replacement power on the server and boot the system.
  2. Perform administrative tasks to reconfigure the drive. The procedures that you perform at this point depend on how your data is configured. You might need to partition the drive, create file systems, load data from backups, or have the drive updated from a RAID configuration.

 

How to verify the Disk is working properly.

     1.  Log into ILOM to confirm if disk status is working properly.

Sample

-> show /SYS/MB/RHDD0

 /SYS/MB/RHDD0
    Targets:
        OK2RM
        PRSNT
        SERVICE
        STATE

    Properties:
        type = Hard Disk
        ipmi_name = RHDD0
        fru_manufacturer = HGST
        fru_version = A122
        fru_part_number = HSCAC2DA4SUN400G
        fru_serial_number = 001441JQN4GA        0QV4N4GA
        controller_id = 01:00.0
        disk_id = 12
        capacity = 372
        device_name = 00000000:9
        disk_type = sas
        wwn = 0x5000cca04e087469
        smart_health_status = OK
        raid_status = OK
        raid_global_hotspare = false

    Commands:
        cd
        show

->



    2.  Check ILOM event log to see if any error related HDDs.

-> show /SP/faultmgmt
-> show /SP/logs/event/list


    3.  Check if the HDD is working normal from LSI 8-Port 12Gbps SAS3 Internal RAID HBA (Aspen-INT) controller.
        a. Power On system.
        b. During boot up a message is displayed that gives you the option to press CTRL+R to access the MegaRAID Configuration Utility.
            1. Press CTRL+R to access the MegaRAID utility.
            2. The Virtual Drive Management screen is displayed (VD Mgmt).
            3. Use the Arrow keys to navigate to the Volume that you want to select, and press F2.
            4. Use the Arrow keys to navigate to the Propeties, and press Enter.
            5. The volume properties will be displayed and it will show if the volume is Optimal, Degraded, or Resyncing.
            6. To exit Arrow to the Cancel button and hit Enter or hit the Esc key.
            7. To exit the MegaRAID utility hit the Esc key and select OK.

Check if all HDD are working normal on Solaris environment.


1. Use format command to check HDD status

# format
Searching for disks...done


AVAILABLE DISK SELECTIONS:
       0. c0t0d0 <LSI-MR9361-8i-4.23-3.64TB>
          /scsi_vhci/disk@g600605b0094308501c243a74050f8c5b
       1. c0t1d0 <LSI-MR9361-8i-4.23 cyl 48506 alt 2 hd 255 sec 63>
          /scsi_vhci/disk@g600605b0094308501c64d1260c2ca147
       2. c0t2d0 <LSI-MR9361-8i-4.23 cyl 48506 alt 2 hd 255 sec 63>
          /scsi_vhci/disk@g600605b0094308501c64d1640fd71f3e



2. Use iostat command to check HDD status

# iostat -E
sd0       Soft Errors: 0 Hard Errors: 0 Transport Errors: 0
Vendor: LSI      Product: MR9361-8i        Revision: 4.23 Serial No:
Size: 4000.00GB <3999999721472 bytes>
Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0
Illegal Request: 67 Predictive Failure Analysis: 0 Non-Aligned Writes: 0
sd1       Soft Errors: 0 Hard Errors: 0 Transport Errors: 0
Vendor: ORACLE   Product: SSM              Revision: PMAP Serial No:
Size: 8.01GB <8011120640 bytes>
Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0
Illegal Request: 27 Predictive Failure Analysis: 0 Non-Aligned Writes: 0
sd3       Soft Errors: 0 Hard Errors: 0 Transport Errors: 0
Vendor: LSI      Product: MR9361-8i        Revision: 4.23 Serial No:
Size: 399.00GB <398999945216 bytes>
Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0
Illegal Request: 7 Predictive Failure Analysis: 0 Non-Aligned Writes: 0
sd4       Soft Errors: 0 Hard Errors: 0 Transport Errors: 0
Vendor: LSI      Product: MR9361-8i        Revision: 4.23 Serial No:
Size: 399.00GB <398999945216 bytes>
Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0
Illegal Request: 7 Predictive Failure Analysis: 0 Non-Aligned Writes: 0


3. Use cfgadm command to check HDD connection

# cfgadm -alv
Ap_Id                          Receptacle   Occupant     Condition  Information
When         Type         Busy     Phys_Id
c3                             connected    configured   unknown
unavailable  scsi-sas     n        /devices/pci@0,0/pci8086,2f02@1/pci1000,9361@0/iport@v0:scsi
c3::v600605b0094308501c64d1260c2ca147,0 connected    configured   unknown    Client Device: /dev/dsk/c0t1d0s0(sd4)
unavailable  disk-path    n        /devices/pci@0,0/pci8086,2f02@1/pci1000,9361@0/iport@v0:scsi::v600605b0094308501c64d1260c2ca147,0
c3::v600605b0094308501c64d1640fd71f3e,0 connected    configured   unknown    Client Device: /dev/dsk/c0t2d0s0(sd3)
unavailable  disk-path    n        /devices/pci@0,0/pci8086,2f02@1/pci1000,9361@0/iport@v0:scsi::v600605b0094308501c64d1640fd71f3e,0
c3::v600605b0094308501c243a74050f8c5b,0 connected    configured   unknown    Client Device: /dev/dsk/c0t0d0s0(sd0)
unavailable  disk-path    n        /devices/pci@0,0/pci8086,2f02@1/pci1000,9361@0/iport@v0:scsi::v600605b0094308501c243a74050f8c5b,0


4. Check /var/adm/message file if any warning/error related HDD

For other OS, please follow related docs to check if HDD is working normal.

Linux:
How to Check for Linux Platform Disk Errors and Online/Offline Status (Doc ID 1002936.1)

Windows:
How to Check for Windows platform disk errors and online/offline status (Doc ID 1011590.1)

For Aspen controller, please check

12Gb/s MegaRAID SAS Software User Guide
http://www.lsi.com/products/raid-controllers/pages/megaraid-sas-9361-8i.aspx#tab/tab3

Detailed SVM instructions can be found at http://download.oracle.com/docs/cd/E19253-01/816-4520/troubleshoottasks-96/index.html

See Document 1010946.1 for Detailed Veritas Volume Manager instructions.
See Document 1002753.1 for Detailed ZFS instructions

OBTAIN CUSTOMER ACCEPTANCE

WHAT ACTION DOES THE CUSTOMER NEED TO TAKE TO RETURN THE SYSTEM TO AN OPERATIONAL STATE:

Perform the system administration tasks required to utilize the replacement drive as needed depending upon the configuration in use.

REFERENCE INFORMATION:
Oracle Server X5-2L Documentation:
http://docs.oracle.com/cd/E41033_01/index.html

Oracle Server X6-2L Documentation:
http://docs.oracle.com/cd/E62172_01/index.html


Attachments
This solution has no attachment
  Copyright © 2018 Oracle, Inc.  All rights reserved.
 Feedback